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Introduction 


Bernd Kortmann and Johan van der Auwera 


In much of 20th century and current linguistics, it could be argued, there is a strong 
Eurocentric undercurrent. The bulk of publications in linguistics still deal with 
European languages, especially the “big and rich" languages (including their var- 
ieties spoken in other parts of the world), or use data from (one of) the major Euro- 
pean languages (notably English) for illustrative purposes in contributions to 
linguistic theory. Many of the theoretical frameworks within which these and other 
publications in theoretical or descriptive linguistics have been written have either 
recently been developed at European research institutions or stand in the tradition 
of European schools of linguistic thought of the 20th century or even earlier 
centuries. Also the bulk of the big linguistics publishers are located in Europe. The 
present volume makes Eurocentricity its programme! Its 49 chapters, authored by 
top experts in their fields, offer unique state-of-the-art descriptions of a large var- 
iety of issues exclusively relating to the languages and the linguistics of Europe, 
including an account of how both the linguistic and the linguistics landscapes of 
Europe have become what they are today. 

The volume is organized into five sections. The first of these is called “Typol- 
ogy of European languages' and deals with the synchronic grammar of language 
families and languages. The second section, "Areal typology and language con- 
tact', describes grammatical, historical and societal issues that connect with lan- 
guage contact. Societal issues are also the focus of the third section, "Language 
politics and policies in Europe’. The fourth section, ‘History of European lan- 
guages', takes us from linguistic prehistory to the 21st century, with a focus on 
grammar as well as society relating to major historical events and developments in 
major multilingual states in modern Europe. The last section, *Research traditions 
in Europe', shifts the focus from the linguistic landscape of Europe to European 
linguistics. It discusses the theories and practices of various major European 
schools of language-related thought from Classical Antiquity onwards, their rel- 
evance in the past and, for many of them, their continuing relevance for the study 
of language as practiced in Europe today. The book as a whole subscribes to the no- 
tion of *Europe' in a wide (geographic) sense, taking the reader all the way to the 
Ural mountains and the Caucasus. Within that wide perspective, however, some 
chapters deal with a more narrow concept, restricted to the western and central 
parts of the *wide' Europe. 

‘The typology of European languages’ has fourteen chapters. Half of them 
deal with language families, viz. Baltic, Celtic, Germanic, Romance, Slavonic, 
(European) Uralic and (European) Turkic. There are also chapters on Caucasian 
languages (Chapter 6) and on Sign Languages (Chapter 14), even though they do 
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not form a family in the genealogical sense. Five languages get a chapter of their 
own, viz. in alphabetical order: Albanian, Basque, Greek, Maltese, and Romani 
(Chapters 9 to 13). The focus in all of these chapters is on morphology and syntax. 
Additionally, all chapters furnish information on where the languages are spoken 
and on the number of speakers, sometimes also on the official or non-official status 
of the languages. Most chapters also address phonetics and phonology and provide 
a typological profile. 

*Areal typology and language contact? has four subsections. The first one, 
* Areal typology', addresses a field of research that was sparked off in large part by 
the EUROTYP project (“Typology of European Languages") in the early 1990s, 
which in turn gave rise to more regional research projects on linguistic conver- 
gence areas in Europe. Chapters 15 to 18 deal with the nature, the members, the 
properties and the origin of four language areas (in the sense of Sprachbund) or 
contact superposition zones in Europe: Standard Average European, and the Bal- 
kan, Baltic and Mediterranean areas. Language contact is also discussed in the 
second subsection, where the focus is either on the grammar or the socio-historical 
dimensions. The six chapters (Chapters 19 to 23) are devoted to Northern Europe, 
Eastern and Northeastern Europe, Southwestern Europe, South-Eastern Europe, 
and minority languages in Central and Southern Europe. ‘Minority languages’, old 
and new, are the focus of Chapters 24 and 25 respectively and constitute the third 
subsection, whereas the general topic of the fourth subsection, ‘Non-standard var- 
ieties’, is addressed in Chapters 26 to 28. The first of these chapters offers a socio- 
linguistic typology of the standard-dialect contact, the second deals with border 
effects in European dialect continua, and the third illustrates how the better areal 
typology of Europe should pay close attention to non-standard varieties. 

‘Language politics and language policies in Europe’ figure in eight chapters 
(Chapters 29 to 36). The first of these deals with the European Charter on Regional 
or Minority Languages, drafted by the Council of Europe in 1992. The second 
chapter is concerned with language policies at the Germanic-Romance border in 
Belgium, Luxemburg and Lorraine. Chapter 31 discusses various aspects of fem- 
inist language politics in Europe. The special position of English is described in 
Chapters 32 and 33, dealing with its role in international and intranational public 
discourse respectively. The former addresses, in particular, the (unfounded) fear 
that English might pose a threat to multilingualism in Europe, while the latter takes 
a close look at the situation in Finland. In the following chapter, the language use 
and language policies in European politics, notably in the political institutions of 
the European Union, stand at the centre of interest. Chapter 35 explores the relation 
between the church or, rather, the Armenian, Catholic, Protestant and Orthodox 
churches and the development of national languages, while the final chapter deals 
with the teaching of languages in European schools. 

The ‘History of European languages’ is shed light on in eight chapters 
(Chapters 37 to 44). The first deals with prehistory, the position of Vasconic (the 
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family that now has Basque as sole member), and the arrival of Uralic and Indo- 
European. The role of the migrations following the fall of the Roman Empire and 
the role of Renaissance both get a chapter. Chapters 40 to 43 deal with issues of 
multilingualism and linguistic nationalism in multilingual empires and states and 
their successor states, viz. the Habsburg monarchy, the Ottoman empire, Tsarist 
Russia and the Soviet Union, and Yugoslavia. Chapter 44 focuses on the role of 
language for the European nation state. 

After four sections addressing the linguistic landscape of Europe, it is in the 
fifth and last section *Research traditions in Europe' that the discipline of lin- 
guistics takes centre stage. Chapters 45 to 49 offer insightful (and in part insider) 
accounts of linguistic research up to the 18th century, the linguistics of the 19th 
century, European structuralism, European functionalism, and the reception and 
development of generative grammar in Europe. 

Despite its 49 chapters, this volume does not cover everything that one would 
want to read about when it comes to European languages and linguistics. None of 
the chapters is of course exhaustive; the reference lists, in particular, are purposely 
kept to a minimum. This book is therefore neither an encyclopedia nor a bibli- 
ography. And yet it offers a fairly comprehensive picture, its 49 expert and up to 
date reports presenting, on the one hand, an exciting reflection of linguistic expert- 
ise on Europe at the beginning of the 21st century and, on the other hand, implicit 
(and sometimes even explicit) research avenues for the immediate future. 

Spontaneously, it may seem to many that the languages spoken in Europe, Eu- 
rope as a communication space, and linguistics as practiced in the “Old World" are 
all too well known. It is our sincere hope that this volume will cause its readers 
to reassess this assumption, and to acknowledge the many fascinating facets of 
the linguistic(s) landscape of Europe not or little known outside a small community 
of experts. We are optimistic that this first volume in the World of Linguistics series 
will be an eye-opener — as it was in many respects an eye-opener to us editors — 
to anyone ready to let themself in to a fresh look at “Old Europe". This “anyone” 
may be any individual, professional linguist or not; but especially for students and 
scholars involved in the growing number of linguistics programmes putting Eu- 
rope at their centre, several of this volume's sections and chapters will serve as 
ideal starting-points and points of reference. 
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4.4. The complex sentence 

5. Suggestions for future research 

1. Introduction 


Baltic is a branch of Indo-European comprising two living languages, Latvian and 
Lithuanian, as well as one extinct language, Old Prussian. They are spoken in the 
Republics of Latvia and Lithuania respectively. 

Both are attested from the 16th century, literary standards were established in 
the late 19th century for Latvian and the early 20th century for Lithuanian. Latvian 
is now spoken by about 1.5 million people in Latvia and about 150,000 abroad; 
Lithuanian is spoken by 3 million speakers in Lithuania and about 170,000 abroad. 
Old Prussian, once spoken by the autochthonous population of East Prussia and at- 
tested in a few 16th century printed texts, became extinct in the early 18th century. 

Within Indo-European, Baltic shows the closest association with Slavonic. On 
the grounds of a number of common innovations a Balto-Slavonic ancestor lan- 
guage is usually posited between Indo-European and historically attested Baltic 
and Slavonic. Many structural similarities springing from areal links seem to have 
contributed to the general similarity between Baltic and Slavonic. 

A Fennic substratum has strongly affected the development of Latvian (Fennic 
Livonian was ousted by Latvian in historical times), much less that of Lithuanian. 
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Slavonic languages (Old Russian, subsequently Belorussian, Russian, and Polish) 
have acted as adstratum languages for Lithuanian as a whole and (mainly) for the 
eastern (High Latvian) dialects of Latvian. For Low Latvian (and the standard Lat- 
vian language based on it) the main adstratum language has been German (initially 
Low German, from the 16th century onward High German). In the 19th and 
20th centuries, Russian and German (to a lesser extent, Polish) have marked their 
influence on Lithuanian and Latvian as languages of administration, education etc. 

Standard Lithuanian and Latvian are now well established as literary languages 
and they are used throughout public life. Alongside standard Latvian, there has 
been a tradition of writing in High Latvian or Letgalian (in Eastern Latvia, former 
Polish Livonia) since the 18th century. Its functional scope is restricted, being 
mainly associated with the pastoral activities of the Roman Catholic Church, but it 
enjoys a certain degree of official recognition. 


2. Phonetics and prosody 


pA The vowel systems of Latvian and Lithuanian 


The vowel systems of both Baltic languages are relatively simple. There are opposi- 
tions of long and short vowels. For Latvian, long and ultra-long vowels could be dis- 
tinguished (as in neighbouring Estonian), but this distinction is a concomitant of pitch. 

Some degree of vowel reduction in unstressed syllables occurs in both Lithua- 
nian and Latvian. In Lithuanian, unstressed long vowels are realised as semi-long 
or even short. In Latvian, unstressed long vowels are retained but short vowels in 
final syllables often undergo unvoicing; they are either whispered or completely 
lost, e.g., diena ‘day’ may become [diona] or [dion]. 

Both Lithuanian and Latvian have rich systems of diphthongs, both with rising 
degree of sonority, like Lith. and Latv. [iə] («ie»), [uo] (Lith. «uo», Latv. <o>) and 
with diminishing degree of sonority, like Lith. and Latv. [ai], [ei], [au] etc. In Li- 
thuanian, the articulation of falling diphthongs is highly sensitive to pitch, falling 
and rising pitch being realised as prominence of the first or second element respect- 
ively, cf. Lith. laikas [loi-kas] ‘time’ vs. /diskas [la-ifkos] ‘leaf, letter’. This is now 
the most salient feature associated with pitch; in monophthongs pitch oppositions 
are increasingly levelled out. In Latvian, pitch oppositions are more stable; they do 
not notably affect the quality of vowels and diphthongs. 


2:2: Consonantal systems 


The consonant systems are relatively rich, especially in Lithuanian, which has a 
correlation of non-palatalised and palatalised consonants. In addition to sonorants, 
stops, and fricatives, there are affricates such as /ts/and /tJ/; these are usually de- 


The Baltic languages 5 


scribed as a separate type of articulation though there is no phonetic opposition be- 
tween them and the sequences /ts/, /tf/ etc. on morpheme boundaries. The system 
of fricatives is traditionally poor but has recently been supplemented with new bor- 
rowed phonemes such as /f/ (Lith. fonas ‘background’) and /x/ (Lith. chaosas 
*chaos") (until recently /p/ and /k/ used to be substituted for them). 


23. Palatalisation 


Palatalisation has played an important role in Baltic phonetics. It is most pro- 
nounced in Lithuanian, which has both correlated series of non-palatalised and pa- 
latalised phonemes and (under Slavonic influence) assimilatory palatalisation of 
all consonants before front vowels, e.g., nesime ['nae[unae] ‘we will carry’. A 
similar situation (also induced by Slavonic influence) is observed in High Latvian 
but in the other dialects and in Standard Latvian there is no palatalisation as a sec- 
ondary articulation, only a series of palatal consonants comprising /c/, /}/, / n/ and 
/A/; neither is there assimilatory palatalisation before front vowels. 


2.4. Quantity 


Vowel quantity is distinctive in any position in both Lithuanian and Latvian, cf. 
Lith. liepa [liopa] ‘lime-tree (NoM.sc)' vs. liepa [liopa] ‘id. (Acc.sG)’. In stressed 
syllables, length is obligatorily accompanied by pitch variation, cf. 2.3.3. 

Consonant quantity is well-developed in Latvian as a result of Fennic substra- 
tum influence. Sonorants show distinctive quantity mainly in loanwords, cf. manna 
[man:a] ‘manna’ vs. mana [mana] (NOM.sG.FEM of Ist ps. sg possessive pronoun). 
Non-distinctive quantitative variation in obstruents occurs in native words: im- 
mediately post-tonic voiceless obstruents are automatically lengthened between 
short vowels, cf. lapa [lap:a] ‘leaf’ vs. lāpa [la:pa] ‘torch’, lapā [lapa:] ‘leaf 
(Loc.sc)'. In Lithuanian, there is no consonantal quantity and on the morphemic 
boundary geminates are shortened. 


2.5; Stress 


Lithuanian has unbounded word stress, cf. likime ‘let’s stay’: likime ‘destiny 
(voc.sG)’: likimé ‘destiny (Loc.sc)'. Latvian has generalised initial stress under 
Fennic substratum influence and allows but few exceptions from this principle. 


2.6. Pitch 


Both Latvian and Lithuanian are pitch languages. In Lithuanian, stressed long vo- 
calic segments (long vowels, diphthongs, and sequences of vowel plus sonorant) 
show a distinctive opposition of rising and falling pitch, cf. kaftų *time:GEN.PL' vs. 
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kdrty *hang:IRR.3'. In standard Latvian (and some of the dialects), long vocalic se- 
quences (of the same type as in Lithuanian) distinguish three varieties of pitch: 
‘even’, ‘falling’, and ‘broken’ (‘broken pitch’ being falling pitch with superadded 
glottalisation). They are fully differentiated in stressed syllables only; unstressed 
syllables have an opposition of glottalised and non-glottalised long vocalic seg- 
ments. Segments with ‘even’ pitch are ultra-long. Neither Lithuanian nor Latvian 
mark pitch in their standard orthography. 


3. Morphology 
3.]. Inflectional morphology 
3.1.1. The verb 


3.1.1.1. Person and number 


The Baltic verb marks person and number, but in the 3rd person there is no number 
distinction: Lith. medis/medZiai auga ‘a tree/trees grow(s)’. In Latvian, non-dis- 
tinction of person is also observed (in the irrealis, in some dialects also in the 
realis). 


3.1.1.2. Tense 


The Baltic verb has distinct forms for present, past, and future: Lith. aug-o ‘grew’, 
aug-a ‘grows’, aug-s ‘will grow’. The future has modal overtones (expression of 
probability). The past does not have irrealis function. Most Lithuanian dialects as 
well as standard Lithuanian have a special habitual past missing in Latvian: besides 
expressing iterativity, it suggests that the situation described is not continued into 
the present, e.g., sededavo ‘used to sit’. For compound tenses cf. the verb phrase 
(section 4.1). 


3.1.1.3. Mood 


Both Latvian and Lithuanian have a mood system comprising, in addition to the in- 
dicative and imperative, an irrealis (also called conditional, subjunctive, optative), 
which is used in counterfactual conditional clauses, volitional complements, ad- 
verbial purpose clauses, etc. Differences between Lithuanian and Latvian show es- 
pecially in complementation: Whereas Lithuanian requires the irrealis in volitional 
complements, Latvian allows realis vs. irrealis variation to mark expectations as to 
the degree of likelihood of the envisaged event: 
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(1) Lith. 
Noriu, kad tu man padetum. 
want:PRS.1sG COMP 2sG:NOM IsG:DAT help:iRR.2sG 


(2) Latv. 
Gribu, lai tu man palidzi / palidzetu. 
want:PRs.1sG COMP 2sG:NOM lsa:bar help:iND/IRR.2sG 
‘I want you (would like you) to help me.’ 


3.1.1.4. Modal markers 


A peculiarity of Latvian is the debitive, an inflectional form of the verb used to 
express all types of necessity (dynamic, deontic, and epistemic). It is (at least 
within Indo-European) a rare instance of a modal predicate being expressed inflec- 
tionally. It consists of an uninflected form with the prefix ja- and the auxiliary ‘be’. 
The agent is in the dative, the object in the nominative, sometimes in the accusative 
(in the case of 1st and 2nd person pronouns and reflexive pronouns; the accusative 
is spreading in other conditions as well). The debitive is traditionally called a 
mood, but this is hardly satisfactory as it can itself be marked for several moods. 
Tense and mood of the debitive are marked on the auxiliary: 


(3) Latv. 
Janim būtu jādzer ūdens. 
John:DAT be:ırr drink:DEB water:NOM 
‘John would have to drink water.’ 


The etymological meaning of man jādzer ‘I must drink’ is ‘I have which to drink’, 
the prefix ja- deriving from the Indo-European relative pronoun *ye/o-. 

The description of the debitive is a matter of controversy. It has been described 
as a voice because of the reassignment of case marking it induces (Nau 1998), but 
this is unconvincing as the debitive’s function is purely modal. The most obvious 
solution, then, is to associate the reassignment of case marking with the bound 
modal lexeme jd-. This bound lexeme could then be compared to an incorporated 
modal verb (like Hungarian -hat- ‘be able’), except that its incorporation is partial 
and coupled with periphrasis. 


3.1.1.5. Aspect 


Both Latvian and Lithuanian mark aspect in the verb, but its markers are deri- 
vational and are therefore dealt with in 3.2.4. 
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3.1.1.6. Evidentiality 


Both Latvian and Lithuanian have developed a system of evidential forms, which 
are halfway between morphology and syntax (see section 4.1.3). 


3.1.1.7. Voice 


Both Latvian and Lithuanian have developed middle voice (‘reflexive’) forms de- 
rived by accretion of an enclitic reflexive pronoun to the verb. These middle voice 
forms (often referred to as ‘reflexive verbs’ and listed separately in the diction- 
aries) are now formally and semantically distinct from the reflexive construction, 
which is based on separate (now always orthotonic) reflexive pronouns. The 
middle (reflexive) marker is enclitically attached word-finally or after the verbal 
prefix (the latter now only in Lithuanian). This development is shared with East 
Slavonic though in Baltic the new middle voice forms have not (except in a few 
dialects strongly influenced by Slavonic) acquired passive meaning, as they have 
in East Slavonic: The passive is periphrastic in Baltic, see section 4.3.9. 


(7) Lith. 
Ona maud-o-si. 
Ann:NOM bathe:PRs.3REFL 
“Ann is bathing.’ 
(8) Lith. 
Ona pamate save veidrodyje. 


Ann:NOM see:PST.3 REFL:ACC MIITOr:LOC.SG 
"Ann saw herself in the mirror.’ 


3.1.2. Nominals 


3.1.2.1. Number 


Both Lithuanian and Latvian distinguish singular and plural and have relics of the 
dual. Most Lithuanian dialects (and the standard language) now have dual forms 
for personal pronouns only, e.g. judu *you two' alongside jus *you' (with no match- 
ing dual verb forms). In Latvian the dual has disappeared as a separate number cat- 
egory; old duals survive as optional adnumerative forms used with numerals from 
2 to 9, e.g. ar divi zirgi *with two horses' alongside the regular plural ar diviem zir- 
giem. 

Baltic shows an exceptionally high frequency of pluralia tantum. Mass nouns 
are often plural, e.g. Lith. sultys ‘juice’, Latv. asinis ‘blood’. In Latvian, pluralia 
tantum also include action nouns and names of mental and physical states, e.g., 
gaidas ‘waiting’, slapes ‘thirst’, bailes ‘fear’. 
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3.1.2.2. Gender 


There is a binary (masculine-feminine) gender system for nouns. For animates, 
deviations of grammatical gender from natural gender are relatively few. Names of 
professions, actor nouns, attributive nouns, etc. usually have masculine and femi- 
nine endings, e.g., Latvian drst-s ‘male doctor’ vs. ärst-e ‘female doctor’, skaugis 
*envier (masc.)' vs. skauge 'envier (fem.)’. Animate nouns thus seem to inflect for 
gender. Expressive words referring to females, as well as hypocoristic forms of 
feminine names, can be grammatically masculine, cf. Latvian /evuks, deminutive 
of Ieva ‘Eve’, skukis ‘wench, slip of a girl’. 

Whereas in Latvian the binary gender system extends to all word classes, 
Lithuanian retains distinct neuter gender forms for adjectivals and some pronouns. 
In Latvian, the masculine gender of adjectivals is used for default agreement when 
there is no overt agreement controller, e.g. in the impersonal passive: 


(9) Latv. 
Par to ir gadäts. 
for this be:PRs.3 take_care:PTCP.PST.PASS.NOM.SG.M 
‘Care has been taken of this.’ 


In Lithuanian, the neuter forms of adjectivals (including participles) are mainly 
used for default agreement (though there are rare instances of noun-like use, as in 
daryti gera ‘do good’). They are used to agree with inanimate pronouns, e.g. 


(10) Lith. 
Tai yra nauja. 
that be:PRs.3 new:NOM.SG.N 
"That's new.’ 


Besides, they are used as default agreement forms when there is no overt agree- 
ment controller, e.g. in the impersonal passive: 


(11) Tuo jau pasirupinta. 
DEM.INSTR.SG already take_care: PTCP.PST.PASS.NOM.SG.N 
‘Care has already been taken of this.’ 


3.1.2.3. Case 


Baltic has rich case systems. Lithuanian has seven or eight cases: nominative, 
vocative, accusative, genitive, dative, instrumental, and locative (inessive) occur 
everywhere; the illative (mokyklon ‘to school’) is restricted to eastern High Lithua- 
nian. Some dialects retain relics of two other local cases, the adessive and allative 
(well attested in Old Lithuanian). This system of local cases was a Common Baltic 
innovation (perhaps induced by Fennic influence, cf. Laakso, this vol., 3.1.2.1 on 
Fennic). 
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Latvian has reduced the number of cases to six by abandoning the instrumental, 
which has phonetically coalesced with the accusative in the singular and with the 
dative in the plural. This has given rise to a peculiar pattern of prepositional gov- 
ernment. As the prepositional phrases replacing the bare instumental, such as ar 
māsu ‘with (the) sister’ vs. ar māsām ‘with (the) sisters’, appeared to contain dif- 
ferent cases governed by the same preposition for singular and plural, this pattern 
was extended to other prepositions: All prepositions now govern the dative in the 
plural, regardless of the case they govern in the singular, e.g., no mājas (GEN) ‘from 
the house’ vs. no mājām (DAT) “from the houses’. Though this testifies to the de- 
mise of the instrumental, there is a descriptive problem with the mainly idiomatic 
expressions containing original bare instrumentals: We have aizturétu elpu ‘with 
suspended breath’ (an instrumental formally identical with the accusative) along- 
side atriem soliem ‘with quick steps’ (an instrumental formally identical with the 
dative), both with the same instrumental function, which could best be accounted 
for by assuming that the instrumental case has been retained. This assumption, 
however, would not account for propositional government; it would only lead to a 
proliferation of cases (in addition to the instrumental, we would have to posit one 
more case, identical with the genitive in the singular and with the dative/instru- 
mental in the plural, in order to account for the above-mentioned forms no 
mdjas/no mājām) and would fail to capture the generalisation that all prepositions 
govern the same case in the plural. 


3.1.2.4. Definiteness 


The definiteness of the noun phrase is expressed, if at all, by definite or indefinite 
forms of the adjective. In Latvian, the definite forms are obligatory if the noun 
phrase is definite but in Lithuanian their use is inconsistent. The definite forms are 
used if it is the adjective that turns the noun phrase into a definite description, as in 
(12): 


(12) Lith. 
Paduok man melynqjq skarele. 
hand:IMPER 1SG.DAT blue:ACC.SG.F.DEF scarf:ACC.sG 
"Hand me the blue scarf.’ 


This example suggests that the scarf is identified by its colour. The deictic definite- 
ness of the noun phrase in (13), however, is not marked in the form of the adjective 
(as it would have to be in Latvian): 


(13) Lith. 
Paduok man tq melynq skarele. 
hand:IMPER 15SG.DAT that:Acc.sG.F blue:ACC.SG.F.INDF scarf:ACC.sG 
‘Hand me that blue scarf.’ 
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On the other hand, definite adjectives can be used in indefinite noun phrases when 
they are part of generic definite descriptions like the red fox, the black thrush etc.; 
their generic definiteness is retained in the adjective even if a single specimen of a 
species is meant: 


(14) Lith. 
Cia niekada nesu matęs 
here never NEG-be:PRs.lsG see:PTCP.PRET.ACT.NOM.SG.M. 
rudosios lapės. 
red:GEN.SG.F.DEF fox:GEN.SG 


‘T ve never seen a red fox here.’ 


Nominally used adjectives and participles often have the definite ending as well, 
regardless of whether they function as definite descriptions (Latv. izredzétie ‘the 
chosen ones’) or as indefinite descriptions (Latv. kads aklais ‘some blind man’). 


3.1.2.5. Pronouns 


Pronouns distinguish roughly the same categories as other nominals. A striking 
feature of interrogative and indefinite pronouns is the partial lack of animacy 
distinctions: Lith and Latv. kas means ‘who?’ or ‘what?’ (though Latvian distin- 
guishes neviens ‘nobody’ from nekas ‘nothing’ vs. Lith. niekas ‘nobody, nothing’). 
For 3rd person personal pronouns Latvian has a distinction between animate and 
inanimate: vins/vina ‘s/he’ is, in principle, restricted to animates whereas the dis- 
tant demonstrative tas/td ‘that’ is preferred for inanimates (a feature also found in 
Fennic). The proximative deictic Sis/si ‘this’ can also be used as a personal pro- 
noun; in certain dialects, especially High Latvian, it is used as a logophoric pro- 
noun in combination with the evidential (quotative) form of the verb (Nau 2006). 


3.25 Derivational morphology 


3.2.1. Types of derivation 


Baltic has a rich system of derivation but makes relatively little use of compound- 
ing. Affixation comprises prefixation (Lith. is-eiti ‘go out’) and suffixation (Lith. 
vaikst-ine-ti ‘walk_about:ITER.INF’). Owing to the archaic character of nominal in- 
flection, which retains many inflection classes, derivation often involves paradig- 
matic conversion, i.e. rather than by a distinct suffix derivatives are characterised 
by transfer to another declension. What opposes Lith. laisve ‘freedom’ to laisvas 
‘free’ (in addition to change of pitch, which is also used as a derivational device) is 
transfer from the -a-/-o- stem adjectival declension to the -e- stem declension 
(treating -e- as a suffix is synchronically problematic given the high degree of 
fusion). 
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3.2.2. Derivation vs. genitival modifiers 


Latvian has less derivational devices than Lithuanian; it has few relational adjec- 
tives and mainly uses non-determiner genitives instead, e.g. zelta gredzens sold 
ring’ (the adjective zeltains means only ‘goldish’), latviesu valoda “Latvian lan- 
guage’, lit. ‘Latvians’ language’ (the adjective latvisks means only “Latvian in 
character’). Lithuanian regularly derives relational adjectives in -inis, like mo- 
kyklinis "school" from mokykla ‘school’, but also often gives preference to non- 
determiner genitives. 


3.2.3. Special derivational types 


A derivational peculiarity of Latvian is the class of 'genitival nouns', derived 
through univerbation of genitival modifiers of the type ‘adjective + noun’, ‘nu- 
meral + noun’, e.g. šaursliežu (dzelzcels) *narrow-track (railway), divstavu (nams) 
*two-storeyed (house)’ (from divu stavu nams ‘house of two storeys’). They have 
the appearance of nouns occurring only in the genitive but function like adjectives. 


3.2.4. Aspectual derivation 


Derivation performs a semi-grammatical function in the case of verbal aspect, 
which is expressed by means of preverbs. Usually, preverbs change both lexical 
meaning and aspect but in many cases their function is purely aspectual: Lith. ra- 
Syti “write (IPFV)’ vs. pa-rasyti “write (PFv)'. In Latvian, phrasal verbs, consisting 
of a simple imperfective verb and a local adverb, provide imperfective counter- 
parts for prefixed perfective verbs: näkt iekšā ‘come in (IPFV)’ vs. ie-ndkt “come in 
(PFv)'. Lithuanian makes little use of this device, but has recently started using ite- 
rative suffixes to derive secondary imperfectives after the Slavonic fashion: nu- 
reng-ine-ti *un-dress-ITER-INF (IPFV)' vs. nu-rengti *un-dress-INF (PFV)'. In com- 
parison with Slavonic, aspect is weakly grammaticalised in Baltic (for an overview 
of the Slavonic aspect system cf. Hansen, this volume, section 3.1.1). 


4. Syntax 
4.1. The verb phrase 


4.1.1. Compound perfect tenses 


Combinations of ‘be’ with past active participles function as compound tenses 
marking anteriority (a model also used in Slavonic and Fennic, cf. Laakso, this vol- 
ume, 3.1.1.1). Lithuanian also has a rudimentarily developed possessive perfect 
with ‘have’ and the past active participle, as in (16): 
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(15) Lith. 
Esu mates tq žmogų. 
be:PRS.1SG see:PTCP.PST.ACT.NOM.SG.M that man:ACC 
‘T ve seen that man before.’ 


(16) Lith. 
Turiu atsineses maisto. 
have:PRs.lsG bring:PTCP.PST.ACT.NOM.SG.M food:GEN 
‘T ve brought some food with me.’ 


4.1.2. Progressive forms 


Baltic does not have progressive tenses now but in Old Lithuanian a system of pro- 
gressive forms, consisting of ‘be’ with the present active participle, seems to have 
been developing as in OLith. bu kriksztiiqs ‘was baptising'. This development was 
never completed (perhaps due to the rise of derivational aspect, cf. 3.2.4) and its 
only relic in modern Lithuanian is a ‘preterite of interrupted action’, as in (17) (be- 
is a progressive particle proclitically attached to the verb): 


(17) Lith. 
Mes buvome beiseinq, 
we  be:PsT.lPL PROGR-gO OUC:PTCP.PRS. ACT. NOM.PL.M 
kai tu paskambinai. 
when you phoned 
“We were about to go out when you phoned.’ 


4.1.3.  Evidentials 


The use of participles or a series of participles instead of finite verb forms conveys 
evidential meanings. The relative tense expressed by the participle (anteriority, 
simultaneity, posteriority) then becomes the absolute tense (past, present, future) 
of the evidential construction: 


(18) Lith. 
Jonas isvaziaves i Vilniu. 
John:NoM depart:PTCP.PST.ACT.NOM.SG.M to Vilnius 
‘John has reportedly (evidently) left for Vilnius.’ 


In some cases, the participial endings have spread to forms marked for categories 
characteristic of finite verbs, such as mood, so that the resulting forms cannot be 
considered participles any more; for instance, Latvian and Lithuanian dialects have 
evidential varieties of the irrealis: 
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(19) Latv. 
Saka, ka, ja izņemtot visu akmeni, 
say:PRS.3 COMP if remove:IRR.OBL whole stone:Acc 
butot redzams ceļš uz elli. 


be:IRR.OBL visible way to hell 
‘They say that, if the whole stone were removed, the way to hell would be- 
come visible.’ 


Lithuanian has developed evidential constructions based on passive participles, 
absent from Latvian. They are superficially similar to passives but their subjects, 
though expressed by the genitive like the agent phrase of a passive construction, 
always occur sentence-initially, which is untypical of real agent phrases, and the 
evidential constructions are often based on verbs that do not normally passivise, 
e.g. unaccusative intransitive verbs. They may also contain two passive participles 
(the auxiliary ‘be’ also occurs in the form of a passive participle), which is impos- 
sible for a normal passive. For these reasons the passive-like evidential should be 
set apart from the true passive both formally and functionally. It is beyond doubt, 
however, that it draws on the passive for its formal shaping. 


(20) Lith. 
Mano ` buta uzmigta. 
1sG:GEN be:PTCP.PST.PASS.N doze-off:PTCP.PST.PASS.N 
‘I must evidently have dozed off.’ 


With regard to meaning, there is a clear difference between Lithuanian, where evi- 
dential forms can be inferential, mirative, or quotative, and Latvian, where the evi- 
dential is exclusively quotative (as in neighbouring Estonian). Whereas the Latvian 
evidential is fully alive, the marking of evidentiality is on the decline in Lithuanian. 


4.1.4. Verb reduplication 


Verb phrases with reduplication of the verb (the finite verb being replicated by an 
infinitive or participle) are frequent. They can have an intensifying meaning: 


(21) Latv. 
Berni gaidīt gaida Ziemsvētkus. 
child:NoM.PL wait:INF waitPRs.3sc Christmas:Acc 
“The children are impatiently waiting for Christmas.’ 


But they can also be used as a means of topicalising the meaning of the verb: 


(22) Lith. 
Dirbti dirba, bet mazai uzdirba. 
work:INF work:PRs.3sc but little | earn:PRs.3sG 
“Work he does, but he earns little.’ 
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In constructions with reduplication, the infinitive or participle occurs before the fi- 
nite verb. Otherwise, word order in the verb phrase shows considerable freedom. 


4.2. The noun phrase and adpositional phrases 


4.2.1. The noun phrase 


In the noun phrase, as opposed to the verb phrase, word order tends to be rigid. The 
order is predominantly modifier-head. Not only adjectives, possessive, and de- 
monstrative pronouns precede the noun but genitives as well: 


(23) Lith. 
Cia mano brolio dviratis. 
here 1sG:GEN brother:GEN.sG bike:NOM.sG 
“This is my brother's bike.’ 


This rule extends to subject and object genitives: 


(24) Lith. 
Jono triusiu auginimas 
John:GEN rabbit:GEN.PL breeding:NoM 
‘John’s breeding of rabbits’ 


The only adnominal genitive regularly following the noun is the partitive genitive, 
which evidently reflects the shift of the head noun to quantifier status: 


(25) Lith. 
Isgeriau taure vyno. 
drink:pst.1sG glass:Acc wine:GEN 
‘I drank a glass of wine.’ 


The rule of preposing the adnominal genitives allows of exceptions in Lithuanian but 
not in Latvian, where extraction of a possessive modifier from its prenominal posi- 
tion requires replacement of the genitive by the dative of external possession, e.g. 


(26) Latv. 
Anna dzivo otra ielas pusē. 
Ann:NOM live:PRs.3 other:Loc.sG.F street:GEN side:Loc.sG 
(27) Latv. 
Anna dzivo otra puse ielai. 


Ann:NOM live:PRs.3 other:Loc.sG.F side:Loc.sG street:DAT 
"Ann lives on the other side of the street.’ 


The semantic range of adnominal genitives is broad in Baltic, involving not only 
possessive but also predicative relationships: 
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(28) Latv. 
palaidna zens 
wanton person:GEN boy:NOM 
"a wanton boy’ 


Noun reduplication, i.e. addition of the same noun in the genitive, can be used to 
indicate considerable quantity: 


(29) Latv. 
Vins | izstaigaja zemju zemes. 
3sG.M roam through:PsT.3 country:GEN.PL country:ACC.PL 
‘He roamed through many countries.’ 


4.2.2. Adpositional phrases 


Both Latvian and Lithuanian have a number of prepositions going back to IE ad- 
verbs or adpositions. In several stages of the development of Baltic, adstratal (Fen- 
nic) influence has led to the rise of postpositional constructions. Old Lithuanian 
still had a system of local cases that arose from postpositional constructions, but 
modern Lithuanian has a clear predilection for prepositions. In Latvian, pervasive 
Fennic influence has led to the spread of postposition-like noun-based construc- 
tions. Part of them involve relational nouns such as prieksa ‘front’, apaksa ‘space 
under something’, etc. 


(30) Latv. 
Vina visu dienu stav spoguļa priekšā. 
3sc.F whole day:acc stand:PRs.3 mirror:GEN front:LOC 
‘She’s standing in front of the mirror the whole day. 


The use of a relational noun is also necessary if the whole adpositional phrase is 
governed by another preposition, as Latvian (unlike Lithuanian) allows no double 
prepositions of the type from under the bed: 


(31) Latv. 
Es izvilku kasti no gultas apakšas. 
lsg  draw-out:PsT.lsG chest from bed:GEN place_under:GEN 
‘I drew the chest from under the bed." 


Though functioning as adpositions, such relational nouns retain their full nominal 
properties. The genitive used with them can often be replaced by a dative of exter- 
nal possession (cf. 4.2.1, 4.3.4). This substitution is often motivated by word order, 
as it allows the noun to be extracted from its position before the relational noun 
(though there may also be semantic differences associated with this, mainly con- 
nected with the figure-ground relation): 
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(32) Latv. 
Kugis izbrauca pa tilta apaksu. 
ship:NOM pass:PST.3 by bridge:GEN place under:Acc 


(33) Latv. 
Kugis izbrauca pa apaksu tiltam. 
ship:NoM pass:PST.3 by place under:Acc bridge:DAT 
“The ship passed under the bridge.’ 


Adpositional functions are also performed, in Latvian, by relational adverbs hav- 
ing no constant position with regard to the noun (they can be preposed, postposed 
or even separated from the noun); they always govern the dative: 


(34) Latv. 
Vins gāja garām baznīcai / baznīcai garām. 
3sG.M.NOM go:PST.3 past church:par church:DAT past 
“He walked past the church.’ 


4.3. The simple clause 


4.3.1. Subject and object marking 


Intransitive subjects and direct objects may be marked, in addition to the structural 
cases nominative and accusative, by several other constructions introducing a 
quantificational marking. 


4.3.1.1. Partitive subjects and objects 


The genitive of indefinite quantity (partitive genitive) marks (non-generic) indefi- 
niteness on intransitive subjects and direct objects (now only in Lithuanian): 


(35) Lith. 
Ten  radau grybu. 
there find:pst.lsG mushroom:GEN.PL 
‘I found some mushrooms there.’ 


4.3.1.2. Subject and object marking and negation 


The genitive of negation is historically connected with the partitive genitive but 
has become dissociated from it; whereas for intransitive subjects the use of the 
genitive is still linked to the genitive of indefinite quantity (except with nera 'there 
is no(t)', where it has been generalised), in the case of direct objects the genitive of 
negation is now used without any exception in Lithuanian: 
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(36) Lith. 
Neatejo svečių. 
NEG-cOme:PST.3 guest:GEN.PL 
"There came no guests.’ 


(37) Lith. 
Nera Jono. 
NEG-be:PRs.3 John:GEN 
‘John is not there.’ 


(38) Lith. 
Nematau Jono. 
NEG-See:PRS.1SG John:GEN 
‘I don't see John.’ 


Latvian, which also possessed this type of oblique marking of subjects and objects 
in the past, has now completely abandoned it both as a marker of indefiniteness and 
with negation; only with nav ‘there is no ...’ the genitive is still regularly used. 


4.3.1.3. Distributive subjects and objects 


Another type of quantificational marking on objects and intransitive subjects is the 
use of a prepositional phrase with Lith. po, Latv. pa expressing distributive mean- 
ing (generalisation of a certain quantificational characteristic over a number of 
situations), in Latvian also rare or dispersed occurrence: 


(39) Latv. 
Parka aug pa Jpriedei. 
park:Loc  grow:PRs.3 PREP fir-tree:DAT 
‘In the park a fir-tree grows here and there.’ 


4.3.2.  Predicate nominals 


A feature shared with neighbouring languages (Slavonic on the one hand and Fen- 
nic on the other hand) is the use of oblique marking for predicate nominals. In 
Lithuanian this is the instrumental. It is said to be opposed to the nominative as a 
marker of short duration of the predicated property, but nowadays it is mainly used 
(without semantic overtones) for default marking in clauses where no easily ac- 
cessible agreement target is available: 


(40) Lith. 
Apsimesti ligoniu nebuvo sunku. 
pretend-to-be:INF sick person:iNsTR NEG-be:PsT.3 difficult 
‘It was not difficult to feign illness (lit. pretend to be ill).’ 
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Latvian also has oblique marking of predicate nominals, but in connection with the 
demise of the instrumental (cf. 3.1.2.3), a prepositional construction with par ‘for, 
in lieu of’ is used instead. 


(41) Latv. 
Janis kluva par arstu. 
John:NoM become:PsT.3 for doctor:Acc 
‘John became a doctor.’ 


Unlike the Lithuanian instrumental, the Latvian construction with par is never 
used for default agreement; rather, it is employed in more marked predicative con- 
structions, such as complementary secondary predicates, or when a noun rather 
than an adjective functions as a predicate nominal. 


4.3.3. Secondary predicates 


Adjectivals functioning as resultative secondary predicates are frequent in Latvian 
but quite exceptional in Lithuanian, where adverbs are used to express resultative 
secondary predication: 


(42) Latv. 
Meitene nogerbas plika. 
girl:noM strip:PsT.3 naked:NoM.sG.F 


(43) Lith. 
Mergina nusirenge plikai. 
girl:noM strip:PsT.3 naked:ADv 
“The girl stripped naked.’ 


We can discern areal links here, as adjectivals used as resultative secondary predi- 
cates occur in Fennic but are absent from Slavonic. 


4.3.4. External possession 


Both languages use the dative of external possession but the scopes are different. 
While in Lithuanian it is mainly restricted to animates, in Latvian virtually every 
restriction has been lifted, and the motivation for using datives often seems to be 
the possibility of extracting the possessor from the immediately prenominal posi- 
tion to which the possessive genitive is rigidly tied. 


(44) Lith. 
Mama nuprause vaikui rankas. 
mum:NoM wash:PsT.3 child:paT hand:ACC.PL 
‘Mother washed the child's hands.’ 
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(45) Latv. 
Vini pacēla vāku naudas lādei. 
3PL:NOM lift:pst.3 lid:acc money:GEN coffer:DAT 
‘They lifted the lid of the money coffer.’ 


4.3.5. Null subjects 


Baltic makes ample use of zero subjects. Latvian has two types of zero subjects: 
The first is morphosyntactically masculine plural (for purposes of agreement), as 
can be seen in (46), where the past active participle is substituted in evidential 
function for the preterite: 


(46) Latv. 
Manu dzivokli kratijusi. 
my flat:Acc search:PTCP.PST.ACT.NOM.PL.M 


‘It’s said they had searched my flat (my flat had been searched).’ 


In the second type (probably borrowed from Fennic), the zero subject is treated as 
masculine singular; it is non-referential (indefinite non-specific) and corresponds 
to English one: 


(47) Latv. 
Varēja traks palikt no dusmām. 
can:PST.3 mad:NOM.MASC.SG become:INF with anger 
‘One could have got mad with anger.’ 


4.3.6. Oblique subjects 


Non-nominative-marked quasi-subjects are frequent, as in Slavonic (cf. Hansen, 
this volume, 4.3). Genitive-marked quasi-subjects alternate with nominative sub- 
jects (cf. 4.3.1). Many predicates occur with dative-marked quasi-subjects en- 
coding experiencers: 


(48) Lith. 
Man pagailo to vaiko. 
lsc.bAT be_sorry:PsT.3 that child:GEN 
‘I felt sorry for that child.’ 


4.3.7. Nominative objects 


Many dialects of Lithuanian (vestigially also Latvian) have nominative objects in 
infinitival complements if the construction is impersonal (i.e. there is only a dative 
marked quasi-subject): 
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(49) Lith. 
Man dabar reikia gyvuliai paserti. 
lsG:NoM now  be-necessary animals:NoM feed:INF 
‘I must feed the animals now.’ 


This construction seems to have originated through a kind of tough movement, but 
it is so strongly reminiscent of the use of nominative objects with the infinitive in 
Fennic that a connection has been assumed. Nominative objects also occur with the 
Latvian debitive (cf. 3.2.2), though they are being ousted by accusative objects. 


4.3.8.  Possessive constructions 


The possessive construction is of different types in Lithuanian and in Latvian. Li- 
thuanian has the verb tureti ‘have’ whereas Latvian has only ‘be’ with the dative 
(cf. Latin mihi est): 


(50) Latv. 
Tevam bija divi deli. 
father:DAT be:PsT.3 two sons:NOM 
‘Father had two sons.’ 


The construction with ‘be’ is Common Baltic and Lithuanian tureti ‘have’ seems to 
be a later development, but the Fennic substratum may have contributed to the re- 
tention of the “be’-construction in Latvian. The differences in structure of the pos- 
sessive construction are also reflected in the different shaping of possessive per- 
fects, cf. 4.1.1 and 4.3.9.3. The possessive “be’-construction has also given rise to a 
modal construction for necessity (or, in the negative, impossibility or prohibition) 
consisting of a dative nominal, the verb ‘be’ (often unexpressed) and the infinitive: 


(51) Lith. 
Mums jau važiuoti. 
lPL:DAT already go_by_vehicle:INF 
‘We should be off.’ 


(52) Latv. 
Tev nebus zagt. 
2SG:DAT NEG-be:FUT.3 steal:INF 
"Thou shalt not steal.’ 


4.3.9. Passives 


The passive is periphrastic in both languages. In Lithuanian it is based on either 
present or past tense passive participles, the difference being mainly aspectual. In 
Latvian, only the so-called past passive participle (which actually functions simply 
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as a passive participle) is used, and aspectual distinctions are conveyed only by the 
inherent derivational aspect of the verb (cf. 3.2.4). As an auxiliary, only būti ‘be’ is 
used in Lithuanian whereas Latvian uses verbs meaning ‘become’, mainly tikt, as 
auxiliaries. 


4.3.9.1. Agented and agentless passives 


In Lithuanian the passive may be either agentless or agented, the agent phrase 
being in the genitive. 


(53) Lith. 
Sis namas buvo pastatytas mano tevo. 
this house:NoM be:PsT.3 built:PTCP.PST.PAass my father:GEN 
‘This house has been built by my father.’ 


In Latvian, the passive proper is always agentless. An agentive genitive can, how- 
ever, be added to adnominal passive participles (54). A noun phrase with a parti- 
ciple modified by an agentive genitive may also be shifted to predicative position, 
yielding not a passive but a special kind of agentive construction referring not to 
the process itself but to the authorship of the completed action (55): 


(54) Latv. 
manis uzceltā mdja 
1sG.GEN build:PTCP.PST.PASS.NOM.SG.F house 
‘the house built by me’ 


(55) Latv. 
Te istabā visas lietas ir mana pasa 
here room:Loc.sG allthings:NoM be:prs.3 myself:GEN 
taisitas. 


make:PTCP.PST.PASS.NOM.PL.F 
“All things in this room are of my own making.’ 


The existence of an agentive construction while there is no agented passive is also 
reminiscent of a similar situation in Fennic (Finnish, Estonian). 


4.3.9.2. Impersonal passives 


Both Lithuanian and Latvian have impersonal passives referring to implicit human 
subjects: 


(56) Latv. 
Par to tiek gddats. 
of that become:prs.3 take-care:PTCP.PST.PASS.NOM.SG.M 
‘Care is being taken of that.’ 
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Lithuanian also has constructions superficially resembling impersonal passives, re- 
ferring to non-human (ambient) subjects, but they are actually evidential construc- 
tions which cannot be called properly passive, see section 4.1.3. 


4.3.9.3. Possessive passives 


Latvian has a kind of possessive passive (with a possessive dative) actually per- 
forming the function of a periphrastic perfect. While the possessive construction is 
based on ‘be’ rather than ‘have’, the possessive-based perfect of the type J have 
seen is passive, cf. (57). A similar construction occurs in neighbouring Estonian: 


(57) Latv. 
Man viss ir noskaidrots. 
lsc.bAT everything:NOM be:prs.3 figure out:PTCP.PST.PASS 
‘I have figured it all out.’ 


4.3.10. Word order 


Sentential word order shows a considerable freedom in both Latvian and Lithua- 
nian, being driven mainly by topic-comment structure. In sentences with unmarked 
word order the predominant pattern is SVO, with SOV as a less frequent variety. 


4.3.11. Clitics 


Clitics are now exceptional in Baltic. Whereas Old Lituanian shows relics of pro- 
nominal clitics, these have subsequently been eliminated by generalisation of the 
orthotonic variants: 


(58) OLith. 
Reykia mi eit. 
be-necessary:prs.3 1sG:DAT gO:INF 


(59) Lith. 
Man reikia eiti. 
lsc:bAT be-necessary:PRs.3 gO:INF 


This process is shared with East Slavonic. Also in common with East Slavonic is 
accretion of the old enclitic reflexive pronoun to the verb as a middle voice marker, 
cf. 3.1.1.7. 
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4.4. The complex sentence 


4.4.1. Complement clauses 


Complement clauses can be classified into three types: finite (obligatorily intro- 
duced by complementisers), infinitival, and participial. Participial complements 
mark coreferential subjects: 


(60) Lith. 
Jis suvokia esqs kaltas. 
3sG.M:NOM realise:PRs.3 be:PTCP.PRS.ACT.NOM.SG.M guilty 
‘He realises that he is guilty.’ 


The coreferentiality of the subjects of main and complement clause can be addi- 
tionally marked by using the middle-voice (reflexive) form of the verb of saying 
(logophoric middle): 


(61) Latv. 
Pat veci laudis teicäs tik bargu ziemu 
even old people:NoM say:PsT.3.REFL such harsh winter:acc 
neredzéjusi. 


NEG-See:PTCP. PST. ACT. NOM.PL.M 
“Even old people said they hadn't seen such a cold winter.’ 


The sentential complements of verbs of perception are either finite, introduced by 
Lith. kaip, Latv. kā ‘how’, or they contain an undeclinable participle (the equival- 
ent of the ACI construction of other languages): 


(62) Lith. 
Maciau, kaip ji nusišypsojo. 
see:PST.l1SG how 3sG.F:NOM smile:PsT.3 

(63) Mačiau ją besišypsant. 
see:PST.1SG 3SG.F:ACC PROGR-Smile:PTCP.PRS.INDECL 
‘I saw her smiling.’ 


4.4.2. Relative clauses 


Relative clauses are introduced by relative pronouns and adverbs, and typically 
occur after their heads. Relative clauses may be finite, infinitival, or participial. 
Both Lithuanian and Latvian have headless infinitival relative clauses expressing 
purpose, especially (but not exclusively) in existential sentences: 


(64) Lith. 
Atnesk antko ` atsisesti. 


bring:iMPER.2sG on what sit down:INF 
‘Bring something to sit on.’ 
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In Latvian, infinitival relative clauses (functionally relative purpose clauses) may 
be headed by a noun as well: 


(65) Latv. 
Vinam ir maja, kur dzivot. 
3sG.M:DAT be:PRs.3 house:NoM where live:INF 
‘He has got a house to live in.’ (‘in which to live’) 


Participial relative clauses are mainly used for coreferential subjects: 


(66) Latv. 
Es daru, ko varédams. 
1sc:NOM do:pst.lsG what:acc be able:PTCP.ACT.SEMIDECL.M.SG 
‘I do what I can.’ 


4.4.3. Adverbial clauses 


Adverbial clauses may be finite or participial. Participial adverbials are frequent in 
the case of coreferential subjects, but they may also be 'absolute', i. e. their subject 
may not coincide with that of the main clause, in which case they often have their 
own subject in the dative (this is called the ‘dative absolute"): 


(67) Lith. 
Mums _ besikalbant saule nusileido. 
]PL:DAT PROGR-talk:PTCP.PRS.INDECL sun:NOM go down:PsT.3 
“While we were talking, the sun went down.’ 


Such participial adverbials may also be introduced by prepositional subordinators: 


(68) Lith. 
lai atsitiko prieš man gimstant. 
that happen:pst.3 before 1sG:DAT be born: PTCP.PRS.INDECL 
“That happened before I was born.’ 


5. Suggestions for future research 


Owing to the traditional diachronic bias of Baltic scholarship, the Baltic languages 
are typologically underinvestigated, though they can boast a number of typologi- 
cally rare features with interesting theoretical implications. The issues raised by 
the demise ofthe Latvian instrumental (section 3.1.2.3) pose fundamental problems 
for any account of (prepositional) government as well as of case syncretism. The 
Latvian debitive poses interesting problems to morphological analysis (section 
3.1.1.4). The Baltic aspect system, which looks like the Slavonic aspect system in 
an embryonic stage, raises the question of how to measure grammaticalisation in an 
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aspect system with derivational markers. Non-canonical marking of grammatical 
functions is well represented in both Lithuanian and Latvian (it was hinted at in 
4.3.6 and also, in connection with the Latvian debitive, in 3.1.2.3). The Baltic evi- 
dentials deserve further investigation from an areal perspective, as should the in- 
teresting examples of logophoric devices mentioned in sections 3.1.2.5 and 4.3.1. 


Table of typological features 


Structural features Lithuanian Latvian 
animacy distinctions animacy distinctions partly ` animacy distinctions in 
lacking in pronouns personal pronouns 


gender distinctions 


(a) in the noun 2 2 

(controller gender) 

(b) in adjectivals 3 2 

(target gender) 
formal expression of yes, inconsistently expressed yes, expressed only in the 
definiteness in the form of the adjective form of the adjective 
number 

dual in personal pronouns relics of dual retained as 


adnumerative form 


evidential inferential, mirative, quotative only 
quotative 
aspect derivational, weakly derivational, weakly 
grammaticalised grammaticalised 
tense 
(a) absolute 3 values (present, past, 3 values (present, past, future) 
future) 
(b) relative yes yes 
(c) special possessive yes, weakly grammaticalised special possessive passive 
perfect 
(d) habitual yes (past) no 
(e) other past of interrupted action — 


mood and modality 


(a) mood system 3 values 3 values 
(b) inflectional marking of — special debitive form 
modality 
passive periphrastic, with ‘be’ periphrastic, with ‘be’ and 
‘become’ 
agent phrase in passive yes no 


construction 
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Structural features Lithuanian Latvian 

possessive construction of the ‘have’ type of the ‘be’ type 
resultative secondary exceptional frequent 

predicates 

adpositions mainly prepositions mainly prepositions, but 


supplemented by relational 
nouns and adverbs 


prepositional government different according to number 


non-canonical subjects datival quasi-subjects exceptionally high frequency 
of datival quasi-subjects 


partitive marking on frequent now obsolete 
subjects and direct objects 


non-canonical objects many genitive, dative and dative marked objects 
instrumental marked objects 


external possession yes, mainly restricted to yes, virtually no restrictions 
animates 
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2.1. Stress 
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2.4. Initial mutations 

3. Morphology 

3.1. Inflectional morphology 
32. Derivational morphology 
4. Syntax 


4.1. The verb phrase 
4.2. The noun phrase 
4.3. The simple clause 


4.4. The complex clause 
SÉ Typological Survey 
1. Introduction 


The modern Celtic languages are commonly divided into the British, p-Celtic lan- 
guages (Welsh, Breton, Cornish) and the Gaelic, q-Celtic languages (Irish, Scots 
Gaelic, Manx), based on the development of the Indo-European sound q". The fore- 
runner variety of Gaelic probably reached Ireland either via Britain or directly from 
the Continent in the first millennium BC (e.g. Schmidt 2002). Present day Irish (Ir) 
falls into the three main dialect areas Northern, Western and Southern, with West- 
ern (Connemara) Irish also linguistically occupying a middle position. According 
to the 2006 census, about 72,000 speakers use Irish on a daily basis and 1.66 mil- 
lion claim to have some knowledge of it (Central Statistics Office of Ireland, 2007). 

Scots Gaelic (ScGI) first arrived in Scotland with settlers from the north of 
Ireland around 500 AD, and there remained a common standard language until 
about the 17th century (Gillies 2002: 145, MacAulay 1992: 137). Currently, ScGl 
is mainly used on the west coast, and particularly on the western islands, i.e. the 
Inner and Outer Hebrides (cf. Gillies 2002: 146). In 2001 there were about 59,000 
Gaelic speakers in Scotland (General Register Office for Scotland, 2005). Emi- 
grants also brought the language to Nova Scotia, Canada. 

Modern Welsh (Wls), which belongs to the British language group, can be di- 
vided into two main dialect areas, northern and southern. In addition, four regions 
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are distinguished: northwestern Welsh, northeast and mid-Welsh, southwestern 
Welsh, and southeastern Welsh (Watkins 2002: 291—293). In 2001 there were ap- 
proximately 600,000 Welsh speakers in Wales (National Statistics Online, 2004). 

Modern Breton (Brt) developed from early British when British speakers 
migrated to the Continent in the 5th and 7th centuries (Jackson 1953: 14). It is com- 
monly divided into four dialects: Gwenedeg (Vannetais) in the south of Brittany, 
Kerneveg (Cornouaille) in central and western Brittany, Leoneg (Léon) in the 
northwest, and Tregerieg (Trégor) in the northeast. There is no agreed standard that 
could be referred to and different orthographic systems are in use (cf. Heinecke 
2003: 310-311). Humphreys (2002: 628) estimates the number of speakers at 
about 250,000 for 1991 (down from 615,000 in 1983). 

Manx and Cornish are spoken as revived versions today but have not been pre- 
served uninterruptedly. They will therefore not be treated in this chapter. 


2. Phonetics and phonology 


2.1. Stress 


There is no emphatic stress in Celtic; all modern languages use syntactic or mor- 
phological marking instead. Gaelic generally stresses the first syllable, in com- 
pounds stress may also fall on the second element, e.g. ScGl an taigh beag /aN ,toi 
‘beg/ ‘the bathroom’ [lit.: ‘the small house'] (Gillies 2002: 165). 

Northern Irish and Scots Gaelic de-stress other syllables, West Irish retains second- 
ary stress. Southern Irish stresses later vowels if they are long, e.g. the word ‘cup’ 
is ScGl, Northern Ir: cúpan, West: cupan, South: cupán ‘cup’. 

Welsh is generally stressed on the penultimate syllable. In addition to stress, sen- 
tence pitch occurs (Watkins 2002: 302). Like Welsh, Breton mainly has stress on 
the penultimate syllable. One dialect, Vannetais, shows stress on the final syllable. 
British differs from Gaelic in having a movable accent (cf. Ternes 1978: 204). 


2:2. Vowels 


The Gaelic languages have a long-short distinction throughout. The long vowels 
are marked by the length-marks Ir «^ and ScGl <> respectively. In unstressed 
syllables, Irish largely has /o/, ScGl also /io/ and /uə/ (Gillies 2002: 153). Diph- 
thongs mainly close on /a/, especially /io/ and Jual, 

Other diphthongs are frequently indicated by vowels plus consonants gra- 
phemically: /oi/ can be spelled «agh» or <adh>, /ou/ is <amh> or <abh>. Front or 
back location of vowels determines the quality of the surrounding consonants. The 
inventory of Welsh vowels has short variants of /a, i, i, e, o, o, u/. Except for /a/ 
these also have long variants (Thomas 1992: 327). Diphthongs are on /-i/, on /-i/, 
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and on /-u/. In the south, /i/ and /i/ have fallen together (Thomas 1992: 328, Wat- 
kins 2002: 299-300). 

Breton shows clear influence of French in its vowel system. Generally, it has 
the short vowels /i, y, u, e, ø, o, €, æ, w/ and /a/. All these, except for /9/ and /w/, 
also exist in nasalised variants. These can have phonemic long variants in stressed 
syllables (Heinecke 2003: 311). In addition, diphthongs and triphthongs have been 
Observed but differ in the dialects (ibid.). 

Overall, the Gaelic languages have more diphthongs as a side effect of the pa- 
latalized consonants. Breton is clearly influenced by French nasalisation and lip 
rounding. 


2.3. Consonants 


There are two separate systems in Gaelic and British. Irish Gaelic has voiced and 
voiceless stops /ptk/, /bdg/, /f/, /s/, /h/, liquids and nasals /l, r, m, n/. /I/ and /n/, 
in Scots Gaelic /m/ and /n/, also exist in strong forms. The strong forms regionally 
give way to lengthened vowels or diphthongs, e.g. gall /gaL/ ‘foreigner’ may 
be realized as /ga:l/ and /gaul/ (MacAulay 1992: 155). Stops also have fricativised 
parallels, indicated in spelling by a postscript <h>. All of these additionally exist 
in distinct palatalised forms. For the fricatives this leads to: «ph, ch» /f, c/, «bh, 
mh» /v/, «dh, gh» /j/, s /[/. Scots Gaelic voiceless stops have the additional 
feature set pre-aspirated and pre-velarised. Comparable phenomena can be ob- 
served in Icelandic. For a more detailed description of Scots Gaelic phonology 
see Gillies (2002: 154—157). The most notable feature of the British consonant 
system is the lateral fricative /4/ spelled «ll». Breton, like French, has a lateral 
palatal /A/ (Watkins 2002: 300, Heinecke 2003: 317). A survey of Welsh conson- 
ants can be found in Watkins (2002: 300) and Thomas (1992). Breton shows 
a similar distribution to Welsh as regards consonants but it has more palatals. In 
addition to the same set of plosives and nasals, it also has a palatal nasal /p/ and 
the palatals /f/ and /3. Further, there are the semi-vowels w and j (Heinecke 2003: 
311-313). 


2.4. Initial mutations 


All modern Celtic languages display the feature of grammatically conditioned in- 
itial mutations. These result from earlier external sandhi phenomena which have 
been grammaticalised during language history. The contexts in which initial mu- 
tations are typically found are: after verbal particles, prepositions, possessive pro- 
nouns, or some (case forms of) articles. The modern languages show three differ- 
ent types of initial mutation, four in the case of Breton. The realisations differ in 
each language but can be roughly summarized as lenition of initial fortis conson- 
ants to their lenis equivalents, nasalisation of initial consonants and/or vowels, and 
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fricativisation or h-proclisis. For further details see, for example, Mac Eoin (2002: 
109-113), Watkins (2002: 304—309), Gillies (2002: 166-169), and Stephens 
(2002: 359—364). 


3. Morphology 
3.1. Inflectional morphology 
3.1.1. The verb 


3.1.1.1. General characteristics 


The rather complex verbal systems of the earlier Celtic languages, involving 
features such as varied declension classes and infixation of object pronouns, have 
been given up in favour of less complex patterns. Vestiges of the earlier complex 
patterns are retained in some high-frequency modern forms. The moods are indica- 
tive, imperative, and subjunctive. The latter is vestigial, especially in British. 
While British retains person marking in inflected verb forms, the Gaelic languages 
have abandoned person marking to a larger degree and commonly use unmarked 
forms followed by nouns or pronouns. The grammatical labels used below are 
those of the Leipzig Glossing Rules, with the addition of MUT ‘initial mutation’ 
and VN ‘verbal noun’. 


(1) Ir Chonaic me sibh 
see. PRET PRON.1SG.OBL PRON.2PL.OBL 
‘I saw you’ (Mac Eoin 2002: 122). 


(2) Ir Chonaic  sibh me 
see PRET PRON.2PL.OBL PRON.1SG.OBL 
“You saw me’ (ibid.). 


This holds particularly true for northern dialects of Irish and for Scots Gaelic. 
Southern dialects, though, retain personal inflections to a larger degree (cf. Mac 
Eoin 2002: 126-128). In ScGl, special endings are used for the 1st singular and 
plural conditional, and the 1st and 2nd plural imperative and the relative (cf. Gillies 
2002: 186, 190; MacAulay 1992: 215). 


(3) Ir Mholfadh muid thi 
praise. COND PRON.1PL.OBL PRON.2SG.OBJ 
“We would praise you’ (cf. Mac Eoin, 2002: 127) 


(4) Ir Mholfaimis thi 
praise.COND.1PL PRON.2SG.OBJ 
“We would praise you’ (ibid.). 
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However, in strong verbs, i.e. original verbs with a more complex inflectional pat- 
tern than that of weak verbs, Scots Gaelic has retained an earlier distinction in the 
set of verbal endings used for dependent inflection, which is used if a particle pre- 
cedes the verb, and independent inflection, used in absolute initial position (Ma- 
cAulay 1992: 216, Gillies 2002: 190). The differences in verbal forms are shown in 
the following: 


(5) ScGl Gabhaidh am fear... 
take.PRES-FUT.INDEP ART man.OBL 
“The man will take’ (Gillies 2002: 190). 


(6) ScGl An gabh am fear ....? 
QPART take.PRES-FUT.DEP ART man.OBL 
“Will the man take?’ (ibid.) 


3.1.1.2. Flectional systems 


In the modern Celtic languages, the synthetic present is typically generic or habit- 
ual. For the verb ‘be’, separate actual forms are distinguished from the habitual 
ones. Further general synthetic categories are the imperfect and the preterite and the 
subjunctive. Irish and Breton both have a present and a past subjunctive. In Welsh 
these are also vestigially found for the verb bod ‘be’ (cf. Watkins 2002: 324). ScGl, 
on the other hand, uses a conditional based on the future and a past subjunctive for- 
mally identical to the imperfect. While Irish and Breton have a separate future 
tense, ScGl and Welsh use the original present tense also to indicate future. Welsh 
furthermore has a morphologically marked pluperfect. In order to illustrate this sys- 
tem, the stem forms of the Irish verbal stem mol- ‘praise’ may be given as follows: 
pres. molann, fut. molfaidh, subj. mola, pret. mhol. Derived from these are the ipfv. 
mholadh, cond. mholfadh, and pst. subj. moladh (cf. Mac Eoin 2002: 126-128). 


3.1.1.3. Periphrastic formations 


Periphrastic tense and aspect formations are used in all modern Celtic languages. 
They typically employ the Celtic equivalent to the infinitive, the verbal noun. Ob- 
jects ofthese nominalised constructions are expressed as genitives, or, if pronomi- 
nal, as possessive pronouns. The following are the most prominent ones: 


1. A progressive present, created analytically with a punctual form of the verb 
‘be’, the prepositions ‘at’ or ‘in’, and the verbal noun: 


(7) a.ir ta mé ag canadh 
b. Wis dw i "n canu 
be.PRES PRON.1SG. PART sing.VN 
‘I am singing.’ 
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ii. A perfect, created analytically with a form of the verb ‘be’, the prepositions 
‘after’, and the verbal noun. In Welsh this functions as a general preterite as well: 


(8) a. Ir ta mé tréis canadh 
b. Wis dw i wedi canu. 
be.PRES PRON.1SG after sing. VN 
‘I have sung.’ 


ii. In Welsh, Breton, and Scots Gaelic periphrastic preterites are formed with 
the verb ‘do’. In the British languages this periphrasis is also found in the 
present tense: 


(9) ScGl rinne mi codail 
do.PRET PRON.ISG.OBL sleep. VN 
‘I slept.’ 
(10) Brt Diskin a rae ar vugale gante. 


learn. NN PART do.PET.3SG ART MUT\children from.3PL 
“The children learned from them’ (Heinecke 2003: 315). 


iv. A periphrastic future can be formed with ‘be’, a purposive preposition, and 
the verbal noun: 


(11) ScGl tha sin ri dhèanamh fathast 
be.PRES this to MUT\do.VN still 
"That is to be done still’ (Gillies 2002: 205). 


v. In the Gaelic languages a further result-state perfect is created with the past 
participle of transitive verbs and the verb ‘be’: 


(12) Ir tá an leabhar | molta agam 
be.PRES ART book.OBL praise.PST.PTCP at.1SG 
‘I have praised the book.’ 


3.1.1.4. The verb 'have' 


Breton is the only Celtic language to have developed a verb *HAVE', endevout. Its ver- 
bal forms derive from the direct object pronouns, followed by the 3rd person singular 
of one of the forms of bezañ ‘be’ (Heinecke 2003: 314, Hemon 1995: 26, 51), e.g. 


3rd sg masc. en dues [PRON.3SG.M.OBJ. MUT\be.3SG.PRES] 
2nd pl. hoc'h eus [PRON.2PL.OBJ be.3SG.PRES] 
3rd pl. o dues [PRON.3PL.OBJ MUT\be.3SG.PRES] 


‘he has’, 
“you (pl.) have’, 
‘they have’. 
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3.1.2. The noun 


3.1.2.1, Number and gender in the noun 


All modern languages have masculine and feminine gender. In the Gaelic lan- 
guages the cases are unmarked, genitive, vocative, and vestiges of datives in fixed 
expressions. The British languages have given up case inflection completely. There 
are some vestiges of dual marking with a prefixed numeral ‘two’ in all languages, 
most prominently in Breton. The dual noun may be singular or plural marked and is 
preceded by the numeral daou (masc.) or di(v) (fem.), e.g. lagad ‘eye’, daoulagad 
‘two eyes’, or daoulagadou ‘two eyes’ with additional plural marking (Hemon 
1995: 24; Heinecke 2003: 313). 

While Welsh plural formation is very varied, Breton has largely standardised 
the plural morphemes -(i)où and -ed for animates and trees: Brt lenn, lennoù ‘lake, 
lakes’, merc’h, merc’hed ‘girl, girls’ (Hemon 1995: 22). Welsh and Breton further 
use collective nouns, mainly for plants and animals, and a singular is formed by ad- 
dition of the suffix Wls masc. -yn, fem. en, e.g. plentyn ‘child’ to plant ‘children’, 
Brt -enn, e.g. geot, geotenn ‘herbs, herb’, gwenan, gwenanenn ‘bees, bee’ (Hemon 
2003: 23). 


3.1.2.2. Pronouns 


The Celtic languages have separate subject, object, and possessive pronouns. In 
ScGl the former two have merged. Furthermore, there are special emphatic forms, 
used to indicate contrastive emphasis on pronouns or pronominal forms of prep- 
ositions, as in the following ScGl example: Tha mise ann, chan eil tusa ann. ‘I am 
here, you are not here.' To express reflexive notions, combinations of the personal 
pronoun and the particles ‘self-’ are used: Ir fein, ScGl fhèin, W1s hunan ‘self’, pl. 
hunain in the South, hun ‘self, -selves’ in the North. (Watkins 2002: 314), Brt unan 
‘one’, e.g. va-unan ‘myself’? (Hemon 1995: 35). 


3.1.2.3. Adjectives 


Attributive adjectives follow their head noun and in Ir and ScGl correspond to it in 
number, case, and gender. Predicative adjectives are also used; these have no con- 
cord with the noun (Gillies 2002: 202-203). In Welsh only some highly frequent 
adjectives show vowel alterations for gender: masc. w > fem. o, masc. y > fem. e, 
e.g. trwm 'heavy', fem. trom, gwyn 'white', fem. gwen. Plurals are formed by 
vowel alteration, or by suffixation of -(i)on. Gender and number concord is reces- 
sive. In Breton, adjectives are largely immutable, the male-female distinction is 
only exceptionally retained. Comparison is synthetically marked on the adjective, 
with additional analytic marking by preceding comparative or superlative par- 
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ticles, e.g. Ir is é an fear níos/is óige ‘he is the young-er/est man’. Equatives are 
marked by particles, e.g. Brt ken gwan ha(g) ‘as weak as’ [lit: ‘so weak and 
(Hemon 1995: 30). In less formal varieties of Welsh, comparison can also be ana- 
lytic. In that case the equative is marked with the adjective mor ‘great’, the com- 
parative with mwy ‘bigger’, and the superlative with mwyaf ‘biggest’, e.g. mor 
ddiog, mwy diog, mwyaf diog ‘as lazy, more lazy, most lazy’ (Watkins 2002: 312). 


3.1.3. Prepositions 


In the modern Celtic languages, if a preposition is followed by a personal pronoun, 
the two are contracted to what is typically termed conjugated prepositions, e.g. the 
preposition ‘at’, Ir ag, Sg. lst agam ‘at me’, 2nd agat ‘at you’, 3rd masc. aige, 3rd 
fem. aicci ‘at him/her’, Pl 1st againn ‘at us’, 2nd agaibh ‘at you’, 3rd acu "at 
them’. Secondary prepositions, derived from nouns during more recent language 
history, have no inflected forms. The person is expressed by possessive pronouns: 
Ir ar mo cheann, lit. ‘onto my head’, i.e. ‘towards me’, WIs ar eich hol lit. “in your 
track’, i.e. ‘after you’. 


3:2; Derivational morphology 


Modern Celtic languages use affixation and compounding. Prefixes are typically 
intensifying, e.g. Wls ad-, gor-, Wls negative af-, an-, dad-, di- or comparative 
cyd- (Watkins 2002: 329). Many noun-forming suffixes create either masculine or 
feminine nouns, e.g. Wls -es, -aeth, -as, -fa, or Ir -óig, -dil, -acht. British uses 
quantifying suffixes: Wls -aid denotes a particular quantity, e.g. cwpan-aid 'cup- 
ful’, sach-aid ‘sack-ful’, and -edig and -adwy form adjectives from nouns and 
verbs, e.g. caredig ‘kind’ related to caru ‘love’, ofnadwy ‘awful’ from ofn ‘fear’. 

Compounding mostly follows the normal syntactic ordering, the second el- 
ement shows initial mutation: Wls hindda ‘fair weather’ from hin ‘weather’ + da 
‘good’, Wis Llanfair ‘church of Mary’ from llan ‘church’ and Mair ‘Mary’ (Wat- 
kins 2002: 330). While the Gaelic languages have a number of prepositional verbs, 
British uses few of these and employs a wider variety of simple verbs (Ternes 
1978: 209). 


4. Syntax 


4.1: The verb phrase 


The verb phrase consists of (conjunct/relative particle) (tense marker) verb (per- 
son) (emphatic/contrastive suffix) (adverb). As the subject follows the verb, verbal 
complements are separated from the verb by the subject. Adverbs are formed by 
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adding a particle, Ir go, ScGl gu, Brt ez, Wls yn, in front of the adjective, e.g. Brt 
leal ‘loyal’ > ez leal ‘loyally’ (Hemon 1995: 65). 

Negations and questions are introduced by a negation or a question particle 
which cause initial mutations. The question particle is often deleted, leaving only 
the mutation as a marker: 


(13) a. I an mbeidh sí ann? 
b. Wis (a) fydd hi yno? 
PART MUT\be.FUT PRON.3F there 

“Will she be there?’ 


While Gaelic dialects do not necessarily mark subject-verb agreement, the British 
languages have restricted plural concord: if a subject is expressed, Modern Welsh 
marks plural on the verb only with a pronominal subject, not with a nominal one. In 
Breton the verb is in 3rd person singular whenever the subject precedes it, irrespec- 
tive of whether the subject is nominal or pronominal (Hemon 1995: 61). 


(14) Brt Ar baotred a wel al levr 
ART boys PART see.PRES.3SG ART book.OBL 
"Ihe boys see the book" (Heinecke 2003: 315). 


Historically, these structures derive from relative clauses. Conversely, in negative 
clauses the verb also shows subject agreement if the subject precedes the verb: 


(15) Brt E vreudeur ne hadjont ket 
POSS.3.M brothers NEG sow.PRET.3PL NEG 
*His brothers did not so' (ibid.). 


4.2. The noun phrase 


The modern Celtic languages have the overall order head-modifier. Thus, the noun 
phrase consists of (article) (numeral) noun (adjective) (demonstrative): 


(16) a. Ir an  t-aon fhear mór seo 
ART MUT\NUM MUT\nan big DEM 

b. Wis yr un dyn mawr hwn 
ART NUM man big | DEM 


‘this one big man’ 


Genitives and relatives also follow their head noun. Some frequent adjectives may 
also precede their noun, in which case they cause initial mutation. 

In Irish, Scots Gaelic, and Welsh, indefinite articles do not exist; only Breton 
has developed one during its language history. Otherwise, the unmodified noun is 
used, thus Ir fear ‘man, a man’. In Welsh the definite article y, with its postvocalic 
variant ’r and prevocalic variant yr, is employed for all numbers and cases. Ir and 
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ScGl use an as a general singular article. In the plural and the genitive singular 
feminine, there is a distinct form na, ScGl adds gen. pl. nan. Demonstratives fol- 
low the group article-noun. Irish, Scots Gaelic, and Breton have a three-term sys- 
tem of near, further, and distant deixis: e.g. Ir an fear seo/sin/siud ‘this, that, 
yonder man'. Welsh has a four-term system y dyn hwn/hwnna/hwnacw/hwnnw 
‘this, that, yonder, the non-present man’ (Watkins 2002: 313-314). 


4.3. The simple clause 
Irish and Scots Gaelic have a general VSO order: 


(17) ScGl Bha Tain a’ ceannach an leabhair 
be.PRET lain PART buyVN ART book.GEN 
‘Iain was buying the book’ (MacAulay 1992: 170). 


Some permutations of this rule are possible, however. For example, cleft construc- 
tions are frequently used in order to emphasize certain elements of the sentence: 


(18) ScGl Is mi a chunnaic thu 
COP PRON.ISG PART MUT\see.PRET. PRON.2SG.OBL 
‘It is I who saw you’ (Gillies 2002: 211). 


These structures may lead to some apparent exceptions to VSO order, e.g. in 
Welsh, where the subject precedes the copula and the predicate in identifying 
copula clauses (19a), or in focus structures (19b): 


(19) a. Wis John yw ’r brenin 
John COP ART king 
‘John is the king’ (Watkins 2002: 340). 


b. Wis brenin yw John. 
King | COP John 
‘It is a king that John is’ (Watkins 2002: 340). 


Focus clauses are distinguished by a relative verbal form, where available. They 
are cleft clauses which often show deletion of the initial copula: 


(20) Wls nhad sy "n cario "r rhaff 
MUTidad be.PRES.REL PART carry.VN ART rope.OBL 
*Father is carrying the rope' (Watkins 2002: 337). 


(21) Brt Yann a wel ar chi. 
Yann PART  see.PPRES ART dog 
*Yann sees the dog’ (Heinecke 2003: 315). 


In Breton, these former focus structures have grammaticalised to the extent that 
some scholars still consider VSO, with some SVO and OVS variants, to be the 
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basic order (cf. Heinecke 2003: 315). But other scholars (Heinecke 2003, Ternes 
1978) consider its basic order to be XV. Therefore, Breton arguably is the only ex- 
ception to the VSO order of modern Celtic. While Welsh has more focus structures 
than Gaelic, the Gaelic languages typically retain verb-initial status also in focus 
clauses, even though an initial copula may be formally absent. 

None of the modern Celtic languages has any separate passive forms. Instead, an 
impersonal form is used, to which an agent can be added in a prepositional phrase: 


(22) ScGl Rinneadh an t-òran le Tain. 
do.PRET.IMPERS ART MUT\song with lain. 
"The song was made by John' (Gillies 2002: 187) 


4.4. The complex clause 


The Gaelic languages have a high degree of paratactic ordering (cf. e.g. Gillies 2002: 
216) but some types of subordination can be found. While some subordinate sen- 
tence types are typically finite, such as conditional clauses, temporal clauses are typi- 
cally expressed non-finitely with prepositions and verbal nouns, as in (23) and (24): 


(23) Ir Chonaic mé Máirtín. 
see. PRET PRON.1SG Máirtín 
agus é ag caoineadh 
and PRON.3.M.OBJ at cry. VN 
‘I saw Martin when he was crying’ (Wigger 2003: 269) 
(24) ScGl Sheas Tain an-déidh 
MUT\stand.PRET Iain after. COMP.PREP 
do Anna suidh 
to Anna ` sit.VN 


‘Iain stood up after Anna sat down’ (MacAulay 1992: 163). 


In British, particularly in Breton, temporal and spatial subordinate clauses are 
more typically finite and marked by subordinating particles, such as Breton ma 
(Heinecke 2003: 316). 

Relative clauses differ for subject or object antecedent, on the one hand, and 
genitive or prepositional relations, on the other hand. In the former case, an in- 
flected verbal form is preceded by a verbal particle, commonly a or negative na, 
ScGl and Ir nach. Where available, a special relative verbal form used: 


(25) Ir sin í an bhean 
DEM.COP PRON.1SG.OBL ART MUT\woman 
a labhair liom 


REL.PART speak.PRET with. 1SG 
"That is the woman who talked to me’ (Wigger 2003: 266). 
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(26) Brt an den na welan ket 
ART person NEG.PART see.PRET.1ISG NEG 
"The person whom I do not see' (Hemon 1995: 75). 


Genitival and prepositional relative clauses typically use a subordinating particle 
followed by the verb. The preposition is in its normal position. Alternatively, the 
preposition can precede the relative particle. 


(27) a. Ir Is é an teach 
COP PRON.28GM.OBJ ART house 
a bhfuil an cat ann 


REL.PART be.PRES.DEP ART cat in.3SGM 
‘This is the house in which the cat is.’ 


b. Ir Is é an teach 
COP PRON.2SGM.OBJ ART house 
in a bhfuil an cat. 
in REL.PART be.PRES.DEP ART cat 


‘This is the house in which the cat is.’ 


Breton again takes a special position in being the only modern Celtic language to 
have developed reflexive verb forms. These are formed by placing en em before 
any verb in any person, as in (28): 


(28) Brt al laer a ZO en-em lazhet 
ART thief PART be.PRES.3SG REFL kill. PTCP 
‘the thief has killed himself’ (Hemon 1995: 61) 


5. Typological Survey 


The following typological generalisations can be made about the contemporary 
Celtic languages treated in this survey. The criteria used are drawn from those pro- 
posed by van der Auwera (1998), Kortmann (1998), and Haspelmath (2001). The 
feature set in Table 1 highlights properties that distinguish Celtic from SAE and the 
Celtic languages amongst each other. 
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Table 1: The Presence of linguistic features in Celtic 


Features Irish Scots Gaelic Welsh Breton 
Analytic expressions of tense and aspect Yes Yes Yes Yes 
Simplified case paradigms Yes Yes Yes Yes 
Definite and indefinite articles No No No Yes 
have and be auxiliaries No No No Yes 
Subject verb agreement marking None None Partly Yes 
Verb second order No No No Partly 
Phrasal order: prepositions & postposed genitives Yes Yes Yes Yes 
Standard of comparison: particle marked Yes Yes Yes Yes 
Intensifier-reflexive identity Yes Yes Yes No 
Participial passives Yes Yes No Yes 
Mainly nominative experiencers No No partly Yes 
Particle relative clauses Yes Yes Yes Yes 
Finite subordinate clauses Partly Partly Partly Partly 
Free initial subordinators Partly Partly Yes Yes 


This table illustrates that Breton is the modern Celtic language that is typologically 
closest to the Standard Average European core. While the other languages con- 
sidered do not feature indefinite articles, verb-second order, a have auxiliary, and 
restricted or no subject-verb agreement marking, these are all features of Breton. 
In the case of verb-second order, this is only arguably so, however. It seems re- 
markable that Breton, as the only modern Celtic language which is in contact with 
French rather than English, deviates most from the other Celtic languages. It is an 
urgent desideratum to investigate the mechanisms that have influenced these typo- 
logical developments. Generally, it must be noted that, with the exception of work 
by Ternes (e.g. 1978), comparative typological studies of the Celtic languages are 
still too rare. 
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1. Introduction 


The present-day Germanic languages spoken in northern and central Europe in- 
clude the following (with the estimated number of native speakers in parentheses): 
Icelandic (300,000), Faroese (50,000), Norwegian (4,800,000, with the two 
written varieties Riksmal/Bokmal and Nynorsk), Swedish (9,500,000), Danish 
(5,300,000), (Modern West) Frisian (300,000), Dutch (22,000,000), Luxemburgish 
(400,000), German (94,000,000), and English (63,000,000 in Europe, 420,000,000 
worldwide). The non-European Germanic diaspora comprises Afrikaans 
(4,000,000), Yiddish (5,000,000) and Pennsylvania German (700,000). (Cf. Har- 
bert 2007: 13-20.) 

On historical grounds, a distinction is drawn between North and West Ger- 
manic. The latter group comprises the Continental languages German, Dutch and 
Frisian on the one hand, and non-Continental English on the other. North Germanic 
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is divided into an Insular Scandinavian branch, comprising Icelandic and Faroese, 
and a Mainland Scandinavian branch consisting of Norwegian, Swedish and Dan- 
ish. 

The majority of Germanic languages have a written history that dates back to 
the Middle Ages and provides reliable information on their typological develop- 
ment. In what follows, the main emphasis will be on the morphology and syntax of 
the modern European standard languages. (Regarding phonology, the reader is re- 
ferred to Harbert 2007: 41—88, with references.) Historical facts will be referred to 
when relevant in connection with present-day typological variation. 

The Germanic language family was, from its first attestation as Runic Nordic 
and Wulfilian Gothic and well into the Middle Ages, characterized by typological 
unity and stability. Old Germanic was, in a traditional sense, highly synthetic (cf. 
Schlegel 1971 [1818]: 16), with all the languages sharing the same basic morpho- 
logical categories and syntactic characteristics. This period was followed by a 
(post-medieval) period of differentiation and a drift towards analyticity resulting in 
the structural and typological diversity that characterizes modern Germanic. 


2. Case marking and linear order 


2.1. Case marking 


There is a basic difference between (1) those modern Germanic languages that have 
case marking of full NPs, and (ii) those that only have a residual case opposition 
between a subject and a non-subject (oblique) form in certain pronouns (/ — me, 
etc.). (The modern s-genitive is a special phenomenon.) All ancient Germanic lan- 
guages belonged to the former type, possessing nominative, accusative, dative, and 
genitive case; in addition, Gothic and West Germanic had a separate instrumental 
form in a number of pronouns and masculine and neuter nouns (Harbert 2007: 
103-104). Today the traditional NP-case-marking type is represented by Icelandic 
and German, both of which have all four traditional cases, and by Faroese, where 
the genitive, being restricted to syntactically non-productive uses (Thráinsson et 
al. 2004: 433—434), is marginal. The other modern languages belong to the second, 
case-neutralizing type. 


2.2; Verb order 


The relevant verb order parameters are: (1) the position of the finite verb as first 
(V-1), second (V-2), third (V-3) (or later, but non-final), or final (V-final) position 
in the clause; (ii) the clause-medial or clause-final position of verb chains consist- 
ing of finite and non-finite verb forms; and (iii) the internal ordering of verb forms 
within verb chains (Askedal 2009: 12—20). 
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In main clauses, all the languages have V-1 in yes—no questions and V-2 in con- 
stituent questions and declaratives. V-1 exists as a secondary option in Icelandic 
“narrative” declaratives (1) (marginally also found in German joke telling: Kommt 
ein Mann zum Arzt und ...). V-3 (or later) in English is due to pre-subjectal fronting 
and certain adverbial placement rules (2): 


(1) Koma þeir nú að stórum helli og... (Ice) 
come they now to big cave and... 
‘Then they come to a big cave and ...’ 


(2) a. Suddenly he left. (Eng) 
b. Then he all of a sudden realized ... (Eng) 


Verb chains, i.e. sequences consisting of finite and non-finite verb forms constitut- 
ing a hierarchy of head-modifier relationships (Bech 1955: 25-30, 60-72), are 
clause-medial and uniformly right-branching in non-Continental Germanic. Cf. (3) 
(higher rank within the chain is indicated by lower superscript number): 


(3) a. She may! have? wanted? to do* this for a long time. (Eng) 
b. Ellers ville! hun aldri ha? besluttet? å forlate* ham. (Nor) 
otherwise would she never have decided to leave him 
‘Otherwise she would never have decided to leave him.’ 


In Continental West Germanic, main-clause verb chains have a clause-medial and a 
clause-final part (4): 


(4) a. Er wird! es schon verstanden? haben?. (Grm) 
he will it already understood have 
"Hell have understood it already.’ 
b. Hij kan! niets gezien? hebben? | hebben? gezien?. (Dut) 
he can nothing seen have / have seen 
“He won't have been able to see anything.’ 


The individual languages vary with regard to the position of the finite verb in sub- 
ordinate clauses. English and Icelandic stand apart in having essentially identical 
verb order in main and subordinate clauses (not counting the complementizer). The 
other languages show more or less radical main-subordinate clause asymmetries. 
Mainland Scandinavian and Faroese subordinate clauses are, with certain excep- 
tions (cf. Harbert 2007: 402), V-3. Continental Germanic has clause-final finite 
verbs; but whether this manifests itself as absolute clause-finality or as some other 
position within a clause-final verb chain is dependent on language-specific rules 
(Askedal 2009: 12-17). Frisian is most consistently left-branching, Dutch least, 
with German in between. Consider the examples (5)-(7), which illustrate lan- 
guage-specific constraints on 3-2-1 order (the a-examples), 3-1-2 order (the 
b-examples), and 1-2-3 order (the c-examples): 
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(5) a. .. omt ik moarn alles meand? hawwe? sil!. (Frs) 
.. because I tomorrow everything mown have shall 
... because tomorrow I shall have mown all of it.’ 
*... om't ik moarn alles meand? sil! hawwe?. (Frs) 
c. *... om't ik moarn alles sil! hawwe? meand?. (Frs) 


(6) a. ... weil er es schon verstanden? haben? wird!. (Grm) 
.. because he it already understood have will 
... because he'll have understood it already.’ 
.. Weil er es schon verstanden? wird! haben?. (Grm, regional) 
C. *... weil er es schon wird! haben? verstanden?. (Grm) 


(7) a. *..omdat hij niets gezien? hebben? kan!. (Dut) 


.. because he nothing seen have can 
b. ... omdat hij niets gezien? kan! hebben?. (Dut) 
.. because he nothing seen can have 
* ... because he can't have seen anything.’ 
c. ... omdat hij niets kan! hebben? gezien?. (Dut) 
.. because he nothing can have seen 
2:3: Interrelations of case marking and verb order 


According to Greenberg’s “Universal 41” (1966: 96), verb-finality (SOV) norm- 
ally implies the existence of a case system. No Germanic language has verb-final- 
ity as the only or dominant order in all kinds of clause types; therefore, no modern 
Germanic language is V-final in the sense of Greenberg’s “Universal 41”. A mod- 
ern Germanic adaptation of the Greenbergian parameters underlying the universal 
in question yields four groups: (1) Insular Scandinavian is morphologically conser- 
vative with regard to case marking and has V-2 order (with V-3 in subordinate 
clauses as an option in Faroese); (ii) in Mainland Scandinavian as well as in Eng- 
lish, case morphology is reduced to a basic subject-oblique opposition in a few 
pronouns and verb order is V-2 or V-3, depending on clause type in Scandinavian 
or on other syntactic factors in English; (iii) Dutch and Frisian show the same kind 
of reduction of case morphology as English and Mainland Scandinavian but com- 
bine this with verb-finality in subordinate clauses; (iv) German comes closest to 
conforming to Greenberg's "Universal 41" with its combination of conservative 
case marking as well as consistent verb-finality in subordinate clauses and clause- 
final non-finite verbs in main clauses. 


2.4; Other ordering relationships 


Ordering differences in syntactic constructions are most spectacular in the domain 
of verb order. In other kinds of construction, the individual languages are more 
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similar. In some cases, modifier-head order reigns supreme or is predominant (8a); 
in other cases, it is the other way around (8b) (with representative English 
examples): 


(8) a. Modifier-head order: 
Determiner-noun: this book; numeral-noun: five books; adjective-noun: 
beautiful books; adverb—adjective: extremely beautiful. 

b. Head-modifier order: 

Adposition (as preposition)-noun (NP): with this book; complementizer- 
subordinate clause: that she may thrive; comparative-standard of com- 
parison: greater than Shakespeare; noun-attributive prepositional 
phrase: women in love; noun-attributive clause: her hope that all be well, 
the hope that helped her survive. 


The comparatively high frequency of occurrence in German of expanded pre-nom- 
inal adjective and participle constructions that are functionally equivalent with 
relative clauses corresponds to the OV characteristics of this language. Cf.: 


(9 die vor einem Jahr eingeführten neuen Regelungen (Grm) 
the before one year introduced new regulations 
*the new regulations that were introduced a year ago' 


On the other hand, German deviates from OV expectations in having the post-nom- 
inal genitive as the unmarked variety (10a); the pre-nominal genitive (10b) is an ar- 
chaizing stylistic device: 


(10) a. die hóchsten Wünsche des Kanzlers (Grm) 
the highest wishes ^ the-GEN chancellor-GEN 
*the chancellor's highest wishes' 
b. des Kanzlers höchste Wünsche (Grm) 
the-GEn chancellor-GEn highest wishes 


3. Verb forms and verbal periphrases 


3:1. Inflectional categories of the verb 


In its earliest attested stage, Germanic possessed finite verbs marked for person 
(1, 2, 3), number (singular, plural, dual), tense (present, past), mood (indicative, 
subjunctive, imperative), and voice (active, passive). Dual and passive forms are 
only attested in Gothic (Braune and Heidermanns 2004: 145-146, 157-158 and 
146-148). The only morphological opposition that has been retained universally in 
modern Germanic is that between present and past (preterit) tense. 

Concerning mood, a distinctive 2 sg imperative form has remained everywhere 
except in English, where formal conflation with the infinitive has taken place. 


52 John Ole Askedal 


The indicative-subjunctive opposition has been strongly reduced or lost in all 
Germanic languages except in Icelandic and German. A temporal opposition be- 
tween the present and past subjunctive in accordance with consecutio temporum 
rules is retained only in Icelandic: 


(11) a. Jón  segir að Anna taki blöðin. (Ice) 
John says-PRES.IND that Anna takes-PRES.SUBJ papers-DEF 
‘John says that Anna takes (will take) the papers.’ 
b. Jón sagði að Anna tæki blöðin. (Ice) 
John said-PRET.IND that Anna took-PRET.SUBJ papers-DEF 
‘John said that Anna took the papers.’ 


In German, the morphological present-preterit opposition is retained in the sub- 
junctive but the consecutio temporum rule has been lost; in the subjunctive, tem- 
poral anteriority is now expressed by periphrastic forms (12b) contrasting with 
non-anterior simple forms (12a): 


(12) a. Er sagte, dass seine Eltern reich seien. 
he said-PRET.IND that his parents rich are-PRES.SUBJ 
"He said that his parents were rich.’ 


b. Er sagte, dass seine Eltern eine grofe Bibliothek 
he said-PRET.IND that his parents a large library 
gehabt hätten. 
had had-PRET.SUBJ 


‘He said that his parents had possessed a large library.’ 


In the other languages, the subjunctive has mostly been lost as a productive inflec- 
tional category. Minor exceptions exist like the more or less fossilized "present 
subjunctives" that I/ he be and the preterit I / he were in English, and vore ‘would, 
should be’, sutte ‘would, should sit’, etc. in Swedish (SAG 1999: 549). In general, 
such preterit forms have counterfactual or hypothetical meanings. On the other 
hand, the simple past tense form, deriving historically from the indicative, allows 
for counterfactual use: 


(13) Ifonly I had the money! (Eng) 


More often, counterfactuality etc. is expressed by analytic constructions with the 
past tense form of a desemanticized modal: 


(14) a. He would certainly do it. (Eng) 


b. Als ik geld zou hebben, zou ik op reis 
if I money should-PRET have, should-PRET I on travel 
gaan. (Dut) 
go 


‘If I had the money, I would go traveling.’ 
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In the languages where the subjunctive is lost, the consecutio temporum rule is re- 
tained, as in (15): 


(15) a. He claims that he is ill. (Eng) 
b. He claimed that he was ill. (Eng) 


3.2. Passive 


All old and modern Germanic languages including Gothic have periphrastic (ana- 
lytic) passive constructions consisting of a copula verb and an inherently passive 
participle of a transitive verb agreeing with the subject. Agreement has been re- 
tained in a northern area comprising modern Icelandic, Faroese, Swedish, and Nor- 
wegian Nynorsk: 


(16) a. Barna vart sende heim att. (Nor-NN) 
children-su.PL.DEF were sent-PART.PL home again 
"The children were sent home again.’ 
b. Báturin bleiv málaður av henni. (Far) 
boat-sU.MASC.SG.DEF became painted-PART.MASC.sG by her 
"The boat was painted by her.’ 


The other languages have non-inflecting participles (supine forms). In West Ger- 
manic these presumably derive from non-inflected (“nominal”) forms of the parti- 
ciple (Fleischer 2007). Cf. (17): 


(17) Het huis wordt gebouwd. (Dut) 
the house becomes built-PART(SUP) 
"The house is being built.’ 


Impersonal passives with a, by definition, non-agreeing participle of an intransitive 
verb appeared quite early: 


(18) thes ér iu ward giwahinit (OHG, Otfrid) 
that-GEN earlier already became mentioned-PART(SUP) 
"This was already mentioned earlier.’ 


The Germanic languages differ, though, with regard to passive auxiliary. English 
and Icelandic have retained ‘be’; Continental Germanic, Faroese, and Norwegian 
Nynorsk have desemanticized verbs originally meaning ‘become’; Mainland Scan- 
dinavian and, optionally, Faroese have descendants of the Middle Low German 
borrowing bliven ‘remain’, whose continuative aspectual meaning first changed 
into "become" and was then neutralized in passive auxiliary use. 

In addition to these core passive constructions, a variety of other more recent 
passive periphrases have come into existence in individual languages (Harbert 
2007: 321—322). 
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3.3. Periphrastic tenses: perfect and future 


The original Germanic tense system consisting of simple present and past tense 
forms has been expanded to include periphrastic tenses. 

The two original types of perfect constructions were: (a) copula constructions 
with ‘be’ and a past participle agreeing in case with the nominative subject of the 
sentence; and (b) co-predicative constructions with a verb meaning *have, possess' 
and a past participle agreeing in case with the (direct) accusative object. In the 
former case, the participle was intransitive, having inherently active meaning; in 
the latter case, the inherently passive participle of a transitive verb was reanalyzed 
as active in the course of the grammaticalization process that led to its establish- 
ment as a perfect auxiliary (Askedal 2009: 30—35). In the overall development of 
the Germanic perfect constructions four separate stages can be distinguished, rep- 
resenting increasing analyticity and constructional generality: 


(i) An initial stage with an agreeing participle in both *be'- and *have’-constructions. 
(ii) Retention of agreement with ‘be’ but loss of agreement with ‘have’: 


(19) a. Dei var komne heim. (Nor-NN) 
they-sU.PL were come-PART.PL home 
“They had come home.’ 


b. Ho hadde kjøpt boka. (Nor-NN) 
she had ^ bought-PART(sUP) book-DEF 
*She had bought the book." 


(iii) Retention of ‘be’ but generalization of non-agreement in the participle: 


(20) a. Hij is naar Zuid-Amerika gereisd. (Dut) 
he is to South-America traveled-PART(SUP) 
“He has gone to South America.’ 
b. Hij heeft'm het boek gegeven. (Dut) 
he has'him the book given-PART(SUP) 
‘He has given him the book 


(iv) Generalization of ‘have’ at the expense of ‘be’: 


(21) a. He has gone home now. (Eng) 
b. He has read the book. (Eng) 


The first stage ceased to exist in the early Middle Ages. The second stage is still 
represented in Faroese and Norwegian Nynorsk, and the third stage in Danish and 
Continental Germanic. The fourth stage is found in Icelandic, Swedish, and Eng- 
lish. Norwegian Riksmal/Bokmal is now rapidly moving towards the fourth stage. 
An exponent of analyticity, the periphrastic perfect in Continental Germanic, and 
particularly in South German, has wider application than in the other languages 
and is often used synonymously with the simple past tense. 
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No ancient Germanic language showed traces of a separate future tense. Lan- 
guage-specific future periphrases with an infinitive, representing a variety of gram- 
maticalization paths (Abraham 1989), made their appearance in the course of the 
Middle Ages. Most common are constructions with the modals ‘shall’ and ‘will’, 
occurring in Mainland Scandinavian, English, Dutch, and Frisian. Similar modal 
constructions were used in Old and Middle High German, too, but in German the 
grammaticalization process was reversed and the modals in question were replaced 
by aspectually desemanticized werden ‘become’. In addition to desemanticized 
modals, Mainland Scandinavian makes use of the verb ‘come (to)’ with the particle 
infinitive (cf. Norwegian komme til å ...) and Dutch employs the deictic antonym 
gaan ‘go’ (cf. French aller): 


(22) a. Hon kommer att neka till allt. (Swd) 
she comes to deny-ınr to all 
‘She is going to deny everything.’ 
b. Het gaat regenen. (Dut) 
it goes rain-INF 
‘It’s going to rain.’ 


3.4. Aspectual periphrases 


Since medieval times, individual Germanic languages have added a number of ex- 
pressions for aspectual distinctions to their inventory of auxiliary-like construc- 
tions. The most highly grammaticalized example of this is presumably the English 
progressive, which contrasts semantically with the simple present: 


(23) a. She writes beautiful poems. (Eng) 
b. He was writing a letter when the news arrived. (Eng) 


Some languages use posture or movement verbs like ‘sit’, ‘stand’, ‘lie’, ‘walk’ to- 
gether with a second lexical verb that likewise denotes an on-going action. In 
Dutch, a particle infinitive, and in Frisian, a particle gerund is used (Haeseryn et al. 
1997: 973-974; Tiersma 1985: 127). The corresponding constructions in Mainland 
Scandinavian are ‘pseudo-coordinations’ with two finite verbs (Hesse 2009). 
Dutch and German share another progressive construction with ‘be’ and a preposi- 
tional phrase consisting of the preposition German an, Dutch aan ‘at’ and a nomi- 
nalized infinitive (Van Pottelberge 2005). Icelandic has a tripartite subsystem of its 
own (see 24a-c) comprising the verbs or verb expressions vera ‘be’, vera buinn 
*(lit.) be finished’, and fara ‘go’ with an infinitive plus the prepositional marker ad 
(Kress 1982: 154—162). Only the last of these is found in neighbouring Faroese 
(Thráinsson et al. 2004: 315-316). Cf.: 
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(24) a. Eg er að spyrja. (Ice) 

I am to ask-INF 
‘I am asking (right now).’ 

b. Eg er büinn að gleyma. (Ice) 
I am finished to  forget-INF 
‘I have (just) forgotten it.’ 

c. Eg fer að hlæja. (Ice) 
I go to laugh-ınr 
‘I am (just) about to laugh.’ 


3.5. Modality 


A conspicuous trait of early Germanic verb morphology is a special group of so- 
called preterit-present verbs. This group was formerly larger than it is today; the 
remaining verbs have, however, developed a clearer common semantic profile in 
the domain of modality. In addition to retaining their special inflection, these verbs 
are, in individual languages, also characterized by specific morphosyntactic traits 
indicating auxiliary-like status, such as: (i) use of the bare infinitive in contrast to 
the particle infinitive found with most other infinitive-governing verbs; (ii) dis- 
tributional restrictions and idiosyncratic perfect participle morphology (‘substitute 
infinitive’) in German and Dutch; (iii) weakening or outright loss of the present- 
preterit tense opposition in connection with modal specialization of the tense forms 
in English (He could/should/might do it). Of special interest is the development, in 
several languages, of an epistemic use (25b), (26b) in addition to, and partly syn- 
tactically different from, the traditional deontic (“root”) uses (25a), (26a). The ep- 
istemic use occurs in an epistemological (25b) as well as a quotative (26b) var- 
lety: 


(25) a. Sie hatten das Problem sehr schnell lösen müssen. (Grm) 
they had the problem very fast solve-INF must-INF(PART) 
"They had been forced to solve the problem very quickly.’ 

b. Sie müssen das Problem in sehr kurzer Zeit gelöst 

they must the problem in very short time solved-PART 
haben. (Grm) 
have-INF 
"They must have been able to solve the problem in a very short time.’ 


(26) a. Hun skulle ta seg av barna. (Nor) 
she  should-PRET take-ınr her-REFL of children-DEF 
‘She was to take care of the children.’ 
b. Han skal ha vert en meget begavet mann. (Nor) 
he  shall-PREs have-INF been-PART a very gifted man 
‘He is reputed to have been a very gifted person.’ 
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In several languages both the older “root” subsystem and the newer epistemic one 
have to a certain extent proved semantically productive, the former attracting verbs 
meaning ‘have’, ‘need’, ‘know’, and the latter attracting certain other verbs mean- 
ing ‘seem’, ‘threaten’, ‘promise’. 


4. The noun phrase: inflectional categories, morphology, and syntax 


4.1. Gender 


In nouns, the three original Germanic genders masculine, feminine, and neuter are 
retained in High German and a northern area comprising Icelandic, Faroese and 
Norwegian Nynorsk. At the other extreme, gender marking is lost in English. In the 
remaining languages Dutch (and Low German), Frisian, Danish, and Swedish, the 
masculine-feminine opposition has been neutralized, yielding an utrum category 
that contrasts with the neuter. (Norwegian Bokmal allows for both a two- and 
three-gender system.) Morphological gender distinctions in the plural of nouns are 
to a varying extent retained in Icelandic, Faroese, Swedish, and Norwegian Ny- 
norsk and Bokmal but neutralized in the remaining, southern languages (Harbert 
2007: 100-102). 

In the three-gender languages, personal pronouns in the singular, more rarely in 
the plural, agree in accordance with grammatical gender, cf., e.g., German der 
Mann / der Stuhl — er ‘the man / the chair — he’, die Frau / die Krone — sie ‘the 
woman / the crown - she’, das Kind / das Wort — es ‘the child / the word — it’. In the 
languages with partial or total neutralization of gender oppositions, masculine or 
feminine singular personal pronouns are used with reference to animate (human) 
utrum-gender noun phrases in accordance with natural sex. Compare Norwegian 
Riksmäl/Bokmäl mannen — han ‘the man — he’, kvinnen — hun ‘the woman — she’; 
with reference to inanimate masculine-, feminine- or utrum-gender noun phrases a 
corresponding utrum pronoun is used: masculine/utrum stolen — den ‘chair — it’, 
utrum lampen / feminine lampa — den ‘the lamp - it’ (Harbert 2007: 97-98). In 
Dutch, especially in Belgium, hij (etc.) ‘he’ and zij (etc.) ‘she’ are still used with 
reference to etymologically masculine and feminine gender nouns (noun phrases) 
regardless of animacy (Dekeyser 1980: especially 107-109; ANS 1997: 243—244). 
English has, with certain notable exceptions, generalized the neuter pronoun if 
with inanimate (non-human) reference. According to Tiersma (1985: 64), Frisian 
seems to represent a mixture of the Dutch and English systems. 

Gender oppositions in strong adjectives most often follow nouns. Norwegian 
Bokmal is an exception, having an utrum-neuter opposition in adjectives in com- 
bination with optional two- or three-gender system in nouns. In general, weak ad- 
jectives show more, in some languages total neutralization of gender distinctions. 
In Dutch and Frisian, adjective inflection has been reduced to a Ø — /-e/ opposition 
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(ANS 1997: 404—412), whereas English has done away with adjective endings 
altogether. (On the morphological ‘strong’—‘weak’ opposition in Germanic adjec- 
tives cf. Harbert 2007: 130-137.) 


4.2. Number 


Since the earliest stages of Germanic, nouns and adjectives have been restricted 
to a singular-plural opposition. Gothic, Old English, Old Saxon, and Old Norse 
1st and 2nd person pronouns also have dual forms. These are generally lost during 
the Middle Ages (with the marginal exception of North Frisian; Harbert 2007: 177) 
but are occasionally, in Icelandic and in Bavarian dialects, retained with plural 
meaning (cf. Kress 1982: 104). 


4.3. Articles and the morphological marking of definiteness 


Germanic definite articles are of two different kinds: (1) lexical article derived from 
a pre-nominal demonstrative; and (ii) suffixal morpheme inflecting for whatever 
categories are required by the noun or noun phrase of which it is a part. The lexical 
type is found in Continental Germanic and in English as the only type, and in Scan- 
dinavian under certain conditions. The suffixal type is characteristic of North Ger- 
manic. It arose during the Viking age, prior to the onset of the drift towards analy- 
ticity, as the result of the cliticization of an agreeing, post-nominal demonstrative 
pronoun to a preceding noun. The original doubly inflecting structure is still trans- 
parent in modern Icelandic (27a); in the other Scandinavian languages it has been 
simplified to yield portmanteau definiteness morphemes (27b): 


(27) a. hestsins (Ice) VS. b. hestens (Nor) 
horse-GEN--DEF. GEN horse-DEF.GEN 
*the horse's' 


The interaction of suffixal and lexical articles with attributive adjectives yields 
the following distribution patterns: (i) weak adjective and suffixal article (28a); 
(ii) lexical article and weak adjective (28b); and (iii) lexical article, weak adjective 
and suffixal article (“definiteness doubling") (28c): 


(28) a. rauói hesturinn (Ice) 

red-wEAK horse-NOM.SG+DEF.NOM.SG 
‘the red horse’ 

b. den gamle dreng (Dan) 
the old-wEAK boy 
‘the old boy’ 

c. den røde hesten (Nor) 
the red-wEAK horse-DEF.SG.UTRUM 
‘the red horse’ 
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Icelandic has patterns (i) and (ii), Faroese, Norwegian and Swedish (ii) and (iii), 
and Danish (ii). 

In North Germanic, weak adjectives are to a certain extent still carriers of defi- 
niteness (cf. Haberland and Heltoft 2008); compare the contrast between weak ad- 
jective in (29a) and strong adjective in (29b): 


(29) a. Kjere venn! (Nor) vs. b. Lieber Freund! (Grm) 
dear-wEAK friend dear-sTRONG friend 
"Dear friend" 


5. Argument alignment and the coding of syntactic relations 


3.1; Accusativity and the relationship between morphology and syntax 


Germanic has always possessed the syntactic characteristics of a nominative— 
accusative (not ergative-absolutive or agentive) type language, where subjects of 
transitive and intransitive verbs have basically the same kind of syntactic rule 
properties. Traditionally, a number of important syntactic rules, such as subject- 
verb agreement, imperative formation, the passive conversions and Equi, are nat- 
urally correlated with the nominative and accusative as so-called “structural” (as 
opposed to “lexical”) cases (Harbert 2007: 112-122), as they still are in modern 
German in contrast to languages with loss of morphological case distinctions. Cf., 
e.g., the German passive clauses in (30) and their English translations: 


(30) a. Er wurde von den Freunden unterstützt. 
he-sU.NOM was by the friends assisted 
‘He was assisted by his friends.’ (Grm, personal passive of transitive 
verb) 
b. Ihm / *Er wurde von den Freunden geholfen. 
him-bAT/ he-NOM was by the friends helped 
*He was helped by his friends.' (Grm, impersonal passive of intransitive, 
dative-governing verb) 


3225 Loss of morphological case and the development of positional 
(topological) syntax 


According to the traditional view, loss of morphological case usually entails tran- 
sition to a positional syntax where ordering restrictions become relevant as a 
means of encoding syntactic relations. In certain instances, however, the syntac- 
tic behaviour characteristic of a certain morphological case seems to survive the 
loss of the case marking. In Dutch (31b), retention of a previously dative-marked 
indirect object, and in (32b), of a previously dative-marked direct object, 1s re- 
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cognizable by the singular form of the finite verb in construction with a plural 
(object) NP (cf. ANS 1997: 1323, 1406-1412; on Frisian, cf. Tiersma 1985: 
111): 


(31) a. De politie bood de overvallers een vrijgeleide aan. 
the police-su offered the robbers-io.PL a safe-conduct-po to 
"The police offered the robbers safe conduct.' (Dut, active) 


b. De overvallers werd door de politie een 
The robbers-10.PL was-sG by the police a 
vrijgeleide angeboden. 


safe-conduct-su.sG to-offered 
‘Safe conduct was offered the robbers by the police.’ (Dut, passive with 
subjectiviziation of the direct object) 


c. *De overvallers ` werden door de politie een 
the robbers-su.pL were-PL by the police a 
vrijgeleide angeboden. 


safe-conduct-Do.sG  to-offered 
(Dut, illicit passive with subjectiviziation of the indirect object) 


(32) a. Helpt men u al? 

helps one-su you-DO.OBL(< DAT) already 
*Do they help you already?' (Dut, active) 

b. Wordt u al geholpen? 
is-SG  you-DO.OBL(«— DAT) already helped 
‘Are you being helped already?’ (Dut, impersonal passive with retention 
of the object) 

c. Word je al geholpen? 
are-2.PL you-sU(«-— DAT) already helped 
(Dut, personal passive with subjectivization of the object) 


In cases like (31)-(32), it appears that a formerly general case-based restriction has 
been reinterpreted as a partly obligatory, partly optional lexical rule. 

The assumption that the transition to topological syntax is caused by loss of 
case morphology as a kind of compensatory device is contradicted by the so-called 
‘oblique subjects’ of modern Icelandic and Faroese. Oblique subjects are non- 
nominative, typically clause-initial arguments in the dative, accusative and, mar- 
ginally, genitive case that share syntactic rule properties with true nominative sub- 
ject NPs (cf. Thráinsson 2007: 156-167; Thráinsson et al. 2004: 252—257), as in 
Equi constructions (33b): 


(33) a. Eg vonast til að mig vantar ekki peninga. 
I hope to that me-acc lacks not money-Acc 
‘I hope that I don't lack money.’ (Ice, finite complement clause) 
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b. Ég vonast til ad vanta ekki peninga. 
I hope to -Acc to lack not money-Acc 
‘I hope not to be lacking money.’ (Ice, infinitival complement clause) 


Oblique subjects are also documented as a transitional stage in the syntactic devel- 
opment of English and Norwegian (Allen 1996; Mørck 1995). 

Concerning the relationship between morphological case and syntactic rules, 
one may also note that in Faroese (but not in Icelandic), certain dative objects are 
even turned into nominative subjects in the passive (34): 


(34) a. Teir róstu henni altíð. 
they praised her-bAT always 
"They always praised her.' (Far, active of dative-governing verb) 
b. Hon varð altíð róst. 
she-NoM was always praised 
*She was always praised.’ (Far, personal passive with subjectivization of 
dative object) 


Loss of morphological case in connection with transition to topological syntax is 
a characteristic feature of English as well as Mainland Scandinavian. For instance, 
in Norwegian presentational passive sentences direct objects retain their syntactic 
object properties (NRG 1997: 845—847), as is evident from their behaviour in VP- 
pronominalization: 


(35) a. Mange bøker ble utgitt den høsten. 
many  books-su were published that autumn (Nor, personal pass- 
ive with lexical subject) 
b. Ble det utgitt mange bøker den hösten? 
was it-sU.EXPL published many  books-bo that autumn 
*Were many books published that autumn?' (Nor, impersonal passive 
with non-referential, expletive subject and retained direct object) 
—Ja, det ble det. (corresponding VP-pronominalization) 
— yes, it-SU.EXPL was it-vPPRO [i.e. published many books] 
*— Yes [many books were published that autumn].’ 


Even more spectacularly, notional subjects in active existential-presentational sen- 
tences also have syntactic object properties to a certain extent (NRG 1997: 833-835): 


(36) a. Mange bøker utkom den høsten. 
many  books-su appeared that autumn (Nor, active with lexical sub- 
ject) 
b. Utkom det mange bøker den høsten? 
appeared it-ExPL.su many books that autumn 
‘Did many books appear that autumn?’ (Nor, impersonal active with non- 
referential, expletive subject) 
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— Ja, det gjorde det (corresponding VP-pronomina- 
lization) 
— yes, | it-EXPL.SU did it-vPPRO De appeared many books] 


*— Yes [many books appeared that autumn." 


5.3. Impersonal constructions and expletives 


A distinction has to be made between lexical impersonals, comprising meteoro- 
logical and similar types of predicates, on the one hand, and active and passive 
existential-presentational constructions on the other. The former have lexically or 
syntactically, the latter syntactically conditioned expletives (cf. Harbert 2007: 
224-233). 

One group of languages consisting of Insular Scandinavian, Norwegian, Swed- 
ish, and German has the neuter singular personal pronoun in both kinds of imper- 
sonals (cf. German Es schneit ‘It’s snowing’ and Norwegian (35b), (36b) above). 
The other, North Sea Germanic group comprising Danish, Frisian, Dutch, and Eng- 
lish, uses the neuter singular personal pronoun as a lexical expletive, but a form of 
the deictic *there'-adverb as a syntactic expletive (/t's snowing vs. There is trouble 
ahead). With regard to distribution, Icelandic and Faroese differ from all the other 
languages in using the neuter singular pronoun as a mere placeholder in clause-in- 
itial position in lexical as well as existential-presentational impersonals. In Ger- 
man, the neuter singular is, with few and marginal exceptions, obligatory as a for- 
mal subject in lexical impersonals; the corresponding syntactic expletive, however, 
sides with its Icelandic and Faroese counterparts in appearing only as a mere place- 
holder in main-clause initial position. Consider: 


(37) a. Es schneit heute wieder. (Grm) 
it-SU.EXPL snows today again 
‘It’s snowing again today.’ 
b. Heute schneit es wieder. (Grm) 
today snows it-SU.EXPL again 
“Today, it’s snowing again.’ 


(38) a. Es standen drei Männer an der Ecke. (Grm) 
it-EXPL.SG stood-PL three men-su.PL at the corner 
"Three men were standing at the corner.' 
b. An der Ecke standen (*es) drei Männer. (Grm) 
at the corner stood-PL (it) three men 


Dutch and Frisian have an obligatory formal subject in lexical impersonals. In 
contrast with German, the syntactic 'there'-expletive also turns up clause-inter- 
nally in existential-presentational constructions: 
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(39) a. Toen stond er plotseling een onbekende voor de deur. 
then stood there-ExpL suddenly an unknown before the door 
"Then suddenly an unknown person was standing in front of the door.’ 
(Dut, active existential-presentational sentence) 

b. Moarn wurdt (der) in hinder ferkocht. 

tomorrow becomes (there-ExPL) a horse sold 
"Tomorrow a horse is going to be sold.’ (Frs, passive existential-presen- 
tational sentence) 


Both the Scandinavian neuter singular syntactic expletive and its English 
*there" -counterpart are largely obligatory, which testifies to the subject-like func- 
tion even of the latter. In German, the syntactic expletive is a non-subject. Distribu- 
tionally, the syntactic expletives of non-case-marking but verb-final Dutch and Fri- 
sian seem to occupy an intermediate position between those of non-case-marking, 
non-verb-final English and Mainland Scandinavian on the one hand, and that of 
case-marking, verb-final German, on the other, indicating a connection with more 
general typological parameters. 


5.4. Aspects of ditransitive constructions 


In ditransitive constructions, IO-DO is the unmarked and most often obligatory 
order. When both objects are personal pronouns, German, Dutch, and Frisian have 
DO-IO (40a); this order is also marginally present in English (40b): 


(40) a. Er hat sie ihm geschenkt. (Grm) 
he has her-po.acc him-Io.DAT given 
‘He has given it to him.’ 
b. She gave it him. (British English) 


When both objects are full NPs, DO-IO order is available in German for purposes 
of focusing or rhematicization (41b): 


(41) a. Er hat seiner Freundin einen | den Ring 
he has his girlfriend-10.bAT a/ the ring-DO.ACC 
geschenkt. (Grm) 
given. 
‘He has given his girlfriend a / the ring.’ 
b. Er hat den Ring seiner / einer Freundin 
he has the ring-po.acc his/a girlfriend-10.DAT 
geschenkt. (Grm) 
given. 
“He has given the ring to his / a girlfriend.’ 


64 John Ole Askedal 


Similar linear transpositions are known to be possible in Icelandic, although they are 
less frequent in actual usage (Thráinsson 2007: 131-132). In the majority of modern 
Germanic languages, indirect and direct objects are thus coded by relative position. 
The beneficiary element of the ditransitive construction also manifests itself as 
part of a prepositional phrase rather than as a positional or dative IO. Two kinds of 
variation have to be noted. First, the prepositions used vary between languages and 
even within the same language (Harbert 2007: 109—110). Second, the IO-DO-con- 
struction with two noun phrases and the construction with a beneficiary (etc.) prep- 
ositional phrase are not equally common or indeed acceptable in the individual lan- 
guages. For instance, German allows for two noun phrase objects in a fair number of 
cases where Norwegian does not (cf. Askedal 1997). Cf., e.g., (42a) and (42b): 


(42) a. Lichtenberg musste ihn ihr 
Lichtenberg-sU.NOM must-PRET him-Do.Acc her-10.DAT 
beschreiben. (Grm) 


describe 
“Lichtenberg had to describe it to her.’ 
b. Lichtenberg matte beskrive det for henne. (Nor) 


Lichtenberg-su must-PRET describe it-po for her 


Compare also Thráinsson (2007: 231—232) on similar differences between Ice- 
landic, Faroese, and Swedish. Such findings support the general structural con- 
clusion that positional constraints are relationally important to a different degree in 
the various modern Germanic languages. 


6. Concluding remarks 


In the course of their history, the Germanic languages have been transformed ty- 
pologically to a different degree. In general, all of them have developed new ana- 
lytic means of expression, such as various adnominal constructions substituting for 
the attributive genitive (cf. Harbert 2007: 158—160) and, in particular, an abun- 
dance of semantically diverse grammaticalized verb constructions. Still, the indi- 
vidual languages differ with regard to the extent to which they have undergone 
morphological simplification. German and Icelandic, and to a lesser degree Fa- 
roese, are morphologically conservative. Compared with these, all the other lan- 
guages have undergone significant simplification. In the former case, the result is 
syntheticity enriched by new analytic means of expression; in the latter case, it is 
predominant analyticity with a more or less comprehensive synthetic morphologi- 
cal residue. 

In a central Germanic case-neutralizing VX (VO) area, comprising English and 
Mainland Scandinavian, a characteristically ‘positional’ syntax has developed in 
conjunction with morphological simplification. It should, however, be noted that 
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the case-marking VX (VO) languages Icelandic and Faroese have made significant 
steps in the same direction, as evidenced in particular by the ‘oblique subjects’ of 
these languages, which are thus morphologically conservative but syntactically in- 
novative. Hence, the relational significance of constraints on linear order turns out 
to be an important typological characteristic of all non-Continental Germanic lan- 
guages. 

Within Continental Germanic, German has the more obvious characteristics of 
an XV (OV) type language, with more left-directionality in verb chains, function- 
ally relevant case marking, more flexible IO and DO order, and stricter dependence 
of syntactic rules on morphology. Although Dutch and Frisian share basic XV 
(OV) characteristics with German, and like German have pronominal DO-IO 
order and (often) lack subjectivization of indirect objects in the passive, these 
languages are also similar to English and Mainland Scandinavian in having case 
neutralization and strict non-pronominal IO-DO order. Besides, Dutch has more 
right-directionality in verb chains and both Dutch and Frisian employ syntactic ex- 
pletives more freely than does German. 

These observations seem to justify the general conclusion that the present-day 
Germanic languages occupy different positions on a scale defined in terms of basic 
verb order differences, (more or less) case marking, and syntactically relevant con- 
straints on constituent order, where, generally speaking, Mainland Scandinavian 
and English on the one hand, and German on the other, occupy opposite positions 
at the most innovative and most conservative poles of the scale, respectively. 
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Abbreviations 

Dan Danish 

Dut Dutch 

Eng English 

Far Faroese 

Frs (Modern West) Frisian 

Grm German 

Goth Gothic 

Ice Icelandic 

NN (Norwegian) Nynorsk (‘New-Norwegian’) 
Nor Norwegian (examples without further specification are RM/BM) 
RM/BM (Norwegian) Riksmal/Bokmal 

Swd Swedish 

acc accusative 

adj adjective 

dat dative 

def definite 

do direct object 

expl expletive 

gen genitive 

ger (Modern West Frisian) gerund 

ind indicative 

indef indefinite 

inf infinitive 

inf(part) infinitive with participial function (substitute infinitive) 
1o indirect object 


nom nominative 
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obl 
part 
part(sup) 
pass 
pl 
pres 
pret 
refl 
sg 

su 
subj 
sup 
utr 
vppro 


1 


(pronominal) oblique case 
(past/passive) participle 
(non-agreeing participle as) supine 
passive 

plural 

present 

preterit 

reflexive 

singular 

subject 

subjunctive 

supine 

utrum (non-neuter gender) 

VP (verb phrase) pronominalization 


first person 
second person 
third person 
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1. Introduction 


Of all European language families Romance has the largest number of native 
speakers worldwide. A branch of Indo-European, it subsumes all the languages 
that historically evolved from Latin. Most prominent among these are global lan- 
guages such as Spanish, Portuguese and French as well as European languages 
such as Italian, Romanian, Catalan. But there are also “lesser used” languages such 
as Occitan, Rhaeto-Romance, Sardinian and Galician. The number of Romance 
languages varies due to the notoriously imprecise boundary between language and 
dialect. In recent years, attempts to protect linguistic minorities have been accom- 
panied by tendencies to classify linguistic varieties as languages rather than as 
dialects, and in some classifications of the Romance languages varieties such as 
Asturian, Aragonese, Valencian, Gascon, Corsican, Piemontese, Venetian or Aro- 
manian appear as languages. We will not go further into this discussion, but it 
seems important to point out that all the lesser-used Romance languages and var- 
ieties must be considered, at least to some degree, as endangered. 

A general distinction can be established between the “Old Romania”, i.e. the 
Romance languages spoken in Europe, and the “New Romania”, Romance spoken 
due to colonization or migration in other parts of the world, above all in the Ameri- 
cas and in Africa. Traditional, i.e. genetically and geographically based, classifi- 
cations distinguish between Western and Eastern Romance languages (with a line 
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from La Spezia to Rimini, across the centre of the Italian peninsula, separating 
both according to phonetic and morphological differences); another classification 
distinguishes areas such as Balkan-Romance, Italo-Romance, Gallo-Romance and 
Ibero-Romance (with Catalan and Dalmatian, a language that died out at the end of 
the 19th century, as so-called “bridge languages" between the areas). 


Table 1: 


Areal classification of the major Romance languages 


Ibero-Romance 


Portuguese 


Galician 


Spanish 


Gallo-Romance 


Catalan 


Occitan 


French 


Franco-Provengal 


Italo-Romance 


Italian 


Sardinian 


Rhaeto-Romance 


Balkan-Romance 


Dalmatian + 


Romanian 


The following two maps show the worldwide distribution of the Romance lan- 
guages and the European Romance language areas. Figure | shows the areas where 
Romance languages are spoken as L1 but also those areas (particularly in Africa) 
where Romance languages are official / national languages or lingua francas for a 
large part of the population: 


Spanish 


French tomania 
Spani ie T 


Portuguese i" Ki 


gh 


Ki 


Figure 1: The Romance language areas of the world 


Scattered presence of: 


1 French 
2 Italian 
3 Portuguese 
4 Spanish 
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Figure 2 gives an overview of the traditional language territories of the different 
Romance languages according to dialectological criteria. Striped areas indicate 
territories where Romance languages are nowadays used as official / national lan- 
guages but which do not belong to their traditional dialectologically defined area of 
extension: 


E 


French 


Romanian 


Figure 2: The Romance language areas in Europe 


Considering the Romance languages as a typologically coherent object of study 
is justified not only on genetic grounds; areal and purely synchronic reasons can 
also be given. The genetic perspective is implied by the mere fact of classifying the 
Romance languages as a family. As for areal reasons, there exists an "alliance" 
relationship between the Romance languages, as Bossong (1998: 1005) baptized it: 
apart from their historical kinship, the Romance languages have also had a shared 
contact history during their independent evolution — above all with classical Latin, 
but also with other languages. Some scholars even claim (with evident shortcom- 
ings) that after the post-Latin period of divergence, there are now dominant ten- 
dencies towards convergence observable within the family. Apart from the inner- 
Romance areal convergence, areal parallels have been observed with Germanic 
(for the Gallo-Romance and partly the Italo-Romance area) and with Balkan lan- 
guages (for Balkan-Romance within the “Balkan-Sprachbund”). In other cases, 
such as Brazilian Portuguese, areal arguments (claiming indigenous or African in- 
fluence) seem more speculative. 
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If we look at the Romance languages from a morphological, syntactic or con- 
tent-oriented synchronic perspective, there are several features common to all of 
them that justify the assumption of a more or less coherent Romance type different 
from Latin. Within this common Romance type, various subtypes can be identified. 
Traditionally, these subtypes have been related to different stages of evolution: 
whereas all the Romance languages have preserved until today a series of common 
typological features since the Middle Ages, there are a number of typological char- 
acteristics which emerged in the course of the last centuries, with French (contrary 
to Old French, which still belongs to the common type) as the most advanced and 
most innovative language among the Romance literary languages. However, it 
would be too simplistic to reduce the different subtypes to some sort of panchronic 
positions on a single line of evolution in the sense of a Sapirian drift. As will be 
shown in section 5, some of the fundamental evolutionary tendencies within the 
Romance family have clearly been divergent since the Middle Ages and led to re- 
sults that must be described separately. 

The Romance language family is one of the best documented ones among the 
language families of the world. Romance Philology as a discipline goes back to the 
19th century, and even the evolution of typology itself is closely linked to Ro- 
mance linguistics (see Bossong 2001). An historiographical overview of Romance 
Linguistics is offered by the monumental Lexikon der Romanistischen Linguistik 
(Holtus, Metzeltin and Schmitt 1988—2005); for a recent synopsis on historical and 
typological aspects see Maiden, Smith and Ledgeway 2011. 

Almost all Romance languages are written in Latin orthography. Until the 19th 
century, Romanian used Cyrillic spelling, as did Moldavian, the Romanian variety 
spoken in Moldova, even in recent times when Moldova was part of the Soviet 
Union. Ladino, the Spanish varieties preserved by Sephardic Jews after their ex- 
pulsion from Spain in 1492, is partly written in Hebrew spelling. 


2. Phonology 


2.1: Vowel system 


Classical Latin had a rather simple vowel system with five cardinal vowel pho- 
nemes /a/, /e/, /i/, /o/, /u/ and two quantities. However, the common base for the 
Romance languages is not classical Latin but so-called Vulgar Latin, a term refer- 
ring to the partly reconstructed spoken varieties in the Roman Empire. In Vulgar 
Latin, the classical quantities collapsed; the emerging system distinguished seven 
vowel phonemes (/a/, /e/, /e/, /i/, lol, ol, /u/); this system can still be found in sev- 
eral Romance languages such as Italian, Sardinian and Galician. The vowel system 
is reduced to the classical five vowels in Spanish (due to diphthongization of Vul- 
gar Latin stressed E and O to ie and ue [BENE» bien; BoNU»bueno]); in all the other 
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Romance languages, the systems have evolved into more complex ones. Central 
(Standard) Catalan has an eight vowel system, adding a central (al to the common 
Vulgar Latin base; Occitan adds a rounded /y/ to the Catalan system; European 
Portuguese adds to the Vulgar Latin system two centralized vowels (/e/ and /i/) and 
five nasalized vowels that emerged from the contact with nasal consonants as well 
as nasal diphthongs. Romanian presents a seven-vowel system with /a/, /e/, //, /i/, 
/u/ plus the marked /i/ and a Schwa /o/. In Rhaeto-Romance varieties, systems be- 
tween six and ten vowels can be found. The most complex Romance vowel system 
is that of European French, with 12 oral and four nasal vowels. 


2:2. Consonants 


Most initial consonants in Latin were preserved in Romance. Some consonants — 
with the exception of some Sardinian dialects — were palatalized in contact with 
front vowels, like k*! gesi. Initial clusters were partly reduced. Intervocalic voice- 
less stops were preserved in Eastern Romance and sonorized in Western Romance; 
intervocalic voiced consonants were partly lost in Western Romance. The tendency 
of Vulgar Latin to prefer vocalic word endings “remains very strong in Italian, is 
less pronounced in Spanish and Portuguese, even less so in Catalan, Occitan and 
Romanian, and it is weakest in French" (Jensen 1999: 299, see also section 4.2.). 
“Geminated” consonants are characteristic for Italian and Sardinian, where 
stops in the syllable coda are generally long. The Ibero-Romance languages have 
preserved the Latin distinction between /r/ (a tap) and /r/ (a multiple vibrant); in 
other cases, the geminated consonants of Latin have generally been simplified. 


2.3. Prosody 


If we distinguish three dimensions of syllable structure — universal naturalness, 
L1-types and learned structures of later, L2-like acquisition — we can observe that, 
in contrast to their Latin ancestor, the Romance languages tend to simplify their 
syllable structures and to dominantly prefer the (universally "more natural") CV 
type (cf. Lehmann 2005; like Lat. TEMPUS » Sardinian Logudorese tempusu). In the 
course of the history of the Romance languages, however, the contact with Classi- 
cal Latin has repeatedly introduced more complex syllables associated with written 
language and oral standard, i.e. with "distance" varieties (see Koch/Oesterreicher 
2001, Heinz 2010). Frequently this has led to a difference between more complex 
syllables in the standard variety and reduced and simplified syllables in the more 
colloquial varieties (like Standard Spanish examen [eksamen] vs. popular 
[esame]). Some Romance languages are syllable-timing and tend to preserve the 
syllable structure. Others (like French, European Portuguese or Central Mexican 
Spanish) show some characteristics of the stress-timing type (or rather of the word 
rhythm type in the sense of Auer's 1993 distinction between syllable rhythm and 
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word rhythm), with tendencies towards vowel syncopations which, in turn, lead to 
consonant clustering and more complex syllables, like in European Portuguese 
(e.g. desprezar [d[pr.zar]) or in Mexican Spanish (e.g. bloques para apuntes 
[bloks.pa.ra.punts]). Stress is phonologically relevant in several languages (e.g. 
Spanish término ‘border’ — termino ‘I finish’ — terminó ‘he/she/it finished"); in 
French, the accent position is fixed at the end of the phonological word. 


3. Morphology and syntax: some preliminary remarks 


Before describing some of the morphological and syntactic categories of the Ro- 
mance languages, some general remarks on the Romance type seem to be neces- 
sary. As is commonly known, Latin is a highly synthetic language with a very rich 
morphology, a fact that allows an almost free word order in Latin and a "flat" syn- 
tactic structure (see Ledgeway 2011). The evolution from Latin to Romance is 
marked by the emergence of a series of syntagmatic functions that replace the Latin 
inflectional morphemes. This has traditionally been described as the shift from a 
synthetic (or more paradigmatic) to an analytic (or more syntagmatic) type, an idea 
already present in linguistics since August Wilhelm Schlegel at the beginning of 
the 19th century (see Bossong 2001: 719) and repeatedly mentioned in descrip- 
tions of the main transformations from Latin to Romance. This shift with a series 
of fundamental changes can be exemplified by a Latin sentence and its Romance 
translations, where almost every Latin word is replaced by at least two words in the 
Romance languages (see Table 2): 


Table 2: | Some major changes from Latin to Romance (adapted from Tecavéié 1972: 20; 
cf. also Ledgeway 2011) 


Latin VENDIDI LIBRUM ` MAIOREM PETRO MINOREM TIBI 
Italian ho venduto illibro più grande  aPietro quello più piccolo ate 
French jai vendu  lelivre plus grand A bere le plus petit à toi 
Spanish he vendido ellibro más grande a Pedro el menos grande ati 
Romanian am vándut  cartea mai mare lui Petru cea mai mica tie 
Ihave sold the book bigger to Peter the smaller to you 


I have sold Peter the bigger book and you the smaller one. 


Some comments must be added to this general observation. The first point was re- 
peatedly claimed by Eugenio Coseriu from the 1960s onwards (see Coseriu 1988), 
stressing that a distinction should be made between those functions expressed syn- 
tagmatically in Romance - thus corresponding to Schlegel's shift —, and others that 
were not affected by it. Several inflectional functions are still morphologically ex- 
pressed via nominal or verbal endings in Romance (with the exception of French; 
see section 4.1), especially, person, tense and number for verbs and gender and 
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number for nouns (see Table 3 for some examples of Spanish). This could simply 
be regarded as a casual effect of categories not yet affected by the shift, but in fact 
gender and number are even productive categories in the emergence of the 
Romance languages, and in several cases Romance languages have developed 
overt distinctions where they were opaque in Latin or Vulgar Latin. (Contrast, for 
example, ambiguous Latin CAsAE with Spanish casa / casas or Classical Latin 
ERAM, ERAS, ERAT; Vulgar Latin era, era, era with Italian ero, eri, era). According 
to Coseriu (1988), the distribution of analytic and synthetic constructions, histori- 
cally possibly related to Ancient Greek influence on Vulgar Latin, follows a typo- 
logical principle that expresses formally a more general, inner principle of gram- 
matical organization (in the sense of Humboldt's or Klimov's viewpoints on 
typology): in contrast to Latin, Romance codes "inner" functions, i.e. functions re- 
ferring to the category itself, by inflection, whereas "external" relationships be- 
tween several categories are coded by analytic constructions. (For French as a 
“purely” analytic language, see section 5) 


Table 3: Some synthetic and analytic functions in Romance (here: in Spanish) according 
to Coseriu (1988) 


synthetic, "inner functions" 


number perro, ‘dog’ perros ‘dogs’ 
gender perro ‘dog’ perra ‘female dog’ 
elative (without direct comparison) grandisimo ‘very big’ 


analytic, “external functions” 


case la casa del padre ‘the father’s house’ 
se lo doy a Juan ‘I give it to John’ 
superlative (with comparison) más grande que ‘bigger than’ 


Two further examples of this typological principle are, on the one hand, the emerg- 
ence of Romance verbal periphrasis expressing relational tense functions (i.e. re- 
lations between two temporal points like in Spanish voy a amar ‘I will love’ relat- 
ing present and future) and, on the other hand, the fusion of non-relational 
periphrastic forms to new synthetic forms, as in the case of the Vulgar Latin future 
AMARE HABEO > amar aio > Fr j'aimerai, It amero, Spn amaré. 

This general organization principle leads to a kind of syntactic iconicity: inner 
functions that determine a referent or the category of the referent are expressed by 
different means than syntactic relations between various referents, where the ana- 
lytic construction ‘mirrors’ the external relationship. This is illustrated in Figure 3: 
the diminutive in Spanish casita ‘small house’ is expressed by a suffix whereas 
the relationship between the doghouse and the dog is expressed by the preposition 
de: 
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La casita (dé Y perro 


Figure 3: “Iconic” effect of the Romance distinction between synthetic and analytic forms 


Another, more recent viewpoint considers the numerous shifts from analytic to syn- 
thetic constructions within a formal framework as epiphenomena related to a deeper 
change. This view denies the necessary existence of an organized syntactic linearity 
in (classical) Latin and postulates rather that the emergence of a fixed word order ac- 
companied the shift towards Romance. This shift included the rise of a DP, a linearly 
structured NP and a structured VP, with a default order as described in Figure 4: 


a NP: (DET) (QUANT) (*ADJ) N (*ADJ) (*PP) 
b VP: (AUX) V (*ADV) (*OBJ) (*ADV) 


Figure 4: Romance syntactic ordering contrasting with Latin (see Ledgeway 2011) 


As several authors have pointed out, the deeper change underlying all these phe- 
nomena can be described as the shift towards a “configurational” structure, with an 
almost English-like linear syntax completely different from the rather “flat” struc- 
ture of Latin. This means that both syntactic and morphological features need to 
be taken into consideration at the same time since diachronic changes made Ro- 
mance syntax develop categories which were expressed by morphology in Latin. 
Other categories such as the article did not exist in Latin and were newly generated 
in Romance. 


4. Morphology 
4.1. Inflectional morphology 


4.1.1. Verbs 


The Romance languages have preserved conjugation systems, but the rich Latin 
verbal morphology is generally reduced in Romance. It is extremely reduced in 
French, where only tense is systematically marked by endings while person, 
number and mood distinctions have partly disappeared. For subject conjugation, 
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French employs preverbal subject clitics that grammaticalized from subject pro- 
nouns, whereas the personal endings have been preserved only in writing but not in 
the spoken language. The loss of endings in the Middle French period (between the 
14th and the 16th century) affected not only the verb, but also the noun (see 4.1.2) 
and led to a complete reorganization of the French system: formerly postmodifying 
elements move to a premodifier position. Contemporary spoken Brazilian Portu- 
guese also tends towards an overall reduction of endings and substitution by pre- 
verbal and prenominal elements. In the other Romance languages, apart from some 
local or partial tendencies towards the loss of endings, tense, person, number and 
mood are generally distinguished by verbal suffixes: 


(1) Latin (1') Spanish (1") French 
Leg-o. Le-o. Je lis. 
read-1sG read-1sG lsG read 

‘T read.’ 


Object conjugation is grammaticalized to different degrees and marked by clitic 
pronouns in some Romance languages. In European Spanish, object conjugation 
distinguishes human objects from non-human objects (the so-called leísmo de per- 
sona); this correlates with differential object marking by prepositions (see 5.3.2). 
In Portuguese, Galician, and a number of other Romance varieties (Sardinian, Si- 
cilian, Ancient Napolitan, Ancient Romanian), inflectional endings can be added 
to infinitives. 

Tense is the basic category of the Romance verb systems. In the history of the 
individual languages, secondary periphrastic systems have emerged which gram- 
maticalized temporal verbal periphrasis to different degrees. This happened to a 
high degree in French, Northern Italian and Romanian, where the simple past per- 
fective forms have been replaced by periphrastic forms. In other languages like Ca- 
nadian French, Southern Italian or Spanish, synthetic and periphrastic perfect tense 
forms coexist with more or less differentiated meanings. The archaic verbal system 
in Galician is characterized by the non-existence, or perhaps rather marginal exist- 
ence, of temporal verbal periphrases. Catalan has — as the only Romance language — 
grammaticalized a past perfect periphrasis formed on the basis of the movement 
verb anar ‘to go’ + infinitive: 


(2) Catalan 
Ahir vaig anar a casa de-l meu amic. 
yesterday go.1sG go.INF at house of=DET my.M friend 
“Yesterday, I went to my friend's house.’ 


All verbal periphrases in Romance have secondary aspectual meanings deriving 
from the interaction of the semantics of the auxiliary and the non-finite form. The 
distinction between an imperfective and a perfective aspect is generally present in 
the Romance verb, but it is considered secondary compared with tense distinctions. 
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4.1.2. Nouns 


By means of suffixes, the Romance languages distinguish morphologically differ- 
ent genders (male and female, with female gender generally being the marked 
form) and numbers (singular and plural, plural being generally marked). French 
again is an exception: in Middle French, endings were lost and the task of coding 
the gender/number distinction passed over to the determiner, which at that time 
began to become a gender/number prefix. (A similar tendency can be observed in 
contemporary Brazilian Portuguese.) The complex Latin case system was replaced 
by a reduced 'dual case' system in the ancient Gallo-Romance languages, and mor- 
phological case marking disappeared completely in modern Gallo-Romance as it 
did in the other Romance areas at even earlier diachronic stages. The only excep- 
tion in this respect is Romanian, where a distinction between NOM and ACC, on 
the one hand, and GEN and DAT, one the other hand, is still marked morphologi- 
cally, as in Table 4: 


Table 4: | Case marking in Latin and Romance 


NOM ACC DAT GEN 
Latin homo hominem homini hominis 
Romanian omul omului 
Spanish el hombre al hombre al hombre del hombre 


In the other Romance languages, case is marked by word order or by prepositions, 
as in the Spanish examples for DAT and GEN in Table 4. 

The Latin neuter gender, however, was lost in Romance. Some vestiges may 
be found in pronominal systems (like French ca ‘this’). In some languages 
and varieties, the Latin neuter has taken on new functions, like in the case of the 
so-called Spanish neuter article /o. This element no longer denotes a third 
gender, but has rather assumed the function of denoting imprecise, non-discrete 
reference (Pomino and Stark 2009). A similar case is the Asturian “material 
neuter": apparently masculine adjectives accompanying female nouns indicate 
that the nouns are mass nouns, e.g. mantega fresco 'fresh butter' (see Fernández- 
Ordóñez 2006). 


4.2. Derivational Morphology 


The most productive procedure of word formation in Romance is derivation, above 
all suffixation and, to a lesser degree, prefixation. Categorial conversion without 
overt marking, almost impossible in Latin, is less generalized than in English or 
Chinese, but possible in several Romance languages (Adj —^ Noun Fr noir — le 
noir; Verb > Noun It parlare > il parlare). 
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The dominant type of compounding consists in preposing the determined ele- 
ment to the determining element, so that the general direction of compounding is 
the opposite of that to be found in English or other Germanic languages (e.g. Fr 
wagon-lits, Spn coche-cama, 'sleeping car’; Fr/Spn OTAN Organisation du traité 
de l'Atlantique Nord | Organización del Tratado del Atlántico Norte; Engl NATO 
‘North Atlantic Treaty Organisation’). However, some loanwords are Germanic- 
type compounds (like Fr television/Spn televisión *television'). Apocopations are 
common in several Romance languages, primarily in informal speech, and very 
common in spoken French, where they have phonotactic consequences when lead- 
ing to final stops (baccalauréat > bac ‘high school graduation’). For an overview 
on lexical typology, including word formation, see Koch (2001). 


5. Syntax: General remarks 


The traditional classifications leading to clear-cut, discrete subcategorizations of 
the Romance languages are primarily geographic and generally based on pho- 
netic and/or morphological features. More recent views prefer to consider the 
Romance languages as forming a "continuum with clines" (Bossong 2008: 273). 
We have seen that the general morphosyntactic evolution from Latin to Romance 
clearly allows us to distinguish a Romance type different from the type repre- 
sented by Latin (section 3). At first glance, this seems to be confirmed by syntac- 
tic properties, where the Romance type appears to develop towards (S)VO, to 
prefer analytic, periphrastic forms instead of morphological case marking, to 
show overt determiners, and to place nominal modifiers to the right of the noun. 
This overall view is certainly justified; upon closer inspection, though, arguments 
can be, and have been, presented for a subclassification into two fundamental 
syntactic types (what Körner 1987 called “A-languages” and "De-languages"). 
An exhaustive list of typological features justifying such a classification would 
include features such as: non-obligatory vs. obligatory subject clitics, a more 
flexible word order vs. strict SVO, differential object marking vs. partitives, par- 
ticiple invariability vs. participle agreement, generalized auxiliaries vs. auxiliary 
selection, no partitive clitics vs. partitive clitics, etc. Spanish and French might be 
regarded as the prototypical exponents of the two Romance sub-types, and it is 
worth observing that these features, although having medieval roots, emerged or 
were enhanced after the Middle Ages, with some of the changes still going on 
(see Figure 5): 
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Latin 


Romance 


Romance type A Romance type B 
(e.g. Spanish) (e.g. French) 


Figure 5: Typological shift from Latin to modern Romance 


However, a critical view will have to add two further aspects: On the one hand, 
departing from the “prototypes” Spanish and French, several Romance languages 
and varieties cannot easily be placed in either category. A differentiation into sev- 
eral continua would show a less clear, but more adequate picture (see the synoptic 
Table 5 at the end of this chapter). On the other hand, even if correlations between 
the different features are evident, it is much more difficult to find an overall expla- 
nation that would allow us to discover the deeper principles underlying all the dif- 
ferent historical developments. It has been claimed that if we look for a hierarchi- 
cal order among the various continua, we should consider different means of 
expressing transitivity (Hopper and Thompson 1980; Fiorentino 2003) along the 
ergativity-accusativity continuum as the leading principle. Other semantic cat- 
egories such as aspectuality or specificity also seem to play a crucial role. In sec- 
tions 5.1 to 5.4, we will take a closer look at the most prominent typological fea- 
tures marking the contrast between different Romance subtypes. 


Sd: Basic word order and alignment 


All Romance languages are basically nominative-accusative languages, coding 
the distinction between nominative and accusative predominantly by word order. 
There is a rich literature on the overall changes in basic word order from Latin 
to Romance. The general evolution, related to the loss of Latin case marking, is 
towards fixation of word order. Latin, despite having a rather free word order, 
tended to prefer (S)OV. For the contemporary Romance languages, it can be stated 
that SVO is the most generalized type; it is rather strict in French and tends to be 
clearly dominant in those languages and varieties that have obligatory, or almost 
obligatory, subject clitics. For Old French, it has been claimed that the functional 
principle underlying the different overt structures was TVX. For Spanish and 
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other Romance languages, V1-sentences are unmarked for certain constructions. 
If we consider (3) and (4), several observations on Romance word order can be 
made: 


(3) Spanish 
Juan  trabaj-a. 
John ` work-3sa.PRs 
‘John works / is working.’ 


(4) Entr-a Juan. 
come in-3sc.PRs John 
‘John is coming in.’ 


Both sentences represent unmarked cases in Spanish word order, and the syntactic 
difference between them has been accounted for in a series of theoretical ap- 
proaches. One of the basic assumptions is that word order in Romance is governed 
by information structure. As in most of the world's languages, rhematic elements 
tend to follow thematic elements, and (3) is interpreted as a sentence where the 
"new" and relevant information is John's action of working, whereas in (4) the rel- 
evant information is the person who comes in. But it would be too simple to restrict 
the basic word order difference to a purely discourse-pragmatic one: in fact, as 
has been claimed repeatedly, both sentences are semantically different. The prop- 
osition in (3) is of a “categorical” nature (i.e. it is a real proposition with two 
members), while it is of a “thetic” nature (with only one member) in (4) (Ulrich 
1985). The logical argumentation for universally different kinds of propositions 
that may lead to different expressions in the languages of the world distinguishes 
scene-like "events", such as a person coming in as in (4), from predications as in 
(3), where something is stated about someone. Following Burzio's (1981) claim, a 
lexical difference between three fundamentally different classes of verbs is con- 
sidered to be responsible for this syntactic differentiation (for discussion see 
Mackenzie 2006); the traditional distinction between transitive verbs like to build 
and intransitive verbs is reformulated with the addition of the distinction between 
two classes of intransitive verbs: unergative verbs like to sleep, where no agentive 
semantic subject can be identified, and ergative intransitive (or unaccusative) 
verbs like to arrive, where what is superficially a subject occupies the object posi- 
tion due to its being semantically a deep object of the verb in a (semantically) sub- 
jectless sentence (like Juan in example (4)). What can be claimed for modern Ro- 
mance — with the exception of French and French-type varieties — is that basic 
word order is SVO for transitive and intransitive-unergative sentences, and that 
there exists the possibility, in unaccusative constructions, of moving the (syntactic) 
subject into object position when no real "semantic subject" is present. 

Whereas in languages like Spanish this distinction is expressed by changes in 
word order, French has fixed word order but allows the choice between two differ- 
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ent auxiliaries: the auxiliary selected in unaccusative constructions is étre ‘to be’, 
while it is avoir ‘to have’ in unergative intransitive constructions. Consider the fol- 
lowing examples: 


(5) French 
Jean est arriv-é. 
John is  arrive-PsTPTCP 
‘John has arrived.’ 


(6) Jean a dorm-i. 
John has sleep-pstptcp. 
*John has slept.’ 


Spanish does not select different auxiliaries; other languages, like Italian or some 
Catalan varieties, combine both auxiliary selection and word order change. 
A series of further features can be related to this distinction, such as participle 
agreement, partitive object clitization (only possible with unaccusative intransi- 
tives in French and Italian, impossible with unergatives) and blocking of passive 
transformations with ergative verbs (Gabriel and Müller 2008: 64—70). Whilst 
the first and the second actant are marked positionally in Romance, the third actant 
(as other case functions) is marked by prepositions and thus behaves in a more flex- 
ible way with regard to its position. 


3:2; The noun phrase 


All Romance languages are article languages with a definite article that emerged 
from the Latin demonstrative ILLE (exceptionally, in Sardinian and in some Catalan 
dialects, from iPsE) and an indefinite article that emerged from the quantifier UNUS. 
The grammaticalization of the article seems to be a proto-Romance phenomenon 
since all Romance languages have article systems. What is frequently ignored is 
that the "positive" grammaticalization of the Latin demonstratives, as illustrated in 
(7), is not an isolated fact and leads, in turn, to the grammaticalization of zero as a 
marker for mass nouns, as in (8). 


(7) Spanish 
Com-o la/un-a manzana. 
eat-1sG.prs det[r]/one-F apple 
‘I eat the/an apple.’ 


(8) Com-o Ø manzana. 
eat-1sG.PRS DET apple 
‘I eat apple.’ [+mass.] 


This distinction makes it possible to mark syntactically the conversion of mass 
nouns into count nouns, and vice-versa (for partitives see 5.3.4). 
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Most of the Romance languages place determiners to the left of the noun. 
Only Romanian, in harmony with other Balkan languages and as a member of the 
Balkan Sprachbund (see chapter 16 by Tomié this volume), has an enclitic article: 
Rom om ‘man’ / omul ‘the man’ (but also: cel om) — Fr homme /l’ homme; It uomo 
/l'uomo. 

Attributive adjectives generally follow their head nouns; marked, pre-nominal 
adjectives tend to form units with the noun and to express idiomatic senses (Spn 
buen chico ‘good chap’), postnominal adjectives are of specifying nature (Spn un 
chico bueno ‘a boy who is good"), with some exceptions due to Germanic influence 
or due to a certain stylistic effect of the marked structure (Spn la moderna filosofía 
‘modern philosophy’, la madrilefia plaza mayor ‘Madrid’s major square’). 

Sentential attributes in the form of relative clauses follow the noun and are in- 
troduced either by neutral relative pronouns (que/che) or by relative pronouns with 
gender and number agreement (Fr lequel, Sp. el cual, It il quale etc., see also 5.4). 

Predicative nominals without copulas were common in Latin, but are rare in 
Romance. In most Romance languages, there is a general copula deriving from Lat. 
ESSE (with participles partly derived from STARE ‘to stand’, ‘to stay’). However, 
Ibero-Romance distinguishes two copular aspects, with a stative copula estar and 
an identifying copula ser, as in (9) and (10) respectively: 


(9) Spanish 
Juan es nervios-o. 
John cop.3sG.PRS nervous-M 
‘John is a nervous person.’ 


(10) Juan est-á nervios-o. 
John cop-3sG.PRS nervous-M 
*John appears to be nervous (in this moment).’ 


5.3. The verb phrase 


5.3.1. Subject clitics 


Like their Latin ancestor, the medieval Romance languages did not require prever- 
bal subject marking. However, in some Romance varieties, nominal or pronominal 
preverbal overt subjects became more and more obligatory, as in French. During 
the Middle French period, verbal and nominal endings got lost and the function of 
subject marking was more and more assumed by preverbal markers: Spn trabajo / 
It lavoro / Fr je travaille (see section 4.1). Obligatory subject marking can also be 
found in Rhaeto-Romance and Northern Italian and is becoming more and more 
common in spoken Brazilian Portuguese. A tendency towards obligatory subject 
markers has also been observed in Caribbean Spanish. Whereas some authors clas- 
sify French as a strongly analytic language, it has also been claimed since the early 
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1960s that French has rather developed towards a synthetic language after the clit- 
ization of the preverbal subject pronouns, the main difference to the other Ro- 
mance languages consisting in the pre-modification of French in contrast to South- 
ern and traditional Romance post-modification (see Weinrich 1962). In generative 
terms, this means that French would still be a pro-drop language (Kaiser 1992), 
with the former pronouns je, fu, il, elle, on, vous, ils being preverbal inflectional af- 
fixes that can be reinforced by diachronically more recent overt subjects moi, toi, 
lui/elle, nous, vous, eux: 


Old French: travaille » Middle French/standard written modern French je travaille (ob- 
ligatory pronoun) > modern spoken French moi, je travaille (with a tendency to phonic 
reduction of the “affix” je: mwaftr’vaj) 


Figure 6: Evolution of subject marking in French 


5.3.2. Differential object marking 


In his early 20th century ‘neolinguistic’ areal-typological differentiation Matteo 
Bartoli (1925) distinguished between “inner Romance" and “lateral Romance" lan- 
guages, the latter being diachronically more conservative since late innovations 
spreading from Rome did not affect these peripheral areas. Some lexical phenom- 
ena such as the existence of archaic words like Prt fermoso, Spn hermoso and Rom 
frumos, which contrast with It bello, Fr beau/bel, served to support this hypothesis. 
(Note that Spn bello is an italianism which was incorporated later.) 

A grammatical parallel between peripheral Ibero-Romance and Balkan-Ro- 
mance is the existence of differential object marking, as in (11) and (12): 


(11) Spanish 
a. ¿Has visto el tren? 
have.2SG.PRS see.PSTPTCP.M DET train 
“Have you seen the train?’ 
b. ¿Has visto a papá? 
have.2SG.PRS see.PSTPTCP.M DOM dad 
“Have you seen dad?’ 


(12) Romanian 
a. Ai väzut tren-ul? 
have.2SG.PRS see.PSTPTCP train-DET 
“Have you seen the train?’ 
b. L-ai väzut pe tata? 
OBJ-have.2SG.PRS see.PSTPTCP DOM dad 
“Have you seen dad?’ 


Differential object marking (henceforth short DOM) can also be found, with dif- 
ferent degrees of grammaticalization, in varieties of Occitan, in Southern Italian 
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varieties and in Sardinian. It is generally obligatory with indirect objects and 
serves to distinguish human or human-like animate direct objects from other ob- 
jects, allowing a series of subtle distinctions along the animacy-inamimacy con- 
tinuum. 

A phenomenon that has been related to DOM is the neutralization of the 
opposition between direct and indirect human pronominal objects and a preference 
for a human — non-human distinction (so-called leismo de persona) in some Euro- 
pean Spanish varieties (13a). By contrast, other European varieties and generally 
Latin American Spanish preserved the etymological distinction between male and 
female objects and indirect objects without gender distinction (13b). 


(13) a. European Spanish (with “leismo de persona"; accepted as standard in the 
male case) 
Le ve-o a Juan. Le ve-o a Maria. 
OBJ.HUM see-Isc.PRs DOM John OBJ.HUM see-1sG.PRS DOM Mary 
‘I see John. I see Mary.’ 
b. European Spanish (without “lefsmo de persona) / Standard American 


Spanish 

Lo ve-o a Juan. La ve-o a 
OBJ.ACC.M see-1sG.PRS DOM John OBJ.ACC.F see-1SG.PRS DOM 
Maria. 

Mary 


‘I see John. I see Mary.’ 


5.3.3. Clitic doubling 


An important typological characteristic of the Romance languages is the ten- 
dency towards clitization. Apart from articles, possessives, some prepositions 
and conjunctions, mainly object pronouns and, to a lesser extent (and in some 
languages) subject pronouns tend to appear as bound clitics. A phenomenon that 
like DOM can be related to the animacy hierarchy is clitic doubling, i.e. object 
clitics appear simultaneously with the full noun phrases they refer to (see 12b 
and 13). Clitic doubling is common in all those Romance languages and varieties 
that feature DOM and can further be found in colloquial Italian and in Northern 
Italian dialects. 


5.3.4. Partitives 


Those Romance languages that show DOM mark mass nouns in object position by 
Zero (see 5.2.). Ongoing grammaticalization of DOM parallels the loss of partitives 
in these languages. Other languages have overt partitive elements derived from the 
Latin preposition DE or the pronominal adverb INDE (Fr/Ctl en; It ne). Partitives are 
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characteristic of contemporary Gallo-Romance and Italo-Romance. In French, the 
partitive determiner functions as a plural indefinite article, and therefore consider- 
ably reduces pre-nominal zero marking in this language. According to the animacy 
hierarchy, partitives are linked to inanimate objects and yield a [+mass] interpre- 
tation of the objects. 


5.3.5. Passive 


In some Romance languages (e.g. French), a periphrastic “real passive" (like in 
14a) is more frequent than in other languages, like Spanish, where a “reflexive 
passive" (14b) is the most frequent form and “real passives” are practically re- 
stricted to written language. The "reflexive passive", only possible in third person, 
is used, above all, for impersonal expressions since it almost precludes the explicit 
mention of the agent. French, on the other hand, prefers impersonal active con- 
structions as in (14c): 


(14) a. French 


Le bátiment est construit au centre ville. 
ART.M bulding is  construct.PSTPTCP to.ART centre town 
b. Spanish 
El edificio se construy-e en el centro de 
ART.M building REFL build-3sG.prs in ART.M centre of 
la ciudad. 
ART.F town 
c. French 
On construit le bátiment au 
SBJ.N.3sG construct.3sG.PRS  ART.M building to.ART 
centre ville. 
centre town 


"The building is being constructed in the centre of town.’ 


5.3.6. Negation 


Basically three types of sentential negation exist in the Romance languages: a) pre- 
verbal negation, b) discontinuous negation with a pre- and a postverbal element 
and, finally, c) postverbal negation (Bernini and Ramat 1996: 17). Type a), 
NEG- V, can be considered the “common Romance ground”. It is found in Latin 
and in the medieval Romance languages, and still is the type to be found in pres- 
ent-day Southern Romance (Portuguese, Spanish, Italian and Romanian), where it 
is accompanied by negative concord in postverbal elements. “Discontinuous” ne- 
gation (type b) is generally a result of grammaticalization of postverbal reinforcing 
elements with an originally positive meaning: Fr/Ctl/Occitan pas, Piedmontese pd 
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(‘step’); Emilian, Lombard miga or mia, Ctl mica, Old French mie (‘crumb’); Pied- 
montese ren, Ctl res (‘thing’); Occitan ges, Ctl gens (‘people’), Fr point (‘point’). 
In spoken Brazilian Portuguese, discontinuous negation is common with repetition 
of the negation marker, in contrast to European Portuguese where preverbal ne- 
gation is the rule: 


(15) a. European Portuguese / Standard Brazilian Portuguese: 
Não quer-o. 
NEG want-1SG.PRS 
b. spoken colloquial Brazilian Portuguese: 
Não quer-o não. 
NEG want-1SG.PRS NEG 
‘I don’t want (it). 


This double negation with postverbal repetition is also common in several Ro- 
mance-based Creoles. The third type (c) is the result of the elimination of preverbal 
elements in type-b)-areas and coexists with type b) as a colloquial variant (see also 
chapter 15 by van der Auwera in this volume and van der Auwera 2009). 


(16) a. Standard French 
Je n’aime pas le concombre. 
sBJ.lsG neg=like.PRS NEG ART.M cucumber 
b. Colloquial French 
J'aime pas le concombre. 
sBJ.lsc-like.PRs NEG ART.M cucumber 
‘I don’t like cucumber.’ 


(17) a. Standard Italian (Bernini/Ramat 1996: 21) 
Ma non c’era niente da _ fare. 
but NEG it=was.3sG nothing from do.INF 

b. Colloquial Italian 
Ma c’era niente da fare. 
but it-was.3sc nothing from do.INF 
“But there was nothing we could do.’ 


5.4. Complex syntactic structures 


Clause-linking in Romance is carried out via different conjunctional morphemes, 
some of which evolved from Latin conjunctions whereas others were newly cre- 
ated in later periods. As in other European languages, many of these morphemes 
are used both as conjunctions and interrogative markers and are therefore grouped 
under the common label of wh-words. In the case of syntactic subordination, tradi- 
tional grammar distinguishes between complement clauses, relative clauses and 
adverbial clauses. For complement clauses, a pan-Romance complementizer mor- 
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pheme que (che, cd, etc.) is used, which is a conflation of the Latin subordinating 
morphemes QUEM, QUOD and Quia. Complement-clause formation being a way of 
nominalizing sentential strings in order to use them as verbal or nominal comple- 
ments, the complementizer que, in a number of Romance languages, must be pre- 
ceded by a preposition according to the valency of the governing matrix-clause el- 
ement, as in (18a-b) from Spanish: 


(18) Spanish 

a. Tiene la convicción de que Dios 
hold.3sc.PRs ART.F conviction. from coMP God 
exist-e (cf. tiene la convicción de esta cosa) 
exist-3SG.PRS 
‘he is convinced that God exists.’ 

b. Me acuerd-o de que es su 
REFL remember-lsc.PRs from comp is Poss.3 
cumpleaños (cf. me acuerdo de esta cosa) 
birthday 
‘I remember that it is his / her birthday.’ 


In colloquial varieties of Spanish, but also in some other Romance languages, there 
is a tendency to extend this type of prepositionally supplemented complementation 
to verbs which do not call for it from a valency point of view. This tendency, called 
dequeísmo in Spanish and rejected by normative grammar, is considered to express 
certain modalizing nuances. Consider (19): 


(19) Spanish 


Dice de que es el cumpleafios de su 
say.3sG.PRs from COMP is ART.M birthday of Poss.3 
hermano (cf. Zdice de esta cosa) 

brother 


‘he says that it is his brother's birthday.’ 


In some Romance varieties, including several Italian dialects, dual complemen- 
tizer systems (i.e. que/che altering with another subordinating morpheme in the 
same slot) and complementizer-doubling, as illustrated by (20a—b) from Roman- 
ian, are attested. These phenomena of complementizer alternation correlate with 
mood contrasts and mirror different modal values of the subordinate clause. 


(20) Romanian 
a. Zicea cd máine nu se duce la cámp. 
say.3SG.PRS COMP tomorrow NEG REFL g0.3sG.PRS(IND) to field 
‘he says that he doesn't go to the field tomorrow.’ 
b. Zicea că mâine să nu se ducă la câmp. 
say.3SG.PRS COMP tomorrow COMP NEG REFL g0.3SG.PRS.SBJV to field 
‘he says that tomorrow he should not go to the field.’ 


The Romancelanguages 89 


Complementizer deletion, on the other hand, is less frequent than in Germanic lan- 
guages such as German or English and often controversial in normative terms but 
still found in several Romance languages. 

Adverbial clauses are introduced by different conjunctions according to the 
semantics of the subordinate clause (cause, purpose, result, temporal or locative 
meaning). Some of these conjunctions have evolved directly from the correspond- 
ing Latin wh-words (e.g. Lat QUANDO > Spn cuando, Prt/It quando ‘when’; Lat 
QUARE [< QUA RE ‘which thing’] > Fr/Ctl car ‘because’, Lat UNDE > Fr ou, Ctl on) 
but many other Latin forms have been replaced in medieval and modern times by 
complex conjunctions based on prepositional, nominal, adverbial or — less fre- 
quently — verbal elements and the complementizer morpheme que, such as Fr afin 
que ‘in order that’ (lit. ‘to the end that’), Ctl tot i que ‘although’ (lit. ‘all and that’), 
Spn a pesar de que ‘although’ (lit. ‘at weighing of that’), It allorché ‘when’ (lit. ‘at 
the hour that’). Some colloquial varieties of French in Canada tend to add the com- 
plementizer que even to simple, etymological adverbial conjunctions, yielding 
forms like guand que, ou que, etc. 

As for relative clause formation, Romance shares with many other European lan- 
guages the typologically rather infrequent feature of relativizing conjunctional el- 
ements that carry inflectional features of the head noun in the matrix clause and, at the 
same time, inflectional features indicating the syntactic function of this noun in the 
subordinate clause (Cristofaro and Giacalone Ramat 2007). Such ‘true’ (dedicated) 
relative pronouns are therefore both anaphoric and cataphoric. In (21a), for instance, 
Fr duquel shows gender and number agreement with the head noun le journaliste and 
indicates that this noun functions as an oblique argument of the verb recevoir in the 
relative clause. With the alternative form de qui, illustrated in (21b), the anaphoric 
pronominal value corresponds to the indication of the animacy of the head noun: 


(21) French 

a. Le journaliste du-quel jai 
ART.M journalist from.ART.M-Which.M sBJ.1sG=have.1sG.pRs 
reçu cette information est normale-ment bien 
receive.PSTPTCP dem.F information is normal.F-ADv_ well 
renseigné. 
inform-PSTPTCP 

b. Le journaliste de qui j'ai 
ART.M journalist from who ssBj.lsc-have.l1sc.PRs 
reçu cette information est normale-ment bien 
receive.PSTPTCP DEM.F information is  normal.F-Apv well 
renseign-é. 
inform-PSTPTCP 
*the journalist from whom I have received this information is normally 
well informed.’ 
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However, upon closer inspection the cataphoric function (i.e. that of a syntactic 
slot-filler) turns out to be fulfilled by the prepositional element de, whereas the pro- 
nominal element (lequel/qui) is in charge of the anaphoric function. This holds pri- 
marily for those cases where the relativized element has a syntactic function which 
ranks low on the Noun Phrase Accessibility Hierarchy for relativization (cf. Kee- 
nan and Comrie 1977). In spoken Romance varieties, this tendency to disentangle 
and morphologically separate the anaphoric and the cataphoric function frequently 
leads to the use of the above-mentioned generalized neuter que as a marker of sub- 
ordination and as a syntactic link with the head noun, while the syntactic slot in the 
dependent clause is filled by a separate (non-relative) pronoun, as in (22): 


(22) a. Colloquial French 


C'est le seul prof qu'on peut 
it-is  ART.M only teacher REL=SBJ.N.3SG can.3SG.PRS 
lui faire | confiance. 


OBJ.DAT.3sG do.iNF confidence 
b. Standard French 


C'est le seul prof a qui on peut faire 
it=is  ART.M only teacher to who sBJ.N.3sG can.3sG.PRs do.INF 
confiance. 
confidence 


‘that’s the only teacher you can be confident in.’ 


Que thus plays an important role in relative-clause formation, too, although the 
array of functions varies considerably between languages and registers. In Stan- 
dard French, relative que can mark the relativized item as being the second actant 
only (in opposition to bare qui, indicating first-actant status). In Spanish or Italian, 
que/che alone may relativize both syntactic functions ranking uppermost on 
the Accessibility Hierarchy. For relativizations of lower-ranked syntactic functions 
the above-mentioned disentangling strategies involving que are frequently used. 
To sum up, the complementizer/relativizer que must be attributed a crucial role as a 
subordinating device in Romance. 

The second central device for marking syntactic (and ensuing semantic/prag- 
matic) subordination in Romance is the use of the subjunctive mood. As compared 
with Classical Latin, the subjunctive paradigm of Romance has undergone heavy 
restructuring and simplification (see 4.1.1). On the one hand, many subjunctive 
forms have become formally identical to indicative forms, with the notable excep- 
tion of irregular high-frequency verbs. On the other hand, only two tense forms 
survive, namely the present and the past subjunctive, which are of variable vitality 
in the different Romance languages and / or according to whether written or 
spoken language is taken into account. However, contrary to English and other 
Germanic languages, the subjunctive is an important feature in Romance and its 
loss from the verbal system is far from being imminent. Its distributive patterns are 
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complex and, furthermore, vary remarkably between languages. Generally speak- 
ing, the subjunctive is a modalizing device through which the speaker makes 
manifest that a statement depends on some external fact or subjective point of 
view, which then is encoded through syntactic dependency. In languages that 
closely stick to the Romance type, the subjunctive still has a certain degree of 
pragmatic availability and optionality, which may lead to variable use along with 
indicative or conditional verb forms, depending on the pragmatic context. How- 
ever, in all Romance tongues there is a tendency towards fixation of the subjunc- 
tive after certain triggers in the matrix clause. Again, French proves to be most ad- 
vanced in this process since it has almost fully grammaticalized the obligatory use 
of the subjunctive as a means of indicating syntactic subordination after a closed 
inventory of matrix-clause triggers. Only a limited degree of variability remains, 
e.g. in restrictive relative clauses where the subjunctive is used to distinguish 
between a factive and a non-factive reading (a usage found in other Romance lan- 
guages, too): 


(23) French 

a. Je cherche un étudiant qui sait 
sBJ.lsG search.PRs one(M) student who know.sG.PRS(IND) 
parl-er | chinois. 
speak-ınr Chinese 
‘I am looking for a student who speaks Chinese.’ 

b. Je cherche un étudiant qui sache 
sBJ.lsG search.PRs one(M) student who know.PRs.SBJV 
parl-er | chinois. 
speak-INF Chinese 
‘I am looking for a student who speaks (= should be able to speak) 
Chinese.’ 


Syntactic dependency correlates with reduced finiteness features of the verbal el- 
ement in the subordinate clause (Raible 1992). As mentioned before, the Ro- 
mance subjunctive, as a verbal paradigm typical for subordinate clauses, cannot 
express the same array of tense forms as the indicative and therefore lacks some 
finiteness features. Frequently, however, sentential subordination is replaced by 
even less finite verbal forms, namely infinitives and gerunds / present participles 
(the latter two being formally undistinguishable for most regular verbs). The 
replacement of restrictive relative clauses by an uninflected participle/gerund, 
well-known from English, occurs frequently in French (cf. 24) but remains mar- 
ginal in other Romance languages. However, on top of that the gerund often 
replaces adverbial clauses expressing simultaneous action and manner / method, 
but also — particularly in Ibero-Romance — purpose and cause, as in (25) from 
Spanish: 
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(24) French 
Je cherche un étudiant sachant 
SBJ.lsG search.PRs one(M) student know.GER 
parl-er ` chinois (= qui sache / sait parler chinois; cf. 23) 
speak-INF Chinese 


(25) Spanish 


Siendo el responsable, tengo que | preocup-ar-me 
be.GER ART.M responsible hold.1sG.PRS COMP wWorry-INF-REFL 
de esto. 
from DEM 


“since I am the responsible person, I have to be worried about this.’ 


The infinitive replaces a finite subordinate clause if the logical subjects of the sub- 
ordinate clause and of the matrix clause are identical (cf. 26); this, however, does 
not hold true for Balkan-Romance and Southern Italian dialects, where due to Bal- 
kan-Sprachbund influence (see 5.2) the infinitive is rarely used. 


(26) Catalan 

a. Prefereix torn-ar a casa. 
prefer.3sG.prs return-INF to house 
‘he prefers to go back home.’ 

b. Prefereix que torn-i a casa. 
prefer.3sG.PRS COMP return.PRs.sBJV(3sG) to house 
‘he, prefers that he, goes back home.’ (but not: ‘he; prefers that he; goes 
back home’) 


Since syntactic subordination always is atype of nominalization, the infinitive may 
display nominal features when used instead of a subordinate clause. This is the 
case in Spanish where temporal adverbial clauses may be replaced by an infinitive 
preceded by the preposition a and the enclitic definite article (cf. 27a). This con- 
struction may even appear with non-coreferential subjects in the matrix and the 
subordinate clause, as in (27b): 


(27) Spanish 

a. A=l lleg-ar a casa, me di cuenta de mi error 
at=ART.M arrive-INF to house REFL gave.lsG account from my error 
‘when I arrived at home, I became aware of my mistake.’ 

b. A=l lleg-ar la policia a casa, me di cuenta 
at=ART.M alrive-INF ART.F police to house REFL gave.lsc account 
de mi error. 
from my error 
“when the police arrived at home, I became aware of my mistake.’ 


The Romance languages 93 


Even though the subjunctive may occasionally be used in main clauses, it is gen- 
erally a good criterion to attribute subordinate status to the clause it appears in. 
European Portuguese, Galician and Asturian offer another morphosyntactic cri- 
terion for the main-clause / subordinate-clause distinction, comparable e.g. to the 
word order variation between main and subordinate clauses in German: in these 
Ibero-Romance languages, clitic pronouns are enclitic in main clauses and proclitic 
in subordinate clauses (cf. 28). However, this general enclisis/proclisis distribution 
may be altered by preposed sentence adverbs, negation markers and other focus- 
attracting or stance-expressing elements. 


(28) Portuguese (European) 
a. Ele compr-a-o hoje mesmo. 
SBJ.3SG.M buy-3SG.PRS=0BJ.3sG today REFL 
‘he buys it today’ 
b. Quer-o que ele o compre hoje mesmo. 
want-1SG.PRS COMP SBJ.3sG.M OBJ.3SG buy.PRS.SBJV.3sG today REFL 
‘I want that he buys it today.’ 


5.4. Synopsis of some typological features 


Table 5 concludes this chapter with a synopsis of major typological features of the 
Romance languages. 


Table 5: Romance: synopsis of major typological features 


Feature > 1 2 3 4 5 6 7 à & 
Language 4 

Romanian + + = - - 4 " = E 
Italian = = + + = + + —I(+) = 
Rhaetoromance - (+) +/— + +. +/: /(+) 
Sardinian = + —/(+) + E + + ES SS 
Catalan (+) /(+) + + + Ss 
Spanish = + = = _ + + + + 
Galician - 4 - E = + + + + 
Portuguese — + — — — + + + + 
Occitan - (+) + + - + +/— E (+) 
French - = + + + 


SV-VS-variation 
Se-passives 

Be/stay-copula 

Simple past tense dominant 


1 Morphological case marking 
2 DOM 

3 Partitives 

4 Auxiliary type: be/have 

5 Overt subject clitics 


VD OND 


+ = general; — = inexistent; (+) = rare; +/— = existent 
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See also the following chapters in this volume: 15 by van der Auwera, 16 by 
Tomić, 18 by Sansó and 28 by Murelli and Kortmann. 
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1. Introduction 


The Slavonic! languages form a branch of the Indo-European language family 
spoken in Central and Eastern Europe, the Balkans, and parts of Northern Asia (see 
map 1). The first documented texts written in a Slavonic idiom date from to the 9th 
century AD. 

With a total of around 290 million native speakers, this genetic family consists 
of 16 recognized languages, some of which form a dialectal continuum. The family 
is traditionally divided into the three sub-groups West, East, and South Slavonic, as 
presented in Table 1. This classification is based on diachronic phonological crite- 
ria correlating with areal distribution. 

From a sociolinguistic point of view, the languages can be divided into fully- 
fledged standard languages functioning as official state languages — Polish, Czech, 
Slovak, Belarusian, Russian, Ukrainian, Slovenian, Serbian/Croatian/Bosnian, 
Macedonian, and Bulgarian — and languages with a lower degree of standard- 
ization. Kashubian, Upper and Lower Sorbian, and Rusyn are less used languages 
spoken by small ethnic minorities and do not have the status of state languages. 
Kashubian, which is spoken in Northern Poland near Gdansk, does not cover all 
domains of communication and thus lacks some registers. In contrast, the two Sor- 
bian languages, spoken in Eastern Germany, are fully codified and have developed 
terminologies for different registers but suffer from a relatively low vitality and are 
therefore considered endangered languages. Rusyn is a cover term for different 
varieties spoken mainly in Eastern Slovakia, Western Ukraine, Southern Poland, 
and Northern Serbia (Vojvodina) whose speakers are still striving for a commonly 
accepted standard. The Vojvodina variety of Rusyn has received official recogni- 
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Map 1: The Slavonic languages in Europe 


tion and has gone through the stages of elaboration and implementation in schools 
and can thus be considered the most standardized variety. Some comments are in 
place concerning the language labelled Serbian/Croatian/Bosnian because its name 
and its status still spark nationalistically flavoured debates both among speakers 
and scholars. This language, which in the time of the existence of the super- 
national state of Yugoslavia was called Serbo-Croatian, was the result of the 
merger of the historically independent languages of the Serbs and Croats which 
since the 1970s showed accelerating centrifugal tendencies, after the break-up of 
Yugoslavia leading to the political recognition of the three official state languages 
Serbian, Croatian, and Bosnian. Whereas the first and the second ones are clearly 
distinguishable variants, the elaboration and implementation of distinct Bosnian 
language norms are still under way. From a purely linguistic point of view Serbian/ 
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Table 1: Subgroups and number of speakers 


Genetic Language Language name in Number of speakers 
subgroup (alternative spelling the local language ` (first language excluding 
or name) diaspora)? 
West Polish jezyk polski 38 Mio 
Slavonic Czech čeština 9.7 Mio 
Slovak slovenčina 4.5 Mio 
Kashubian (Cassubian) kaszëbsczi jăzëk 50.000 
Upper Sorbian hornjoserbska rěč 10.000-15.000 
(Lusatian) 
Lower Sorbian dolnoserbska réé 4.500-7.000 
(Lusatian) 
East Russian PYCCKUM A3bIK 170 Mio 
Slavonic Ukrainian yKpaiHCbKa MOBa 37-40 Mio 
Belarusian 6eJlapycKası MoBa 7 Mio 
(Belorussian) 
Rusyn pycKu s34K / pycka figures vary between 
6eIlena 100.000 and 895.000 
South Serbian/Croatian/ srpski / hrvatski / 15 Mio 
Slavonic Bosnian bosanski jezik 
CPIICKH jeauk 
Bulgarian O'bIrapcKu e3UK 7.3-8.5 Mio 
Slovene (Slovenian) slovenščina 2 Mio 
Macedonian MAaKEJIOHCKH A3MK 1.3 Mio 


Croatian, however, can still be treated as one polycentric language the varieties of 
which differ considerably in lexis but only marginally in phonology, morphology, 
and syntax. Another distinguishing feature is the use of the alphabets: whereas 
Croats and Bosnian Muslims make exclusively use of Latin script, Serbs use both 
Latin and Cyrillic script. Considerable efforts of Croatian language planners not- 
withstanding, both variants are still mutually comprehensible to speakers of the 
other variant. Slavonic languages are nowadays written in two distinct scripts: the 
Latin and the Cyrillic alphabet. The Slavic nations historically oriented towards 
the West Roman Empire use the Latin alphabet (West Slavonic, Slovene, and Croa- 
tian) whereas the predominantly Orthodox nations use Cyrillic (Russian, Ukrai- 
nian, Belarusian, Rusyn, Bulgarian, and Macedonian). As mentioned above, Ser- 
bian is written in either Cyrillic or Latin, which is quite unique among the nations 
of Europe (cf. also chapter 49 by Voss, this volume). 

The Slavonic languages are relatively closely related to each other, which leads 
to a certain degree of mutual comprehensibility among neighbouring languages. 
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A characteristic feature of the Slavonic-speaking world is the tendency since 
World War II towards the creation of new standard varieties, a process which 
is connected to language-based nation building processes in Central and South 
Eastern Europe. This tendency is reflected in the accelerating divergence between 
Serbian, Croatian, and Bosnian, on the one hand, and various attempts to create 
new micro-languages, e.g. in independent Montenegro or in Silesia, on the other 
hand. 

As the Slavonic speaking countries have a strong tradition in philological 
research and there are many centres of Slavonic studies in Western Europe and the 
US, the Slavonic languages belong to the most thoroughly documented and de- 
scribed languages in the world, both in synchrony and diachrony (for an overview 
see Kosta et al. 2009); most linguistic studies, however, are published in the local 
languages in local publishing houses. Important contributions to Slavonic lin- 
guistics, especially in the field of language history, have been published in Ger- 
man, less so in English. Survey works in English covering the whole of the Slav- 
onic language family are Sussex and Cubberley (2006), Comrie and Corbett 
(1993), and Schenker and Stankiewicz (1980).3 Finally, it is worth mentioning that 
Slavists have made considerable contributions to the development of linguistic 
thought, e.g. Neogrammarians like August Leskien or members of the Prague Lin- 
guistic Circle (e.g. Roman Jakobson and Nikolaj Trubetzkoy). 


2. Phonetics and phonology 


The sound systems of the individual Slavonic languages show considerable vari- 
ation. One of the few common features is the fact that consonant phonemes in all 
languages outnumber vowel phonemes. The vowel systems are mainly based on 
the two oppositions ‘front vs. back’ and ‘high vs. mid vs. low’ vowels. All lan- 
guages have /1 e a o u/; in Czech, Slovak, Serbian/Croatian, and Slovene, the in- 
ventory of vocalic phonemes is increased by the distinction +length: the Czech 
Serbian/Croatian and to a certain degree in Slovene length interacts with the tonal 
opposition of rising vs. falling tone which can be illustrated by the Serbian/Croa- 
tian word sela: [sèla] ‘village.GEN.sG’ with a short rising vs. [s€la] ‘village.NoM.PL’ 
with a short falling tone.* Finally, it is worth mentioning that the vowel system can 
be enriched by (i) diphtongs, e.g Czech /ou/ or Slovak /uo/, (ii) the nasals /&/ and 
/D/ as found in Polish phonetically realized as diphthongs [e] and [DW], (iii) syl- 
labic liquids /r/ and /1/ found in Czech, Slovak, and Macedonian (in Serbian/Croa- 
tian only /r/); e.g. Slovak krk ‘neck’ or vik ‘wolf’. In Slovak the syllabic liquids 
show length distinction as in vrch ‘throw’ vs. v/sok ‘hill’. Finally, it is worth men- 
tioning that Slovak has the specific low front vowel /æ/ and Bulgarian a centralized 
vowel in stressed syllables /9/. 
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On the phonetic level, some languages make a qualitative contrast between 
stressed and unstressed vowels. This especially holds for Russian, Belarussian, 
Slovene, and Bulgarian. 

Slavonic languages possess very different consonant systems depending on 
systemic correlations. In a restrictive analysis, the number of consonantal pho- 
nemes varies between around 24 to some thirty odd elements. Table 2 is the 
example of a relatively small consonant system (Serbian/Croatian) whereas table 3 
shows one of the larger systems (Russian): 


Table 2: A ‘small’ consonant system (Serbian/Croatian) 


Bilabial Labio- Dental Alveolar  Post-Alveolar Palatal Velar 
dental 


Plosives p b t d k g 
Affricates ts dz t & te dz 
Nasals m n n 
Fricatives fv sz pre x 
Trill r 

Approximant j 

Lateral 1 A 


Table 3: A ‘large’ consonant system (Russian) 


Bilabial Labio- Dental Alveolar  Post-Alveolar Palatal Velar 


dental 
Plosives -pal p b t d k g 
t+pal p bi t di 
Affric. —pal ts ff 
+pal 
Nasals —pal m n 
+pal m ni 
Fricat. -pal fv sz fS 3 x 
+pal Bow s ai D a 
Trill —pal r 
+pal ri 
Approx. j 
Lateral —pal l 


+pal li 


102 Björn Hansen 


The systems are characterized by the systemic correlations like +voiced (e.g. [p] 
vs. [b]) and - in East Slavonic, Polish, and Sorbian — +palatalized which is a fea- 
ture of secondary articulation ([p] vs. [p/]). For recent studies on the phonological 
typology of the Slavonic languages see Stadnik (1997) and Sussex and Cubberley 
(2006). 


3. Morphology 


Morphologically, the Slavonic languages behave like typical synthetic languages, 
sharing many features with earlier stages of other Indo-European languages such 
as Latin and Ancient Greek. We find prefixes, suffixes, infixes, endings, and post- 
fixes (circumfixes are the exception). The paradigms are characterized by a high 
number of morphonological alternations and portmanteau morphemes. The Slav- 
onic morphological systems make a clear distinction between inflectional and deri- 
vational categories. 

The individual languages have rich morphologically marked word class sys- 
tems very much reminding of the ones of other Indo-European languages. The 
traditional classification is based on the distinction between inflecting vs. non-in- 
flecting word classes. Inflecting word classes are verbs, nouns, adjectives, personal 
pronouns, and numerals. The set of non-inflecting word classes comprises adverbs, 
conjunctions, prepositions (some rare cases of postpositions), and the heterogen- 
eous class of so-called particles (i.e. utterance modifiers, analytical grammatical 
markers, and interjections). As expected, all word classes show a radial structure 
with typical members in the core and atypical ones at the periphery. One of the bor- 
derline cases is the group of lexemes which in Russian grammaticography is called 
kategorija sostojanija ‘category of state’ and in Polish grammars sometimes cza- 
sowniki niewlasciwe ‘pseudo-verbs’. These elements morphologically behave like 
uninflected adverbs but syntactically like verbs because they are restricted to the 
predicate position. In all Slavonic languages we find instances of the full or partial 
transition of elements from one word class to another; e.g. converbs > adverbs; ad- 
jectives > nouns etc. 


3.1 Inflectional morphology 


3.1.1. The verb 


The finite forms of the Slavonic verb are marked for the following grammatical 
categories: aspect, tense, mood, voice, number, person, and gender. Apart from 
that, most languages have adjectival participles, converbs, and an infinitive (in 
Bulgarian only relics and in Macedonian absent). In Polish, the formation of de- 
verbal nouns is so regular and encompasses practically the whole class of verbs 
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that it is justified to treat it as a grammatical category of the verb (e.g. czyta-c read- 
INF ‘to read’ > czyta-ni-e read-NMLZ-NOM.SG ‘the reading’). 

All Slavonic languages distinguish the unmarked present tense, past, and future 
tense. Some languages additionally have resultatives, perfects, pluperfects, pasts 
in the future, and some other categories. The past tense, the perfect, and the future 
tenses go back to auxiliary constructions described in chapter 4.1. Sorbian and the 
South Slavonic languages in addition retain the synthetic tense-aspect forms aorist 
and imperfect inherited from Proto-Slavonic. In Ukrainian, we also find a synthetic 
future tense which developed through the erosion and cliticization of the verb ‘to 
have’: pysa-ty-mu write-INF-FUT.1scG ‘I will write’. 

One of the typologically prominent features of the Slavonic verb is the cat- 
egory of aspect which is based on the dichotomy ‘perfective’ vs. “imperfective’. 
According to Dahl (1985), Slavonic aspect is typologically unique in its specific 
combination with tense and in its morphology. In most aspect languages, like e.g. 
French, the aspectual opposition is restricted to the past tense (tripartite system): 


Table 4: The tripartite aspect system 


French past non-past 
perfective il partit (passé simple) il partira (future) 
imperfective il partait (imparfait) 


The time reference of Slavonic aspect, however, is less restricted: The opposition 
perfective vs. imperfective is found both in past and in non-past tenses. 


Table 5: | Slavic-style aspect 


Russian past non-past 
perfective on s-delal on s-delaet 
imperfective on delal on delaet 
(delat‘ pry / sdelat’,., “to do’) 


It is worth mentioning that the South Slavonic languages have developed the 
Slavic-style aspect while retaining elements of the old Proto-Slavonic tripartite 
system, based on the inflectional categories ‘imperfect’ and ‘aorist as mentioned 
above. A second peculiarity of ‘Slavic-style aspect’ is its morphological marking: 
Being a highly grammatical, i.e. obligatory category of the verb, it is not expressed 
by inflectional but by derivational means (see section 3.2). In contrast to e.g. Ro- 
mance languages, aspect is marked both in finite and non-finite forms. Aspect com- 
bines with tense in a specific way; both aspects have morphologically unmarked 
present tense forms; whereas imperfective present refers to the time of speech, the 
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perfective present form in West and East Slavonic has future time reference; thus, 
Polish prze-czyta-mppy PFv-read-1sc means ‘I will read’. Another peculiarity is 
found in the analytical future (see section 4.1) which in these languages is formed 
exclusively from imperfective verbs (Czech bud-u  psd-tppy, FUT-1SG 
write[IPFV]-INF but *bud-u na-psa-tppy FUT-1SG PFV-write-INF ‘I will write’). Both 
aspects regularly express past tense which is formed by the addition of the suffix -I- 
plus a gender-number agreement marker (see section 4.1). 


Russian 

(1) Ver-a Cita-l-a knig-u. 
Vera-NOM.SG read[IPFV]-PST-FEM.SG book-Acc.sG 
“Vera was reading a book.’ 


(1’) Vera pro-cita-l-a knig-u. 
Vera-NOM.sG PFV-read-PST-FEM.SG book-Acc.sG 
“Vera has read through the book.’ 


Slavonic aspect mainly expresses the completion or non-completion of the action 
but can assume further non-aspectual functions. In the imperative, for example, it 
can be used to distinguish different shades of politeness. There is also a specific in- 
teraction with modal notions; e.g. the perfective present tense form can render the 
modal meaning of (in)capability: Russian ne skaz-uppy ‘I can’t say’. A close inter- 
action with aspect is also found in the voice system; e.g. Russian perfective verbs 
form their passive with a participle plus auxiliary whereas imperfective ones make 
use of a postfix derived from a reflexive marker; cf.: 


Russian 

(2) Knig-a by-l-a pro-cita-n-a. 
Book-NOM.SG AUX-PST-SG.F PFV-read-PTCP-F.SG 
"The book had been read through." 


(2) Knig-a Cita-l-a-s’. 
Book-NoM.sG_ read[IPFV]-PST-F.SG-REFL 
“The book was being read.’ 


Mood comprises the subcategories indicative, imperative/hortative, and condi- 
tional; the indicative coincides with the set of tense forms, i.e. indicative as such 
is morphologically unmarked. The imperative of the 2nd ps. sg. is usually formed 
by 8-Suffix and 2nd ps. pl. by the addition of a personal ending; Polish, Czech, 
and Serbian/Croatian also have a morphological imperative/hortative of the 1st 
ps. pl. 

Polish 

(3) czytaj-ø! “You guy, read!’ 

read-IMP.2SG.FAMILIAR 
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(3°) czytaj-cie! “You guys, read!’ 
read-IMP.2PL.FAMILIAR 


(3") czytaj-my! ‘Let’s read!’ 
read.ımPp-1PL 


In verbal forms, gender is marked only in the past tense, the conditional, and the 
perfective passive constructions, all of which historically go back to an auxiliary 
plus adjectival participial construction to be described in section 4.1. 


3.1.2. The nominal word classes 


The Slavonic noun is marked for the grammatical categories gender, number, and 
case. Bulgarian and Macedonian noun phrases in addition are obligatorily marked 
for definiteness (see section 4.2). 

All Slavonic languages have gender, which forms a grammatical category not 
only of the noun but also of the adjective, the pronoun, and partially the verb and 
the numeral. Gender divides nouns into different agreement classes. An agree- 
ment class is defined as “a set of nouns such that any two members standing in any 
grammatical form require that any given agreement target in any agreement 
domain must take the same form" (Corbett 1988: 2). All languages show a divi- 
sion into feminine, masculine, and neuter gender; all of them, except Bulgarian 
and Macedonian, exhibit the additional subgenders animate resp. «personal. 
Subgenders are defined as "agreement classes which control minimally different 
sets of agreements" (Corbett 1988: 5); e.g. Serbian/Croatian prijatelj ‘friend’ and 
zakon ‘law’ differ in the accusative singular only. The range of animate gender 
differs from one language to another; it can include some referentially inanimate 
nouns like Russian fuz ‘ace’ or Polish names of dances (e.g. polonez). Whereas 
gender is an inherent property of every noun, in adjectives and the other parts of 
speech it functions as an agreement marker. Nouns in one and the same meaning 
usually do not show variation in gender. One exception in this respect is the so- 
called depreciative use of the masculine non-personal subgender for nouns denot- 
ing men in Polish. These nouns usually belong to the masculine personal sub- 
gender; cf.: 


Polish 

(4) Ci chlop-i cal-y 
this.NOM.PL.MPERS farmer-NOM.PL.MPERS whole-ACC.SG.M 
dzien spa-l-i! 
day.ACC.SG.M sleep-PST-NOM.PL.MPERS 


"These farmers have been sleeping the whole day long.’ 
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(4) Te chtop-y 
this.NOM.PL.NON_MPERS farmer-NOM.PL.NON_MPERS 
caty dzien spa-t-y! 


whole-Acc.sG.M day.ACC.sG.M sleep-PST-NOM.PL.NON, MPERS 
"These silly farmers have been sleeping the whole day long!’ 


Another peculiarity is found in some Serbian/Croatian archaisms which have a 
different gender in the singular and the plural: sluga ‘servant’ (sing. masc) — slüge 
(pl. fem.). 

Slavonic gender is morphologically linked to the categories of number and 
case. The three genders ‘masculine’, ‘feminine’ and ‘neuter’ and the subgenders 
*zanimate' are found in the singular of most Slavonic languages but cannot be con- 
veyed in all cases. According to Stankiewicz (1968: 31), the implementation of the 
gender oppositions in the singular shows the following distribution: 


Table 6: | Gender-Case distribution 


Genders Cases 
Accusative Nominative Oblique Cases 
1. Fem/Non-Fem + + dh 
2. Neut/Non-Neut + + = 
3. Inan/Anim + = = 


In Slavonic the meaning of a noun is not sufficient to determine its gender; the 
gender assignation is thus determined by the interaction of semantic and morpho- 
logical information. Most nouns denoting males are masculine and belong to the 
typically masculine declension class. In some cases the two types of assignment 
rules are in conflict (Corbett 1988: 14): Nouns like Russian muzcina ‘man’ denote 
males but follow a predominantly feminine declension pattern. In these cases the 
semantic assignment rule overrules the morphological one. About these ‘hybrids’ 
see section 4.3. 

Belonging to the synthetic morphological type, all Slavonic languages except 
Bulgarian and Macedonian possess a rich case system, i.e. a system of marking de- 
pendent nouns for the type of relationship they bear to their heads. In Slavonic, case 
is a grammatical category of the noun, adjective, pronoun, numeral, and verb (parti- 
ciple). The majority of Slavonic languages have six cases: Nominative, Genitive, 
Dative, Accusative, Instrumental, and Prepositional case. In addition, many of them 
have a Vocative marker, which is not a case sensu stricto. Apart from that, there is a 
series of markers which meet the structural criteria for being treated as cases in their 
own right but are restricted to small sets of nouns. Examples are the locative and the 
partitive in Russian (see Zaliznjak 1967). In contrast, Bulgarian and Macedonian 
have undergone considerable grammatical change in the direction of the analytical 
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type, which has led to the loss of nearly all cases. There are only few relics of the op- 
position between nominative vs. oblique cases in personal pronouns and some nouns. 

Number is a grammatical category of the Slavonic noun, the adjective, the pro- 
noun, the verb, and partially the numeral. Number in Slavonic covers the whole 
Animacy Hierarchy (Corbett 2000: 56), i.e. it is regularly expressed in pronouns, 
kinship terms, human nouns, animate nouns, and inanimate nouns. The Slavonic 
languages have number systems consisting of two or three subcategories. All of 
them distinguish singular and plural; Slovene and Sorbian, like the extinct lan- 
guage Old Church Slavonic, additionally have the dual. 

Slavonic pronouns and the agreeing verbs distinguish first, second, and third 
person; only the second person is marked for social distance. In all Slavonic lan- 
guages the distant form of address (the so called “polite form’) is formed by the 
second person plural like in French vous. The only exception is Polish, which has 
grammaticalized the noun pan ‘gentleman’ and its derivates showing agreement 
with the third person into the distant form of address. 


3.2. Derivational morphology 


The Slavonic languages are characterized by a very rich system of derivational 
word formation. The main morphological techniques are prefixation, suffixation, 
and to a less degree the composition of lexical roots, which are often accompanied 
by morpho-phonological alternations of the lexical stem and/or the affix. Trun- 
cations like Polish czotga-é¢ sie > czołg ‘crawl-INF REFL’ ‘to crawl’ > ‘tank’ are 
relatively rare. The inventory of word formation patterns allows for the permanent 
enrichment of the stock of nouns, adjectives, verbs, and adverbs. There are patterns 
specialized to create new members of one and the same word class and those which 
are based on the transposition from one word class to another. Some affixes add an 
expressive component. For the purpose of illustration, here are some word 
formation patterns of Polish nouns and adjectives: 


Table 7: Examples of transpositional word formation patterns 


noun > noun: ryba ‘fish’ > rybnik ‘fisherman’ 
adjective > noun leniw-y ‘lazy-NOM.SG.M’ > lenistwo ‘laziness’ 
verb > noun biega-é ‘run-INF’ > biegacz ‘runner’ 


adjective > adjective ` grub-y 'fat-NoM.sG.M' > grubaw-y ‘pudgy-NOM.SG.M’ 
verb > adjective ptaka-é ‘cry-INF’ > placzliw-y ‘tearful-NOM.sG.M’ 


noun > adjective samochód ‘car’ > samochod-ow-y ‘car-ADJ-NOM.SG.M’ 


The individual patterns vary according to the semantic motivation of the new 
word; in some cases we are dealing with a high degree of compositionality, in 
others the new form renders less motivated new meanings. 
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A peculiarity of Slavonic word formation is its intimate link to grammatical in- 
flection. As mentioned above, Slavic-style aspect is an obligatory, hence fully 
grammatical category of the Slavonic verb, which is not expressed by regular in- 
flection but by derivational morphology. It is expressed by the addition of a range 
of prefixes or suffixes to a verbal stem. In some cases, the imperfective verb is de- 
rived by the addition of a suffix to the stem of the perfective equivalent, as in Rus- 
sian da-t ‘ppy > da-va-t’ppy ‘to give’. In other cases, the perfective form is derived 
by prefixation from the simpler imperfective equivalent, as in Russian Cita-t’ ppy > 
pro-cita-t'yg, ‘to read’. Sometimes the aspectual opposition is marked by supple- 
tive verbal roots (e.g. Czech brát p, vs. vzit,,, ‘to take"). 


IPFV PFV 


4. Syntax 


4.]. The verb phrase 


Slavonic uses a range of auxiliary constructions to encode temporal, modal, and 
evidential operators. In the majority of languages past tense, or rather perfect, are 
expressed by a form historically going back to a construction with an auxiliary plus 
adjectival participle. This analytical form in the individual languages underwent 
different stages of grammaticalization causing the gradual erosion of the auxiliary. 
In Russian the grammaticalization process has lead to the full elision of the auxili- 
ary leaving a synthetic verbal form; cf. 


Serbian 
(5) cita-l-i smo 
read-PTCP-PL AUX.1PL 


Polish 
(5) czyta-l-i=smy 
read-PTCP-PL=1 PL 


Russian 
(5”) my  Cita-l-i 
we  read-PTCP-PL 
“We have read. / We read.’ 


A similar analytical construction is the compound conditional found in all lan- 
guages without exception; it is also based on the adjectival participle but combines 
with a different form of the copula. 


Czech 

(6) koupi-l-i bychom 
buy-PrcP-PL cond. Int 
"we would buy’ 
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Also, future tense in all languages is formed by an analytical construction consist- 
ing of an auxiliary and a non-finite verb form (in South Eastern Slavonic plus finite 
verb). The auxiliaries are derived from verbs of *being' or *wanting'. West and 
East Slavonic future auxiliaries bear person marking whereas the lexical verb is in 
a non-finite form, usually the infinitive. In Polish and in Slovene, however, the 
lexical verb can be in a participle form agreeing with the subject in number and 
gender. A different structure is found in South Eastern Slavonic: the Serbian aux- 
iliary is aclitic, which can co-occur either with the infinitive or the semi-finite form 
of the subjunctive. A fourth structural type is found in Bulgarian where the future 
auxiliary remains uninflected and combines with a finite verb: 


Slovak 
(7) bud-em  hovori-t' 
FUT-1PL talk-INF 


Slovene 
(T) bo-mo  govori-l-i 
FUT-1PL talk-PTCP-PL 


Serbian 
(77) mi ćemo — govori-ti/ govori-c-emo 
we FUT-IPL talk-INF talk=FuT-1PL 


Bulgarian 

(T7) šte govori-m 
FUT talk-1PL 
“We will talk.’ 


Modal constructions expressing necessity and possibility in Slavonic follow the 
general European patterns (see Hansen and de Haan 2009). In every language there 
is a closed set of polyfunctional independent words serving as operators on lexical 
verbs. The modal constructions vary with respect to (i) the syntactic encoding of 
the subject and (ii) the assignment of the agreement marking to the modal and/or 
the main verb and (iii) TAM-marking on the modal or an auxiliary. The subject can 
be coded either in the Nominative, the Dative, or it can be elided. 


Russian 
(8 My moz-em rabota-t’. 
we.NOM can-1PL work-INF 


“We can work.’ 


(8) Nam mozno rabota-t’. 
We.DAT possible WOrk-INF 
‘It is possible for us to work’ 
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(8") Mozno rabota-t’. 
possible work-INF 
‘It is possible to work.’ 


The agreement with the subject can be marked in three ways: only on the modal 
(9), only on the main verb (9’) or on both the modal and the main verb (9°). 


Serbian 

(9 Ivan i Pavle mora-ju  radi-ti. 
Ivan and Paul must.3pL work-INF 
‘Ivan and Paul must work 


(9°) Ivan i Pavle treba da rad-e. 
Ivan and Paul should comp work-3PL 
‘Ivan and Paul should work.’ 


(9") Ivan i Pavle mora-ju da rad-e. 
Ivan and Paul must.3pL coMP work-3PL 
‘Ivan and Paul must work.’ 


In many Slavonic languages, especially in the Eastern area, we find modals histori- 
cally going back to adjectives. In these constructions tense and mood is not marked 
on the modal but on a second auxiliary: 


Russian 

(10) Vanja dolz-en by-l rabota-t'. 
Vanja obliged-m.sG be-PrCP.sG.N work-INF 
“Vanja had to work.’ 


In all Slavonic languages we find analytical constructions with imperative or hor- 
tative meaning which fill functional gaps in the imperative paradigm. These con- 
structions are the results of the grammaticalization of verbs like ‘to let’, ‘to give’ 
and others. The imperative marker occupies the first or the second syntactic slot in 
the clause and has to precede the lexical verb, which in some constructions is in the 
indicative and in others in the infinitive: 


Polish 

(11) Niech Pawet czyta! 
LET Paul  read.PRs[3sc] 
‘Let Paul read!’ 


Serbian/Croatian 

(12) Nemoj reci da [...] 
PROH.2SG say-INF comp 
‘Don’t say that [...]' 
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It is worth mentioning that Bulgarian has amore elaborated mood system, which in 
addition to the conditional comprises the so-called oblique mood which functions 
as quotative, inferential, and mirative marker. For more details on mood in Slav- 
onic see the corresponding articles in Rothstein and Thieroff (2010). 

A feature setting Bulgarian, Macedonian, and to a certain degree also Serbian 
apart from the other Slavonic languages is the loss of the infinitive, a feature typi- 
cal of the Balkan Sprachbund. In Bulgarian and Macedonian the infinitive has 
been marginalized or even replaced by a construction which carries features of a 
complex clause. We are dealing with a condensed complementizer-headed clausal 
construction with a verb inflecting for person and number but not for mood and 
tense (subjunctive). In Serbian this construction competes with the regular infini- 
tive. 


Bulgarian 
(13) Moze-te da dojde-te pri nas s tramvaj 22. 
can-2PL COMP come-2PL to us with tram 22 


“You can come to our house with the tram N° 22.’ 


4.2. The noun phrase 


Within the Slavonic noun phrases we can distinguish between agreeing and non- 
agreeing adjuncts, traditionally called attributives, on the one hand, and non-agree- 
ing complements, i.e. elements filling a slot opened by the lexical head noun, on 
the other hand. Agreeing attributives are characterized by consistent agreement in 
gender, number, and case. The main word order pattern in Slavonic NPs is Det- 
Adj-Noun: 


Polish 

(14) t-a mtod-a dziewczyn-a 
this-NOM.sG.F young-NOM.SG.F girl-NOM.SG.F 
‘this young girl’ 


In some semantically determined cases we find the postposition of the determiner 
or of the adjective; e.g. in Polish this holds for adjectives denoting permanent 
features of the referent like jezyk polski ‘the Polish language’. In line with known 
word order universals relative clauses follow their nominal heads whereas quanti- 
fiers (usually) precede them. 


Czech 

(15) Dopis, kter-y jsem od něho dosta-l, 
letter  which-NoM.sG Aux.lscG from him  got-PTCP.M.sG 
je velmi dülezit-y. 
be.3sG very important-NOM.SG.M 


“The letter I received from him is very important.’ 
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Polish 

(16) dwa samochod-y 
two car-N.PL 
‘two cars’ 


Non-agreeing complements and adjuncts are usually postponed and can be in the 
genitive, dative, instrumental case, or in a prepositional phrase. Nouns denoting 
verbal concepts can govern a clausal argument (infinitive or complement clause): 


Czech 
(17) seznam zkratek 
list abbreviation[GEN.PL] 


‘a list of abbreviations’ 


Russian 

(18) pamjatnik Puskin-u 
monument Pushkin-DAT.sG 
“a monument to Pushkin’ 


Russian 

(19) upravlenie biznes-om 
administration business-INs.sG 
“business administration’ 


Polish 

(20) szczoteczka do zeb-ów 
brush to tooth-GEN.PL 
*tooth brush’ 

Russian 

(21) Ne by-l-o nadezd-y vyrva-t'-sja na 
Not be-PsT-N.sG hope-GEN.sG get.out-INF-REFL ON 
svobod-u. 


freedom-Acc.sG 
"There was no hope to regain freedom.’ 


Polish 
(22) Decydowani-e, kiedy dziec-i mog-q 
Decide-NMLz.NoM.sG when child-NoM.PL can-3sG 
uzywa-é komputer-a. 
use-INF  computer-GEN.SG 
"The decision, when children will be allowed to use the computer.’ 


The Slavonic languages lack obligatory markers for definiteness with the excep- 
tion of Bulgarian and Macedonian, which have a suffixed definite article (Bulgar- 
ian masa ‘(a) table’ vs. masa-ta ‘the table’). The definite article in Bulgarian is a 


Slavonic languages 113 


clitical element which occupies the second position in the NP; i.e. in NPs without 
adjectives it is in postposition to the noun but moves away if the noun is accompa- 
nied by a modifier: 


Bulgarian 

(23) cant-i-te 
bag-PL-DEF. ART.PL 
‘the bags’ 


(23°) xubav-i-te čern-i | koZen-i cant-i 
nice-PL-DEF.ART.PL black-pL leathern-pL bag-PL 
‘the nice black leather bags’ 


It has often been claimed that the morphological opposition between long and short 
form of the adjective in Serbian/Croatian, as in star Covjek vs. star-i Covjek ‘old 
man’, is used to mark definiteness. As recent corpus-based research shows, how- 
ever, the distribution of the two forms does not directly correlate with definiteness 
(cf. Hansen 2004). 

NPs with numeral quantifiers are characterized by some morpho-syntactic ir- 
regularities due to the non-homogeneous dependency marking by agreement and 
government (see Corbett 1987). From a typological point of view, one can speak of 
a morphosyntactic category ‘paucal’, denoting the numbers ‘two’, ‘three’, and 
‘four’. The Slavonic paucal relies on the alternation of government structures; e.g. 
the Polish numeral ‘one’ behaves like a regular, agreeing adjective in the noun 
phrase, i.e. it does not influence the syntactic encoding of the lexical head, which 
shows up in the regular Nominative Singular; the numerals ‘two’, ‘three’, and 
‘four’, however, behave like nouns and govern the nominative plural (in Russian 
the genitive singular) and numerals higher than ‘four’ govern the Genitive plural. 


Polish 

(24) jeden samochód 
one.NOM.SG.M Car.NOM.SG 
‘one car’ 

(24’) dwa samochod-y 
two.NOM.M Car-NOM.PL 
‘two cars’ 


(24’’) pieé samochod-ów 
five.NOM Car-GEN.PL 
*five cars' 
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4.3. The simple clause 


The traditional descriptions of Slavonic syntax usually classify sentence patterns 
according to their primary function in the speaker-hearer communication. These 
functions, which are associated with specific syntactic patterns and/or intonational 
features, are declarative, interrogative, and imperative. The first communicative 
sentence type is considered unmarked. Slavonic interrogative sentences distin- 
guish between wh- and yes/no-questions. The first are characterized by the fronting 
of the interrogative element: It can be in the initial or in the non-initial preverbal 
position as in the following examples (Meyer 2002): 


Russian 

(25) Cto — dela-et mozg vo vremja sn-a? 
What do-3sc brain in time sleep-GEN.SG 
“What does the brain do during sleep?’ 


(26) A vo Franci-i čto X interesn-ogo vy 
And in France-prep.sG what interesting-GEN.sG you 
vide-l-i? 

See-PST-PL 


“What interesting things did you see in France?’ 


Czech and Polish allow the interrogative element to stay in situ not only in so- 
called echo-questions; in situ is the unmarked position of the unknown constituent 
in the corresponding declarative sentence (Meyer 2002). 


Czech 

(27) A tvůj bratr — nes-l dárek komu? 
And your brother bring-PsT.M.sG present whom.DAT 
‘And your brother brought the present to whom?’ 


Polar or so-called yes-no-question can be marked by intonation and/or a specific 
grammatical question marker. Czech and Russian polar questions marked exclus- 
ively by intonation are pragmatically unmarked, meaning they do not trigger spe- 
cific implicatures (Meyer p.c.). 


Russian 

(28) Interesn-aja rabot-a u vas? 
Interesting-NOM.sG work-NOM.SG at yOU.GEN.PL 
literally: “You have an interesting work?’ 


Whereas Czech and Slovak do not possess question particles, Russian and Bulgar- 
ian use the clitic li, which usually appears in the second position of the clause. 
Polish czy, in contrast, has to be in the clause-initial position; compare: 
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Russian 

(29) Doziv-u li ja do ét-ogo let-a? 
live-lsc Q I to this-GEN.SG summer-GEN.SG 
“Will I live till that summer?’ 


Polish 

(30) Czy brat X jest w dom-u? 
Q brother be-3sc in house-PREP.SG 
‘Is your brother at home?’ 


Generally, Slavonic interrogative constructions do not show strict word order rules 
concerning the position of the finite verb; i.e. word order reversals like in German 
are not attested. 

The set of imperative sentence types is characterized by specific syntactic tem- 
plates in which (a) the first argument of the verb is deleted (ex. 31) or (b) a pre- 
poned analytical marker occurs (see examples 11 and 12). 


Serbian/Croatian 
(31) Ø Citaj-te ov-aj tekst! 
Read-ımp.2PpL this-Acc.sG text 
"Read this text!’ 


The Slavonic languages, like nearly all European languages, belong to the accus- 
ative marking languages; i.e. they assign subjects of transitive and intransitive 
verbs one and the same case (Nominative) and objects another (Accusative). All of 
the Slavonic languages are SVO-languages with pragmatically flexible word order, 
i.e. elements can be shifted around according to focus operations. In Slavonic we 
find empirical evidence for the gradualness of the category subject. The overt 
coding properties of prototypical subjects are (1) Nominative case, (ii) triggering of 
agreement on the predicate and (iii) default pre-predicate position. All these prop- 
erties are present in the following example: 


Czech 

(32) Iren-a Cie knih-u 
Irena-NoM.sc read-3sG book-Acc.sc 
‘Irena is reading a book.’ 


Apart from that, Slavonic subjects show some of the known ‘behaviour-and-con- 
trol properties’ like binding of reflexives, equi-NP reference in complement 
clauses, deletion in coordinate conjunction constructions, and others (for Russian 
see Testelec 2001). 


Russian 
(33) Mal’cik udari-l sebja. 
boy beat-PST.sSG.M self.acc 


“The boy hit himself.’ 
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(34) Mal'cik xoc-et  udari-t' brat-a. 
boy want-3sG hit-iNF  brother-Acc.sc 
"The boy wants to hit his brother.’ 
(35) Mal'cik udari-l brat-a i 
boy beat-PsT.sG.M brother-acc.sG and 
ubeza-l. 


run.away- PST.SG.M 
“The boy hit his brother and ran away.’ 


As mentioned above, prototypical subjects trigger agreement with the predicate in 
the categories number, person, and gender. The agreement relation is usually de- 
termined by formal characteristics of the subject NP. Some peculiarities are found 
if the formal and the semantic gender assignment are in conflict. This is e.g. the 
case with some Russian nouns denoting professions like vrac ‘doctor’, which can 
be used to refer either to men or to women. In the latter case we are dealing with a 


hybrid masculine/feminine agreement pattern: 


Russian 
(36) Zubn-ojn.s. vrac  pris-I-agm- ONG gem nam 
tooth.ADJj-M doctor come-PST-F.SG she us 


skaza-l-ag. [...] 
Say-PST-F.SG 
“The dentist came in. She told us [...]’ 


This can be explained by Corbett’s (1988: 14) ‘Agreement Hierarchy’, which pre- 
dicts that the occurrence of different agreement patterns with one and the same 
noun is restricted by the following positions: 


attributive <predicate < relative pronoun <personal pronoun 
(zubnoj) (prisla) (ona) 


For any noun "permitting alternative agreement forms, as we move rightwards 
along the hierarchy, the likelihood of agreement forms with greater semantic jus- 
tification will increase monotonically" (ibid.). 

There is good evidence to assume some Slavonic languages to have non-ca- 
nonical or quirky subjects encoded in dative case. These are NPs which do not 
trigger the agreement of the predicate but which are able to bind reflexives. 


Russian 

(37 Emu by vernu-t'-sja k sebe domoj. 
he.DAT COND return-INF-REFL to self. par home 
‘He should better go home (to himself). 


Slavonic languages 117 


There are various constructions and construction templates in the individual lan- 
guages dedicated to the grammatical encoding of the predicate and its syntactically 
privileged argument (the subject). Apart from the typical subject-predicate-object 
templates, there is a series of specific — sometimes lexically restricted — construc- 
tions lacking a canonical subject in the nominative case. Here are some of them: 


(i) ‘Pro-drop’: in some languages the unstressed pronominal subject is regularly 
omitted 


Czech 

(38) Udela-l jsem velk-ou hloupost. 
make.PTCP.M.sG AUX.1sG big-Acc.sG stupidity 
‘I have done something very stupid.’ 


(ii) Avalent verbs (metereological verbs) 


Czech 
(39) Prší. 
rain-3sG 
‘It is raining.’ 


(iii) Experiencer constructions 


Russian 

(40) Menja  tosni-t. 
me.ACC nauseate-3sG 
*[ feel sick.’ 


(iv) modal constructions with dative subject (see above) 


Russian 

(41) Nam možno rabota-t’. 
we.DAT possible work-INF 
“It is possible for us to work.’ 


(v) deagentive constructions 


Polish 

(42) Jak wam sie tu mieszka? 
how you.DAT.PL REFL here live.3sG 
“How do you feel living here?’ 


(vi) ‘anonymizing’ constructions (e.g. -no/-to construction in Polish) 


Polish 
(43) Rann-ych przewiezio-no do szpital-a. 
injured-ACC.PL bring-No to hospital-GEN.sG 


“The injured were taken to hospital.’ 
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To conclude this section, we shall briefly discuss some features of the passive in 
Slavonic (for a full description see Siewierska 1988). The following types of con- 
structions marking the promotion of a non-first argument into the subject position 
and the demotion of the first argument are attested: 


(I) Full passive: Ist Argument = prepositional Object; 2nd Argument = Subject 


Polish 

(44) Most jest budowa-n-y przez 
Bridge.NoM.sG be.3sc build-PrCP-M.sG through 
robotnik-ów. 
worker-ACC.PL 
"The bridge is being built by workers.’ 


(ID Reduced passive: Ist Argument = Ø; 2nd Argument = Subject 


Polish 

(44°) Most Jest budowa-n-y. 
Bridge.NoM.sG be.3sG build-PTCP-M.sG 
"The bridge is being built.’ 


(IIT) Recipient passive: 1st Argument = indirect object; 2nd Argument = direct ob- 
ject; 3rd argument = subject. This passive type is found in Czech but neither in 
Russian nor in Polish. 


Czech 

(45) Petr má prikdzd-n-o vráti-t se za 
Peter have.3sG order-PTCP-N.sG return-INF REFL from 
svétl-a. 


daylight-GEN.sG 
‘Peter was ordered to get back before dark.’ 


Apart from these, there are various constructions involving the demotion of the 
first argument without the promotion of the second argument (e.g. the Polish 
-no/-to construction as in example 43). With respect to promoted entities, the Slav- 
onic passive constructions reflect the universal accessibility hierarchy: direct ob- 
ject > indirect object > adjunct. In Russian and Polish, only direct objects can be 
promoted to the subject position whereas Czech, like German and English, allows 
passives with promotion of a dative indirect object. The promotion of adjuncts 
is not attested. From a morphological point of view, one can distinguish between 
analytical and synthetic passives. In the Slavonic languages we predominantly find 
analytical passives consisting of an auxiliary plus a participle (see example 2). 
Apart from that, some languages have passives formed with reflexives like the 
Russian form with the postfix -sja; 
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Russian 

(46) Most stroi-t-sja. 
bridge build-3sG-REFL 
"Ihe bridge is being built.’ 


4.4. The complex clause 


Slavonic complex clauses are usually divided into coordinate and subordinate 
complex structures. Coordinate complex sentences are marked by conjunctions 
like ‘and’, ‘but’, and ‘or’ and function similarly to their English equivalents. There 
is a whole series of subordinate clauses ranging from non-finite constructions with- 
out tense and mood marking (infinitival and converbal constructions) to fully- 
fledged subordinate sentences with regularly marked verbs. We can distinguish 
complement and adjunct subordinate clauses. It is worth mentioning that subordi- 
nate clauses show basically the same word order rules as superordinate clauses. 

Slavonic biclausal infinitive constructions can appear after control verbs allow- 
ing for subject or object control: 


Polish 

(47) Zdecydowa-l-em; sie ,_ kupi-é LCD Samsung. 
Decide-PTCP-lsG REFL buy-ınF LCD Samsung. 
‘T ve decided to buy a LCD Samsung.’ 


(48) Nie radz-e, ci; teraz _ wychodzi-é. 
Not recommend-1sG you.DAT now leave-INF 
‘I don’t recommend you to leave now.’ 


The Slavonic languages have converbs which are used in constructions where the 
subject of the converbial clause is deleted and has to be co-referential with the sub- 
ject of the matrix clause, which usually has to be in the nominative case. In some 
cases, also dative subjects are possible: 


Serbian/Croatian 
(49) Ceka-mo voz  razgovaraj-uci o nas-im 
Wait-lIPL train talk-cvB about our-PREP.PL 


plan-ov-ima. 
plan-PL-PREP.PL 
“We are waiting for the train talking about our plans.’ 


(50) Hladn-o mi je Cekaj-uci voz. 
Cold-N.sc me.DAT be-3sG wait-CvB train 
‘I am freezing while waiting for the train.’ 


Some West Slavonic languages possess the AclI-construction which however is 
lexically restricted to very few verbs. 
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Czech 

(51) Vidi-m ho prichaze-t. 
See-1sG him.ACC come-INF 
‘I see him coming.’ 


Among the subordinate clauses with a finite verb we find complement, adverbial, 
and relative clauses. Complement clauses governed by verbs of saying, demand- 
ing, perception, and others are introduced by a clause-initial complementizer with 
the original meanings ‘that’ or ‘how’. In some cases the complementizer is fused 
with the conditional marker. 


Polish 

(52) Chc-e, Zeby=s przyszed-t do dom-u. 
want.-1SG COMP.COND=2SG come-PTCP.M.sG to house-GEN.SG 
‘I want you to come home.’ 


Adverbial clauses are traditionally grouped together according to the main sem- 
antic concepts encoded in the conjunction like locative, finale, temporal, com- 
parative, conditional, etc. Apart from that, Russian linguistics postulates the exist- 
ence of complex sentences without any dedicated conjunction or other type of 
connector. These can be realized e.g. by specific combination of mood forms as in 
the following example, which gains the reading of a conditional sentence. 


Russian 

(53) Brosi-l by ee togda, nicego by 
Leave-PTCP.sG.M COND her.acc then nothing COND 
ne by-l-o. 


not be-PTCP-N.SG 
‘Had he left her at that time, nothing would have happened.’ 


Relative clauses function in a way known from other Indo-European languages: 
They are introduced by relative pronouns inflecting for case, number, and gender. 
The pronoun agrees with its head noun in gender and number and receives its case 
from the predicate of the subordinate clause. 


Russian 

(54) V 2010 god-u bud-et | sozda-n 
In 2010 year-PREP.sG FUT-3.sG creat.PTCP.NOM.SG 
avtomobil', | kotor-omu ne nuzn-o topliv-o. 


Car.M.NOM.SG REL-M.DAT.SG not necessary-N.sG fuel-N.NOM.SG 
‘In 2010 a car will be invented that does not need any fuel.’ 
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Table 8: ` Typological features 
dominant typological features exceptions 
morphological synthetic 
structure many morphonological 
alternations 
clear distinction of word classes 
gender 3 genders with animacy Bulgarian and Macedonian: 
distinctions no animacy distinction 
number singular, plural, syntactic paucal Old Church Slavonic, Sorbian and 
Slovene: also dual 
case six cases (in some languages plus Bulgarian and Macedonian: loss of 
vocative) case system 
definiteness no articles Bulgarian and Macedonian: clitic 


tense-aspect 
system 


aspect 

future tense 
mood system 
hortative 


evidentiality 
modality 
(possibility and 
necessity) 
passive 
constructions 
possessive 
construction 
word order 


non-canonical 
subjects 
non-finite verb 
forms 


aspect in all verbal forms, aspect 
grammatical, but derivational 


perfective vs imperfective 
auxiliary verb 
indicative-imperative-conditional 
analytical construction with the 
auxiliary ‘to let’, ‘to give’ or others 
verbs and particles 

analytical constructions 


with auxiliary verb or reflexive 
‘have-constructions’ 


flexible SVO 

Adj-N 

N-Rel.clause 

Prep-N 

dative and zero subjects 


infinitive, converbs, participles 


definite article and non-clitic 
indefinite article 

South Slavonic: combination of old 
tripartite and new Slavonic aspect 
system 


Serbian and Ukrainian: suffix 


Bulgarian: synthetic 


Russian: ‘be-construction’ 


Loss of infinitive in Bulgarian, 
Macedonian and partly in Serbian 
Lower Sorbian and Slovene: also 
supine 

Polish: also fully grammatical 
verbal noun 
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Notes 


1 American authors prefer the form ‘Slavic’. 
For discussions of the figures concerning the number of speakers see the corresponding 
articles in Okuka (2002). 

3 In other languages: Rehder (2006), Moldovan et al. (ed.) (2005), Okuka (2002), Dalewska- 
Greń (1997). 

4 The IPA notation is [“] vs. [7]. 
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1. Introduction 


Europe’s linguistically most exotic area is the Caucasus. In terms of linguistic den- 
sity it is the subcontinent’s New Guinea. Languages of Western, Central and Fast- 
ern Europe are less typologically diverse — and not much more numerous - than the 
languages spoken in its southeastern corner. Three “endemic” language families 
are spoken here, South Caucasian (Kartvelian), Northeast or simply East Cau- 
casian (Nakh-Daghestanian) and Northwest or simply West Caucasian (Abkhaz- 
Adyghe). The latter two are sometimes considered to form a deep-level North Cau- 
casian family (see Nikolaev and Starostin 1994), but this entity is disputed. An ear- 
lier hypothesis of genealogical relationships among these families (the assumed 
Ibero-Caucasian family) has now been largely abandoned (cf. Tuite 2008). The lin- 
guistic diversity of the area is further expanded by the presence of Turkic and Indo- 
European languages. 

The South Caucasian family includes four languages that are relatively close 
from the typological point of view. West Caucasian is somewhat less homogene- 
ous, counting five languages, which split into two branches. East Caucasian is the 
most diverse, with over 30 languages divided into eight branches. 


Endemic language families: 

South Caucasian: Georgian (about 5 million speakers), Mingrelian (Megrelian) 
and Laz (sometimes merged into one Zan language, about half a million and several 
dozens of thousands speakers, respectively) and Svan (less than 20 thousand); 

West Caucasian: Kabardian (East Circassian), Adyghe (West Circassian), 
Abkhaz, Abaza and Ubykh, ordered according to the number of speakers from 
about one million in Kabardian (although the demographic data may not be quite 
reliable) to Abaza with several dozens of thousands speakers to the now extinct 
Ubykh; 
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East Caucasian: the Nakh branch, including Chechen (about 1 million), Ingush 
(over 200 thousand) and Bats, alias Tsova-Tush (according to some accounts, a few 
hundred speakers); 

the Tsezic branch, including Tsez (alias Dido), Bezhta, Hunzib, Khwarshi, 
Hinuq (from under 10 thousand speakers for Tsez to less than one thousand for 
Khwarshi and Hinug); 

Avar (over half a million); 

the Andic branch, including Andi (about 20 thousand), Tindi, Chamalal, Bot- 
likh, Karata, Akhvakh, Godoberi, Bagvalal (each less than ten thousand speakers 
and as few as two thousand for Bagvalal); 

Lak with over 100,000 speakers; 

the Dargwa branch, including Kubachi, Itsari, Mehweb and other (Dargwa has 
long been considered to be a dialect cluster and does not have a generally accepted 
classification; speakers of Dargwa languages count between 300,000 and 
400,000); 

Khinalug (around 1,500); 

the Lezgic branch, including Lezgian (over 400,000), Tabasaran (under 
100,000), Rutul, Tsakhur and Agul (all about 20,000), Udi (about 7,000), Kryz 
(less than 4,000), Archi (about 1,500) and Budukh (500 at most). 
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Non-endemic language families: 
Turkic: Azerbaijani, Kumyk, Nogai, Karachay-Balkar 
Indo-European: Armenian (Eastern) and the /ranian languages Ossetic and Tat. 


Below, we will primarily deal with endemic families, sometimes also with Arme- 
nian (for the purposes of this survey, the difference between Eastern and Western 
Armenian may be neglected), to a lesser extent with Iranian languages, which are 
also spoken outside the Caucasus, and minimally with the Turkic languages, as 
Turkic is dealt with in a separate chapter of the present volume. 

There are only two languages with an uninterrupted written tradition that goes 
back to the early Middle Ages: Georgian and Armenian, both of which use their 
own writing systems. The earlier stages of these languages, Old Georgian and 
Classical Armenian (Grabar) were in many respects significantly different from the 
respective modern languages and will not be considered either. Until recently, only 
these two languages provided reliable data for studies of language-internal devel- 
opment (a small number of written records of Avar are available from the 14th cen- 
tury on). The situation has changed due to the discovery of a substantial manuscript 
in Caucasian Albanian, a language which had been previously known from a very 
limited number of inscriptions found on the territory of the modern northern Azer- 
baijan and which proved to be an early (around the middle of the first millennium) 
dialect of Udi (Gippert et al. 2009). 

Major East and West Caucasian languages used Arabic-based systems through- 
out the 19th century, but these were substituted by Cyrillic-based systems intro- 
duced in late thirties (after a brief period of using the Latin alphabet). Since around 
2000, Azerbaijani only uses a Latin-based alphabet, close to the Turkish one. Some 
languages have introduced writing systems in the last two decades (e.g. Udi, Rutul, 
Tsakhur), some have recent writing systems that are not used in practice (e.g. Archi 
and Khinalug), and some remain unwritten (various Tsezic and Andic languages, 
Svan). 


2. Sociolinguistic situation 


An important factor fostering language diversity in the Caucasus is the mountain- 
ous landscape, but there are sociocultural factors as well; for instance, until re- 
cently many of the communities were endogamous. Many languages, including 
Armenian and Avar, show considerable internal variation, to the extent that differ- 
ent dialects are not easily mutually intelligible. Several East Caucasian languages 
are spoken in one village only, viz. Godoberi, Archi, Budukh, Khinalug, Bats. Mi- 
gration may lead to peculiar sociolinguistic situations of co-existence of and inter- 
ference between two distinct dialects within one village (the dialect of the original 
population and that of newcomers), observed in some regions of Armenia. 
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The ecological situation varies greatly, and no complete picture can be drawn 
because of the lack of systematic sociolinguistic surveys. There seem to be very 
few moribund languages in terms of Krauss (1992), as most languages are still 
learned as first languages. Only one Caucasian language, Ubykh (West Caucasian), 
is known to have become extinct. In the second half of the 19th century, the 
Ubykhs were forced to leave Russia; those who survived settled in Turkey, and the 
last speaker of Ubykh, Tevfik Esenc, died on November 7, 1992. Highly endan- 
gered or, according to some accounts, even moribund are Budukh (Lezgic) and 
Bats (Nakh). But many other languages are at least potentially endangered because 
of politically and economically driven migrations, especially by the outflow from 
native villages to the towns, where Russian, Azerbaijani or Georgian is the domi- 
nant if not the only means of interethnic communication. In Tsezic or Andic lan- 
guage communities, a major language of the area, Avar, is taught in schools as a 
mother tongue. Similarly, only Lezgic or Azerbaijani are taught to speakers of 
some Lezgic languages and Khinalug, and only Georgian is taught to the speakers 
of Mingrelian and Svan. The combination of all these factors makes the survival 
prospects of these languages unpredictable. 

Diasporic situations are very characteristic of the Caucasian languages in the 
20th century. Just as Ubykhs, many Circassians (i.e. speakers of Adyghe and Ka- 
bardian) were expelled from Russia to settle in Turkey, Jordan, Israel and else- 
where in the Near East. Ethnic cleansings in Turkey in the early 20th century led to 
the almost complete emigration of the surviving Western Armenians and the 
growth of Armenian diasporas in Europe and the USA. Today, Russia has sizeable 
diasporas of Georgians, Armenians, Chechens, Daghestanians and Azeris, partly 
dating back to Soviet times but reinforced by recent economic and political events, 
such as the collapse of the Soviet Union and the Chechen wars. 

Multilingualism is very common in the Caucasus. In the last two centuries, 
Russian spread throughout the area as a language of urban interethnic communi- 
cation, although now its role obviously diminishes in Armenia, Georgia and Azer- 
baijan. In some areas of Daghestan, Avar and Lezgian serve as lingua franca on a 
par with Russian. As a result, speakers of some minor languages of Daghestan have 
a full command of at least three languages — the language of their village, a major 
language dominant in the area, and Russian. Sometimes, the same language is 
spoken on the two sides of a national border, which produces parallel patterns of 
bilingualism with the same mother tongue but different literacies (as is the case of 
Lezgian, Avar and Akhvakh spoken both in Russia and Azerbaijan; Ossetic spoken 
in Russia and Georgia, Udi spoken in Azerbaijan, Georgia and Russia etc.). Cau- 
casian languages provide a rich field for investigating code-switching. 
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3. Phonetics and phonology 


Caucasian languages are famous for their rich phonological inventories, extremely 
abundant in consonants. Georgian and Armenian both have a considerable array of 
consonantal elements (including a typologically rare opposition between a trilled r 
and a flapped r in Armenian), but this richness reaches an extreme in West and East 
Caucasian. Some languages, such as Ubykh and Archi, count over 70 phonologi- 
cally distinct consonants. Ejectiveness involves not only stops but also fricatives; 
it is quite widespread in each of the three endemic families but is also present in 
Ossetic and, according to some accounts, starts to develop in Armenian and 
Kumyk dialects. Other phonological features of consonants include voicing, aspir- 
ation, tenseness or gemination and labialization, which is widespread as a phono- 
logical feature with relatively free combinability (as in Avar). Consonant palatal- 
ization is rare, but attested in e.g. Ubykh, Tsakhur, Khinalug. In the latter, a 
palatalized consonant preceding a back vowel or a non-palatalized consonant 
preceding a front vowel — so to speak, the reverse of palatalization — signal a mor- 
pheme boundary (e.g. ink,-e mountain-GEN, where the subscript prime may be in- 
terpreted as *no expected palatalization occurs’) 

Typologically rare consonants include laterals [#, L, L’] (IPA #, kt, kt’), which 
are quite common in East Caucasian, and less common pharyngealized obstruents 
in e.g. Ubykh and Archi (see below). East Caucasian also features an extensive in- 
ventory of laryngeal and pharyngeal consonants, typically four [h, ?, $, h] but ad- 
ding the epiglottal stop [2] in Budukh and Agul; these are fewer in West Caucasian 
(between one and three) and only one — [h] - in most South Caucasian languages. 

Table 1 demonstrates the consonantal system of Archi; pharyngealization [1] 
combines with the uvulars; labialization [W] combines with virtually all obstruents 
except for the labials, laryngeals, and pharyngeals and thus nearly doubles the 
number of consonants. Bracketed are the obsolete lateral voiced fricative [LE], 
which is probably also secondary from the point of view of morphophonology, and 
IN. ñ], which are almost exclusively used in Arabic loanwords. 

Consonant clusters are rare in East and West Caucasian, but widespread in 
South Caucasian and Armenian. It is not uncommon for Georgian and Armenian 
to have four or more consonants following each other in orthography: cf. a one-syl- 
lable Georgian word gvbrdyvnis (‘he tears us into pieces’). However, their pho- 
netic realization (and sometimes phonological representation) may involve schwas 
which are not spelled in writing: they are shown as (9) in the following Armenian 
example: m(a)st(a)n3enakan ‘eternal’. 

Vowel inventories are poor in West Caucasian, standard in South Caucasian 
and vary from regular to rich in East Caucasian languages. West Caucasian lan- 
guages distinguish two or three vowels only. South Caucasian has ‘triangular’ sys- 
tems based on the standard five vowel inventory [i - e - a — o — u] (Svan is a more 
complicated case). The same inventory is present in many East Caucasian lan- 
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Table 1: Consonantal inventory of Archi 
(Kibrik et al. 1977a, 4: 223-248; Kibrik and Kodzasov 1990: 335) 


b d go 

p tw) cw), GO) LO) kw) oa? 

P tw) kw 

p’ Cie) 7 OW), BOW) po) ko gwo ? 
Tw, P0 qo) 
zw, zw [B] go» [S] 
sm, Sie) HCH YO DI h 
go», Ste) dw) HOO 

m,w n r,lj 


guages (Avar, Andic) and is the basis of the vowel systems in many others, though 
some south Lezgic languages may be described as having ‘rectangular systems’. 
Variations on the triangular theme include adding a middle vowel (for instance, [9] 
in Archi or [i] in some Tsezic languages or distinguishing two “a”s in Lak), intro- 
ducing a row of fronted back vowels etc. (cf. Kibrik and Kodzasov 1990). Vowel 
length (attested in various branches of East Caucasian and West Caucasian and dia- 
lects of Svan) and nasalization (attested in Nakh, Tsezic and Andic branches of 
East Caucasian) are on the whole rare and peripheral. 

An important phonetic feature is pharyngealization, reported for Ubykh and 
various branches of East Caucasian (Lezgic, Tsezic, Lak, Dargwa); see also Ni- 
chols (2000). It is essentially a prosodic feature, but sometimes it *focuses on' or 
*gravitates to' one or other phoneme and may thus be interpreted as a phonological 
property. Most often its focus falls on a vowel (which duplicates the number of 
vowel phonemes), but e.g. Ubykh and Archi have pharyngealized consonants. 

Vowel harmony is rare (outside Turkic). To various extents it is attested in Lez- 
gian, Budukh, Khinalug, Udi, Kryz (Kibrik and Kodzasov 1990), Tat, all of which 
are spoken in Azerbaijan or southern Daghestan. On the other hand, some dialects 
of Kumyk, a Turkic language supposed to have the characteristic Turkic vowel har- 
mony but strongly influenced by East Caucasian, are reported to have unstable 
vowel harmony. The status of stress in the Caucasian languages is studied rather 
unevenly. In general, fixed stress seems to be rare (found in Armenian). Other lan- 
guages have stress mobility across paradigms, e.g. in direct vs. oblique nominal 
stems (e.g. Avar). 
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4. Morphological make-up 


Caucasian morphology is close to the agglutinative prototype. This is not to say that 
there is no fusion at morpheme boundaries; internal sandhi may be quite complex, 
depending on the language. However, combining two distinct grammatical mean- 
ings in one marker is rare, while conveying the same grammatical meanings by dif- 
ferent markers depending on the class of the stem (declensions and conjugations of 
e.g. Slavic) is absent or quite infrequent (with a possible exception of Svan). Ar- 
menian, which used to have nominal declensions in its classical period, only pre- 
served them peripherally, converging with the agglutinative model of other Cau- 
casian languages; Ossetic underwent a similar development. 

Major inflectional categories are expressed by suffixation; but most Caucasian 
languages also use other kinds of affixation. Thus, various Caucasian languages 
have systems of verbal prefixes, including West and South Caucasian, Ossetic, Nakh 
languages, Lezgic languages, Dargwa and Khinalug. Often these preverbs have 
clear spatial functions or origins; in Lezgic most of them are formally close to nom- 
inal spatial suffixes. Pre-root position in the verbal complex is where most argument 
marking takes place in West Caucasian. Personal agreement markers in South Cau- 
casian and class agreement markers in East Caucasian are likewise primarily prever- 
bal. In some East Caucasian languages, class agreement prefixes may appear on 
nouns (typically, kinship terms) and, in Archi, in some case forms of personal pro- 
nouns. Adjectives, participles, and sometimes case forms of nouns have either post- 
root (e.g. Lezgic and Andic) or pre-root (e.g. Nakh, Tsezic and Dargwa) agreement. 

Abundant verbal infixation is a salient morphological property of East Cau- 
casian. It primarily involves agreement class markers, but sometimes also other 
categories, such as imperfective in Archi or prohibitive in Budukh and Rutul. As 
class infixes are often identical to class prefixes, it is quite likely that these are 
former prefixes that were trapped between the verbal root and a former preverbal 
element. In Udi, personal markers, similarly trapped, are inserted, by analogy, into 
what may never have constituted a complex verb. Interestingly, the same markers 
occur not only inside the verbal root but also attach to different constituents word- 
finally. Harris (2002), who first introduced the notion of ‘trapped morphology’, 
considers them to be clitics that also occur word-internally and hence constitute a 
typologically rare category of endoclitics: 


(1) Udi endoclitics: same marker as infix and (apparent) clitic 
be'<=ne=>5-sa, iz bavazne. 
look ;<=3sG=>look,-PRs self.:GEN ` father=3sG 
‘He looks up - (that is) his father’. 


West Caucasian languages are often described as polysynthetic. There is no noun 
incorporation in the sense of Mohawk or Chukchi, but morphology can express 
much of what is conveyed by syntax in most languages of the world: 
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(2) Kabardian: polysynthetic complex (Kumakhov 1989: 123) 
w-a-q'o-do-O-d-(O-je-z-Ee-$o-Zo-f-a-te-q' o.m-jo 
2sG.Ss/P-3PL.IO-DIRECTIVE-COMIT-3sG.10-LOC-3SG.I0-OBL.PRVB-18G.A-CAUS- 
bring-REVERSIVE-ABLE-PST-THAT.TIME-NEG-AND 
"And that time then I was not able to make him bring you back out of there with 
them'. 


Utterances like (2) may not be frequent because of their morphological complexity, 
but the important thing is that they are structurally possible. To put it differently, 
there is a kind of competition, or overlap of potentials, between syntax and mor- 
phology, at least in some domains: some functions that are more often expressed 
morphologically can be covered by syntax, and some functions rendered by syntax 
can be expressed morphologically. For instance, there are both reciprocal prefixes 
and independent reciprocal pronouns, or both an inchoative verb and an inchoative 
suffix on the verb. 

Unsurprisingly, morphology of this kind manifests some properties of syntax. 
Certain slots in the predicative complex are optional rather than obligatory, and 
affix ordering is often more variable than it is in non-polysynthetic languages. The 
fact that reflexives, reciprocals, and relativizers belong to the same category (hold 
the same slots) as personal indexes also suggests a parallel to independent pro- 
nouns in syntax. Yet these similarities with syntax are only partial. Morphology 
may apply specific constraints to the ‘grammar of verbal slots’ which, in combi- 
nation with the phonetic properties of the complex, preclude a multi-word analysis 
of these word forms. 

Derivational morphology varies across families. In South Caucasian and es- 
pecially in Armenian it is similar to European patterns, with root compounding and 
various denominal and de-adjectival suffixes. West Caucasian is traditionally also 
considered to have abundant derivation, but it is difficult to distinguish it from ‘in- 
ternalized syntax’ discussed above. In East Caucasian, derivation plays a much 
more important role in verbal than in nominal morphology: the number of non-de- 
rived verbs is often quite small, especially in Lezgic (as few as a dozen in Khina- 
lug), and other verbal meanings are derived by using primary verbs as conjugated 
heads in combination with nouns, adjectives and formally autonomous elements 
with unclear origins. Such conjugated heads - light verbs or conjugators — are most 
often abstract roots such as ‘do’, ‘become’, ‘say’, ‘give’, etc. In terms of phonetics 
and morphosyntax, the degree of univerbation varies and the conjugators may 
eventually grammaticalize into semantically non-transparent thematic affixes. 
Thus Khinalug has ten odd conjugators, most of which can not be easily traced 
back to any autonomous verb. 

Another type of univerbation is grammaticalization of autonomous verbs into 
suffixes or grammatical clitics. One example is the so-called verificational mood 
suffixes in Archi and Agul; these are very likely to originate from the verb ‘see’. 
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Interestingly, these clitics may preserve some predicative properties, for instance, 
introducing their own arguments. 


(3) Agul: verificational marker (Maisak and Merdanova 2004) 
sifta zun  rus-a-s gada kandej-Cuk'-ase 
at.first I(ERG) girl-oBL-DAT boy love-VERIF-FUT 
‘First UU find out whether the girl loves the boy’. 


The verb 'say' is strongly grammaticalized in reported speech constructions (half- 
way from clitic to suffix) in many East Caucasian languages. In Archi and Agul it 
is cliticized to the main predicate but can introduce its own arguments, the person 
whose speech is being reported (eventually resulting in having two agentive er- 
gatives in the same clause) and the addressee; cf. also the ‘verificator’ participant 
in (3). In Archi, the quotative marker may also have its own auxiliary. Sentence (4) 
is an interesting example of mismatch between phonetic and morphosyntactic 
word boundaries: the quotative clitic depends on the auxiliary, which is an auton- 
omous word. 


(4) Archi: mismatch between phonetic [] and morphosyntactic () boundaries 


tor "zart a<b>¢u [bec'u-q " -{er-Si] 
that-2 LerG <HPL>kill.erv [be.able.prv-pot -{QUOT-CVB] 
e<r>di-li} 


<2>AUX.PFV-EVID} 
‘She was saying that she will be able to kill them’. 


5. Grammatical categories 


Case inventories. Nominal case is typical of the Caucasus. The exception is West 
Caucasian, in which cases are few to none. Abkhaz and Abaza have no core cases 
at all, while Circassian languages and Ubykh only have two, nominative (or abso- 
lutive, depending on the descriptive tradition) and oblique (traditionally called er- 
gative). 

South Caucasian case inventories are typical for a medium-size ergative case 
system. The core cases are nominative for S/P (the only argument of an intransitive 
predicate or the patient of a transitive predicate), ergative for A (the agent ofa tran- 
sitive predicate) and dative, which also plays an important role in core argument 
marking (see Section 6 below). Non-core cases include genitive, instrumental, and 
a case form called “adverbial” or “transformative” (used in contexts like "become 
someone', 'turn into something"). Mingrelian and Laz additionally have two local 
cases, lative and elative, but, interestingly, no dedicated locative marker. To ex- 
press general location, Mingrelian often uses dative (in Georgian this use is more 
limited and peripheral) or postpositions with genitive or dative. Less typologically 
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common are special case forms for vocative in Georgian and benefactive in Min- 
grelian. 

Modern Eastern Armenian also has the usual medium-size case inventory in- 
cluding nominative, genitive, dative, ablative, locative, and instrumental. Dative 
and genitive are formally identical (except in personal pronouns); dative typically 
occurs with a definite article, while genitive cannot take it (a rare point where East- 
ern and Western Armenian diverge). There is no dedicated accusative marker for 
direct objects: Armenian uses dative for human patients and nominative for non- 
human ones (the treatment of non-human animates varies), which is a widely at- 
tested differential object marking pattern. In a way, Armenian has only two core 
cases: nominative or direct, covering subjects and inanimate patients, and genitive/ 
dative or oblique, covering all other functions, including possession, some com- 
mon types of essives (similarly to Mingrelian), indirect objects and animate direct 
objects. 

Interestingly, Udi, which has a very different case system but has been in a 
close contact with Armenian, shows an identical pattern, using dative for definite 
patients as opposed to nominative for indefinite ones. Such distribution is unique in 
East Caucasian, where Udi belongs, differential object marking being on the whole 
atypical of ergative languages. Differential object marking under a different guise 
of marked vs. unmarked accusative is also a common property of Turkic languages 
including Azerbaijani, another source of strong structural pressure on Udi. Similar 
patterns are found in Iranian languages, e.g. Ossetic. In Circassian, too, P may be 
marked or left unmarked, and this also correlates with its specificity. However, it is 
not obvious that we deal with differential object marking here, as the same is ob- 
served on S; and proper names and possessives rarely carry S/P or A marking, most 
probably because of their inherent specificity. 

Ossetic has a sizeable inventory of nine cases including four spatial forms — al- 
lative, ablative, locative and superessive. This is not typical of the modern Iranian 
languages, and only few case markers seem to go back to Indo-European. The pres- 
ent inventory is thus an innovation, which may be due to structural pressure from 
other Caucasian languages. 

It is the East Caucasian languages which are typologically unparalleled in this 
respect — the crowning glory of nominal declension in the Caucasus (Kibrik 2003b, 
and Daniel and Ganenkov 2009 for a brief overview). Here, nominal paradigms 
can count many dozens of forms. The paradigm splits into two sets of forms, syn- 
tactic (abstract) cases and the spatial subparadigm. The inventory of syntactic 
cases includes the usual labels — nominative (alias absolutive), ergative, genitive. 
East (and South) Caucasian are in general experiencer-prominent: unlike many 
European languages, they consistently mark most experiencers with dative. But 
some additionally introduce a special case: affective, marking experiencers with 
modal verbs, verbs of perception or emotion etc. This case is probably of spatial 
origin, attested in Andic and Tsakhur. Sometimes the system is further extended by 
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the comitative ‘with’ as well as less common cases such as forms meaning ‘about’ 
or ‘instead of’. But it is the spatial subparadigm which is a true cornucopia and the 
focus of typologists’ attention. 

Spatial forms commonly include at least two suffixes: a localization marker, 
which specifies a spatial domain with respect to a landmark, and an orien- 
tation marker, which indicates motion to, from or through this domain or absence 
of motion. Some Tsezic and Dargwa languages have a third slot, but this is less 
common; in Tsezic it is often described as approximate location, while in Dargwa 
the third slot is used for vertical or personal deixis. The inventory of orientations 
may include two to five and more items: essive, elative, lative, allative, some- 
times also translative. The inventory of localizations varies from none (more pre- 
cisely — no productive forms) in Udi or two in Khinalug to eight in Agul. Most 
common localizations include iN (often irregular), location inside a hollow ob- 
ject or in a building — ‘in the house’, ‘in the jug’; SUPER, location on the upper sur- 
face of an object — ‘on the table’; cont, location on a surface with a strong fix- 
ation on the supporting plane — ‘picture on the wall’; sus, location under the 
landmark - ‘under the table’; APUD, location in the vicinity of the landmark — ‘near 
the wall'. Cf. an example of localization x orientation combinations in Lak in 
Table 2. 


Table 2: | Some spatial forms from Lak from the total of about 30 (Zhirkov 1955: 36-37) 


qat-lu-w qat-lu-wu-n gat-lu-w-a 

house-OBL-IN house-OBL-IN-LAT house-OBL-IN-EL 

‘in the house’ ‘into the house’ ‘from inside the house’ 
qat-lu-j qat-lu-j-n qat-lu-j-a 

house-OBL-SUP house-OBL-SUPER-LAT house-OBL-SUPER-EL 

‘on the top of the house’ ‘onto the house’ ‘from the top of the house’ 
qat-lu-x, qat-lIu-xu-n gat-lu-x-a 
house-oBL-POST house-oBL-POST-LAT house-oBL-POST-EL 
*behind the house* ‘to the back side of the house’ ‘from behind the house’ 


It is not always possible to draw a clear-cut line between East Caucasian spatial 
forms and syntactic cases, as the former often develop non-spatial meanings 
and thus expand into the domain of abstract cases. Abstract meanings conveyed by 
spatial forms usually include such typologically rare categories as unintentional 
agent (do something inadvertently) or temporary recipient (lend to someone; typi- 
cally one of the latives) or may be introduced by verbs such as "be afraid’ (of what; 
typically one of the elatives) or ‘say’ (to whom; often a lative or essive of APUD). 
However, even highly abstract usages such as temporary recipient may preserve 
elements of spatial semantics (‘hand something over to someone’ rather than ‘lend 
something to someone’) 
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(5) Avar: temporary recipient expressed by a spatial form (Daniel et al. 2010: 291) 
di-q-e L'e-C'o-go, di-je šaj du-ca 
LOBL-APUD-LAT give.IPFV-NEG-SEQ LoBL-LAT why you.sg.OBL-ERG 
ha-b tex L’-ola-r-e-b? 
this-N book give-FUT-NEG-PART-N 
"Instead of giving this book to me for a while, why don't you offer it to me?' 


Crosslinguistically, spatial adpositions may be quite close to suffixes, bound to the 
host so strongly that one may question their status as autonomous words. Thus, 
Vogt (1971) considers Georgian postpositions to be “secondary case markers’. Yet 
East Caucasian localization markers are different in that they leave no doubt about 
their morphological nature; they are bound to the noun in the same way as syntactic 
cases are, being added to the oblique stem. 

The use of oblique stems is another peculiar feature of the East Caucasian lan- 
guages. One case is the unmarked nominative (absolutive). It is opposed to all 
other cases that are formed from the so-called oblique stem: these are oblique 
cases. Oblique stem formation patterns are diverse, but most commonly include 
an oblique stem suffix, sometimes in combination with further modifications of 
the stem (stress shift, vowel change, etc.). The oblique stem is sometimes identical 
to one of the core case forms, genitive in some languages, ergative in the others. 
The plural, which is derived from the singular by suffixation, may behave in 
exactly the same way: the nominative is identical to the nominal stem plus plural 
suffix, and in oblique cases it is extended by a plural oblique suffix followed by 
case markers (identical in the singular and the plural). This pattern is illustrated in 
Table 3 for Lezgian. It is present in quite a few East Caucasian languages, and 
some other patterns of nominal paradigms may be considered as its variations (see 
Kibrik and Kodzasov 1990). 


Table 3: Case formation in East Caucasian: a fragment of the paradigm of luw ‘wing’ in 


Lezgian 

nominative luw luw-är 

wing wing-PL 
oblique stem luw-ä- luw-är-i- 

wing-oBL wing-PL-OBL.PL 
ergative luw-ä luw-är-i 

wing-ERG wing-PL-ERG 
genitive luw-ä-n luw-är-i-n 

Wing-OBL-GEN Wwing-PL-OBL.PL-GEN 
super luw-ä-l luw-ar-i-l 


wing-OBL-SUPER 


Wing-PL-OBL.PL-SUPER 
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In East Caucasian the category OBLIQUE is in a Way ‘internal’ to the word form and 
not related to the syntactic function of the form: it is different from the oblique case 
in West Caucasian. In some East Caucasian languages, however, obliqueness does 
come into morphosyntax by triggering a special type of attributive agreement (see 
Section 6 below on noun phrase structure). 

Morphological expression of possessive relations. Outside West Caucasian, 
notoriously poor in case marking, most languages have a dedicated genitive case. 
In East Caucasian, Tsakhur and Rutul (Lezgic) have a general suffix to mark at- 
tributives: the same marker is used not only on possessors but also on adjectives, 
participles, attributivized adverbs etc. Possessor indexing exists in Turkic, Arme- 
nian and West Caucasian, but is absent from East and South Caucasian (but see 
(Lacroix 2009: 54) for a discussion of possessive clitics in Laz, also present in (11) 
below). In Ossetic, dative-marked possessors (as opposed to genitive possessors) 
are cross-referenced by a cliticized possessive pronoun: 


(6) Ossetic: possessive proclitic (Erschler 2009: 426) 
azemet-en je-qaZen-t-ej 
Azamat-DAT 3SG.POSS=toy-PL-ABL 
‘with Azamat's toys’ (instrumental) 


Among all Caucasian languages, the category of (in)alienability in adnominal 
possession is present in Adyghe, the Besleney dialect of Kabardian, Budukh and 
Khinalug. For West Caucasian alienable possessees, the personal possessive pre- 
fix is followed by a special marker of possession, while for inalienable ones it im- 
mediately precedes the nominal stem. Remarkably, in Budukh and Khinalug the 
contrast between alienable and inalienable possession is not conveyed by marking 
on the possessum but by marking on the possessor, which is typologically very 
rare. 

Number. South Caucasian and Armenian as well as Turkic have one marker for 
nominal plural number; the use of additional markers is exceptional. West Cau- 
casian languages are essentially similar, although e.g. Abkhaz distinguishes be- 
tween human and non-human plurals. Many East Caucasian languages, on the 
other hand, have rich inventories of plural markers, some of which are limited to 
human nouns, while others follow morphophonological distributions. The Cauca- 
sus lacks additional number categories such as dual (except some north-eastern 
dialects of Georgian, see Boeder 1998, Tuite 1998) or paucal. On the other hand, 
associative plurals *X and associates’ are quite widespread, and may be expressed 
either by regular plural markers (when attached to first names or kin terms) or by 
dedicated suffixes. An interesting fact about Armenian and Georgian number mor- 
phology is that both have ‘old’ plurals preserved from the classical period. Arme- 
nian uses its old plural in personal pronouns, Georgian uses it in some specific con- 
structions and for stylistic purposes, and both languages use the old plural in 
demonstrative pronouns and in associative plural formation. 
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A peculiar use of plural suffixes is attested in Armenian. As in many other lan- 
guages, Armenian postpositions may take possessive suffixes instead of a depend- 
ent nominal. But Armenian has first and second person possessive suffixes only in 
the singular, and to express meanings such as "on us’ the singular possessive suffix 
is combined with the nominal plural marker (e.g. vra-ner-s literally “on-PL-MY”). 
This is distantly parallel to the usage some Turkic languages make of the plural 
marker on third person possessed nouns (where noun-PL-Poss3 is ambiguous be- 
tween plural possessum and plural possessor or both). 

In all Caucasian languages, the use of numerals (and sometimes other quanti- 
fiers as well) requires the singular form of the noun. However, Armenian, just like 
Turkic languages, also allows plurality under certain semantic or pragmatic condi- 
tions, usually associated with specificity and related factors. 

The Digor dialect of Ossetic has a special form of nouns in oblique noun 
phrases containing numerals: goret-i (town-Loc) ‘in the town’, but cuppar 
goret-e-mi (four city-COUNT-Loc) ‘in the four towns’, cuppar goret-e-j (four city- 
COUNT-GEN) ‘of the four towns’, while in nominative noun phrases the genitive of 
the noun is used, as in cuppar goret-i (four city-GEN) ‘the four towns’ (Isaev 1966: 
51). 

Agreement. Caucasian Indo-European, Turkic, West and South Caucasian all 
have personal indexing on the verb (although in some West Caucasian languages 
these indexes are functionally closer to arguments than to genuine agreement) but 
no nominal classes. Abkhaz and Abaza are (possible) exceptions, because they 
have personal indexes distinguishing gender and animacy. 

East Caucasian languages are different. Not many of them possess personal 
marking on the verb (e.g. Lak, Dargwa, Udi, Tabasaran, Bats) and, when they do, 
this seems to be an innovation; other languages have various indirect ways of ex- 
pressing person-related categories on the verb (e.g. Archi, Akhvakh). Instead, their 
whole structure is permeated with nominal classes. With the exception of a few 
Lezgic languages (Agul, Lezgian, Udi) that lost their class agreement, East Cau- 
casian have three to five classes in the singular, including classes for male and fe- 
male on the one hand, and non-human classes with some semantic clusters but 
without consistent motivation on the other. The common pattern in the plural is to 
distinguish between humans and non-humans only. The most complex class sys- 
tems are attested in Nakh languages, with sophisticated mapping between singular 
and plural classes. 

The striking feature of class agreement in East Caucasian is its pervasiveness. 
In some languages (e.g., Andic languages), certain types of possessors agree in 
class with their heads. Under clausal agreement, the nominative (S/P) may control 
agreement on verbs, adverbs, some case forms of nouns and pronouns, particles, as 
in (7). In several Andic languages, a converb may agree with an NP which is not an 
argument of the converb but an argument of the main predicate (8). Finally, an im- 
portant syntactic phenomenon in East Caucasian is so-called “backward control", 
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where the overt controller belongs to the subordinate clause (Polinsky and Pots- 
dam 2001); cf. (9). 


(7) Archi: pervasive agreement (Kibrik et al. 1977b: 84) 
zari  Oo«»ka-qi ja-r, bo-li, d-ez d-og’i 
LERG «2>take.away.PFV-POT this-2(NOM) say.PFV-EVID 2-I.DAT 2-give.IMP 
‘I will marry her, he said, give (her) to me’. 


(8) Akhvakh: external agreement of the converb (Creissels, in press) 


mollarasadi w-ur’-I $wela-La m-äne 
Molla Rasadi m-die-HPL[cvB] graveyard-IN.LAT HPL-g0.PROG 
b-ak’-I goli. 


HPL-be-HPL(CVB) AUX.HPL 
“Molla Rasadi died, and they were going to the graveyard’. 


(9) Ingush: backward control (Nichols 2008: 61) 
masa Co b-aq-a b-ieza az 
how.many  hair(6) 6-cut-ınr 6-should I.ERG 
‘How many hairs should I cut?’ 


Note also that agreement patterns are formally quite complex and may involve suf- 
fixing, characteristic of attributive agreement, and prefixing and infixing, typical of 
verbal agreement. One word form may have several agreement slots. At the same 
time, in Khinalug all agreement markers except feminine, while present, are real- 
ized as zeros in most morphophonological contexts. 

Tense-Aspect-Mood. Aspectual morphology is rather diverse. East Caucasian 
often has verbal paradigms based on several primary stems; the two most import- 
ant stems are perfective and imperfective, and the imperfective stem is sometimes 
derived from the perfective one. Specific tense, aspect and modal meanings choose 
one of the two stems, according to their functional semantic preferences. This two- 
stem architecture of the verbal paradigm is absent from e.g. Avar and is completely 
unknown in other endemic families (there are, however, quite a few verbs that dis- 
tinguish perfective and imperfective stems in Armenian, cf. Plungian 2006). South 
Caucasian and Ossetic use verbal prefixes for perfectivization, in a way partly 
similar to Slavic. Armenian uses analytical constructions with ‘be’ and perfective 
vs. imperfective converbs; the verbal paradigm is largely analytic, except for aor- 
ist, imperative, subjunctive and positive conditional. In the western Caucasus, 
Abkhaz and Abaza have rich but on the whole common inventories of tense, aspect 
and mood markers bound to specific slots. In Adyghe and Kabardian, however, 
these markers may combine in sequences of up to three or more markers to form 
complex TAM configurations: 
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(10) Adyghe: stacking of tense-aspect markers 
wone-m she-r tjo-tha-Be-Be-n faje 
house-oBL head-NOM  Loc-put-PsT-PST-POT must 
“Presumably, the house had been covered by a roof’. 
(e.g. there are traces of a roof on a now-roofless house) 


An important distinction found in various Caucasian languages, partly correlated 
with aspect, is the opposition between stative and dynamic verbs, attested in e.g. 
West Caucasian and Lezgic; experiential statives also play an important role in the 
morphosyntactic classification of the predicates in South Caucasian. Lezgic 
statives (‘want’, ‘know’, ‘be ill’) are deficient compared to regular verbs; they may 
lack the two-stem distinction, infinitives etc. 

The Caucasus is fairly rich in volitional moods. Some East Caucasian lan- 
guages have mild imperatives as a morphological category and Nakh also has fu- 
ture imperatives (Johanna Nichols, p.c.). Imperatives may show special morpho- 
logical behavior: in Archi, imperatives of intransitive verbs optionally express the 
number of addressees; in Akhvakh, the optative shows a special pattern involving 
unusual agreement (Denis Creissels, p.c.); and in some Andic languages there is a 
morphological difference between imperatives of transitive and intransitive verbs, 
not expressed anywhere elsewhere in morphology (Kibrik 2003a). East Caucasian 
has dedicated prohibitive (negative imperative) morphology. West Caucasian and 
Turkic also express negative imperative morphologically, but they use a marker of 
negation that is used in other morphosyntactic contexts, while Armenian uses a 
dedicated marker, but it is a particle; both patterns are cross-linguistically much 
more widespread than the one attested in East Caucasian. Several languages use 
dedicated marking for apprehensives (as in ‘do not play with the dog lest it bites 
you’). West and East Caucasian are known to have morphological performative op- 
tatives (forms used in curses and blessings), and at least Abkhaz and Nakh addi- 
tionally introduce desiderative optatives (forms used to express speaker's wishes 
and dreams). Both categories are cross-linguistically rare — and yet both are also 
attested in Caucasian Turkic languages Karachay-Balkar and Kumyk. Notably, op- 
tatives are on the whole atypical of Turkic languages - the category traditionally 
labeled optative in Turkic linguistics is closer to the imperative. For a detailed dis- 
cussion of optatives in the Caucasus see (Dobrushina 2009). 

Inferential evidentials (‘apparently, a situation P took place’) are common in 
the Caucasus (cf. Chirikba 2008) but are rarely if ever conveyed by dedicated 
markers. The inferential meaning of a form is determined by the context. It may be 
semantic context, as with Armenian or Georgian or some East Caucasian perfects, 
or even a special syntactic pattern, as the evidential reading of the past converb 
when used as independent predicate in Archi or some Tsezic languages. Hearsay 
evidentials (‘they say that P^) often grammaticalize from speech verbs contracted 
into clitics in East Caucasian; other evidentials seem to be peripheral and weakly 
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grammaticalized or not present at all. For details, see Johanson and Utas (2000), 
Tatevosov (2001), Chirikba (2003). 

Valency increase and decrease. East Caucasian languages are poor in valency 
changing processes. The only valency-related process regularly present in the 
family is causative formation, which formally varies from morphological marking 
in e.g. Tsezic and Andic to transparent “do’ (or, rarely, ‘give’) periphrasis in some 
Lezgic languages and show many cases that are intermediate between biclausal 
and monoclausal constructions. South Caucasian, West Caucasian and Armenian 
all have robust morphological causatives. Valency decrease is almost absent from 
East and West Caucasian, but prospers in Armenian which has a mediopassive 
(middle) suffix covering passive, reflexive, reciprocal and other functions, similar 
to reflexive verbs in Slavic. 

South Caucasian, on the other hand, has a most sophisticated system of valency 
derivation — the so-called versions, expressed by vowels immediately preceding 
the verbal root. With a typical Georgian version, a secondary semantic role — bene- 
ficiary in (11), external possessor in (12) or a spatial function in (15) below - re- 
ceives indirect object marking and becomes eligible to control the agreement on 
the verb (which is otherwise impossible). In a way, versions promote non-core ar- 
guments into core argument positions and are thus typologically close to appli- 
catives (Lacroix 2009: 484 ff.); Lacroix further argues that promoted arguments are 
ambiguous between indirect (dative marking) and direct (agreement control) ob- 
jects. 


(11) Laz: “objective version” — recipient promotion (Lacroix 2009: 496) 
aždaa m-i-il-a-t-na bozo-Ckimi me-k-€-ae-a 
dragon 1-AppL-kill-opt-1/2.PL-conp girl-1PL PRVB-2-give-1/2.sG-QUOT 
‘If you kill a dragon for our sake, I will give you our daughter’. 


A version marker may also convey various mediopassive meanings or, more gen- 
erally, indicate that the subject is (implicitly) present elsewhere in the predication — 
in the following example, the agent is also the possessor of the waistcoat: 


(12) Laz: "subjective version" (Lacroix 2009: 464) 
tabak’a do  mendil jelegi-š ceb-epe-s 
snuff.box and handkerchief waistcoat-GEN pocket-PL-DAT 
dol-i-du-mer-nan 
PRVB-MED-put-TH-PRS3PL 
“They are putting their snuff-box and their handkerchief in their (own) pockets’. 
(lit. ‘they are putting snuff-box and handkerchief into waistcoat pockets to 
themselves’ ) 


West Caucasian languages are also described as languages rich in applicatives, as 
they readily introduce benefactive, malefactive, comitative, locative and other 
slots into the verbal complex: cf. (2). However, in polysynthetic languages like 
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West Caucasian, these slots are not — or not straightforwardly — agreement slots 
(see Section 2 for discussion of West Caucasian morphology). Thus, the same slot 
may occur several times in the word form: 


(13) Adyghe: two benefactive slots in one word form (Lander and Letuchiy 2010) 
s-a-fo-@-f-e-txe 
1sG.s/P-3PL.IO-BEN-3SG.IO-BEN-DYN-Write 
‘I write to them for his benefit'. 


To argue that these two benefactive slots are agreement slots we need to posit two 
different benefactive agreement positions. More natural seems a parallel with two 
dative NPs which, in many languages, may be present in the same clause to convey 
similarly different meanings. In other words, at the morphological level the two 
benefactives behave more like adjuncts at the level of syntax: they are optional and 
may be repeated. The same issue holds for valency decrease patterns, e.g., reflex- 
ive forms, which may be analyzed as a verbal complex containing a pronominal- 
like reflexive element rather than verbal forms containing a morphological marker 
of reflexivization. 

Pronouns. In East, West and South Caucasian, all languages with a domi- 
nant ergative construction, personal pronouns often do not distinguish ergative 
and nominative forms. This does not necessarily involve all persons and numbers, 
but in e.g. Georgian first and second person pronouns are indeclinable, and when 
case marking is required, possessive pronouns are added to the declined noun 
‘head’: 


(14) Georgian: substitute declension of personal pronouns (Boeder 2005: 54) 
sabralo éven-o tav-o, sac’q’al-o sakartvelo-v 
pitiful our-voc head-voc, pitiable-voc Georgia-voc 
*O pitiful us, poor Georgia!’ 


Inclusives are attested in various East Caucasian languages (although not present 
in all of them - they were lost in e.g. Tsezic, Lak and some Lezgic and Dargwa 
languages). Archi has a peculiar inclusive pattern: this language probably first lost 
the distinction, integrating formerly inclusive and exclusive forms into one ‘neu- 
tral' first person plural paradigm, but then reintroduced the distinction by adding a 
reflexive / emphatic particle to specify inclusive reference. The only inclusive re- 
ported outside East Caucasian is attested in Svan (personal agreement and possess- 
ive pronoun); inclusive in Abkhaz (Hewitt 1979: 157) is contested. 

Caucasian languages tend to lack third person pronouns: in Armenian, Ossetic 
and South Caucasian (and probably also in Nakh) substantive third person pro- 
nouns are historically derived from — though not identical to — attributive demon- 
stratives. Many West and East Caucasian languages have demonstratives that func- 
tion like personal pronouns. The system of deictic reference may be rather 
complex: from Armenian and Georgian and West Caucasian, which have a tripar- 
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tite deixis (‘this one here’ ~ ‘that one there’ ~ “that one yonder’), to East Caucasian 
languages, which are obsessed with space and introduce further spatial distinctions 
such as (‘this below’ ~ ‘this above’) and other spatial nuances (see Schulze 2003). 
Most East Caucasian languages also have a special pronoun with a very wide range 
of usages covering logophoric, reflexive, resumptive, emphatic and other func- 
tions. With this variety of functions, this pronoun seems to be a special linguistic 
means of reinforced reference tracking, whose similarity to reflexive pronouns in 
European languages or logophoric pronouns in African languages is only partial. 
For a discussion see Testelets and Toldova (1998) and Lyutikova (2000); cf. also 
Donabedian (2007) for a similar phenomenon in Armenian. 


6. Syntax: some highlights 


In a brief survey of such a diverse linguistic area as the Caucasus, giving a full ac- 
count of syntactic variation is even less feasible than a full account for morphology 
or phonology. This section is a balance between central issues, on the one hand, 
and salient features, on the other, including noun phrase structure, alignment, fi- 
niteness, focus marking, relativization, clause combining and reported speech con- 
structions. 

Noun phrase syntax. Although word order in Caucasian languages is quite 
flexible, the basic word order is invariably SOV (cf. e.g. Testelec 1998a and 1998b 
for East and South Caucasian). Consistently with SOV typology, NPs are left- 
branching, except for marked patterns, e.g. constructions involving heavy depend- 
ents. Typical for SOV is also the use of postpositions rather than prepositions: Ar- 
menian has few prepositions (preserved from Classical Armenian), and so does 
Ossetic, but West and East Caucasian do not have any. In noun phrases, West Cau- 
casian and Turkic languages use head marking, while East and South Caucasian 
lack possessive pronouns and use genitive marking on the dependent (Armenian 
uses genitive pronoun or possessive suffix for the first and second person singular 
possessor and only genitive marking in other cases). 

Within the noun phrase, agreement between the head and the dependent varies 
from poor to none. In South Caucasian, there is no NP-internal agreement except 
partial agreement in case on adjectives. East Caucasian languages typically have 
class agreement, which also includes number (class / number agreement is syn- 
cretic), although some have special plural agreement suffixes for adjectives (e.g. 
Dargwa). Nominal possessors may agree with their heads in class e.g. in Andic lan- 
guages. Less common patterns include cases of double case marking in South Cau- 
casian (15), where a genitive dependent in a non-canonical position copies the case 
suffix of the head noun (Suffixaufnahme), and agreement in obliqueness in Tsezic 
languages, where there are two genitive markers, one used when the head noun is 
in the nominative and the other when it is in any of the oblique cases. Similar 
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agreement in obliqueness is also attested in e.g. Khinalug and Nakh, but here it 
applies only to demonstratives or adjectives. 


(15) Mingrelian: Suffixaufnahme (Boeder 1995: 194) 
gi-Ø-a-ntxū-d-esə k'ata-so te xenc’ape-Si 
PRVB-3o-Loc-fall-iPF-3s.PL.PST people-DAT this king-GEN 
yomala-&i-so 
dominion-GEN-DAT 
"They attacked the people of this king's dominion'. 


Another rare pattern is incorporation in West Caucasian NPs: 


(16) Adyghe: noun - adjective complex (Arkadiev et al. 2009: 50) 
cC’ of-g’asxe-r 
man-brave-NoM 
*a brave man' 


When used headlessly, all attributes take nominal inflections, which directly fol- 
low the nominal stem. With genitive attributes, this leads to having two case 
markers in one word form and, in some languages, to clearly group-marking pat- 
terns, where the nominalized attribute attaches its own dependent ‘prior’ to nomi- 
nalization; (in (17) the brackets show NP boundaries): 


(17) Bagvalal: group marking (Kibrik et al. 2001: 212) 


a-b hob [in-H-1-da waS$a]-Su-b-H-ba 
this-N tomb(noM) self-0BL-GEN-PTCL SON-OBL.M-N.GEN-OBL-AFF 
b-ati-li-b-o, o-w-la raq’wa-la W-ü-W-O 


N-similar-VBLZ-N-CVB _ that-M(NoM)-and  heart-sUP.LAT | M-come-M-CVB 
“As this tomb was similar to the one of my own son, I remembered him ...' 


This type of interaction between syntax and morphology is very different from the 
competition described above in Section 3 for the West Caucasian; rather, it is a 
mismatch between phrasal constituents and morphological word boundaries. 

Of all Caucasian languages only Armenian, Abaza and Abkhaz possess gram- 
maticalized definite articles (the Old Georgian definite article was lost in the mod- 
ern language). The East Armenian definite article, which most probably comes 
from the third person possessive marker, is used in a wider range of contexts than 
European articles — it is obligatory on inherently specific nouns such as proper 
names. There is also a peculiar interaction between the definite article and the 
category of case: it cannot be used on non-core cases or the genitive, but is almost 
obligatory on the dative. It is thus only in the nominative that the marker has the 
value of a genuine definite article. Some East Caucasian languages use demon- 
stratives more or less article-wise, but these means are much less obligatory and 
less bound to the head noun than in Armenian or Abkhaz. In many languages 
the numeral *one' is used as an introductory indefinite article, but it does not seem 
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to grammaticalize to the extent of European indefinite articles to include non- 
specificity. 

Ergativity. Notably, all three endemic families are consistently ergative 
(though see below on South Caucasian), while Indo-European and Turkic lan- 
guages spoken in the Caucasus show no traces of ergative alignment at all. In East 
Caucasian, case marking follows the S/P vs. A distinction (but see Section 5 on 
neutralizing the distinction in pronouns and on differential object marking in Udi), 
and all agreement slots are controlled by nominative arguments. West Caucasian 
languages also use cases — when they have any — distinguishing between direct for 
S/P and oblique for all other functions including A; and in all West Caucasian lan- 
guages there are different slots for indexing S/P and A. Now, the real challenge of 
ergative morphosyntax comes from sophisticated systems of case marking in 
South Caucasian, where the same marker is used to indicate different roles depend- 
ing on the morphosemantic category of the predicate and its tense. Cf. Table 3, 
which represents the distribution in a rather simplified way (without taking into ac- 
count classes of predicates): 


Table 3: Tense and case function in Georgian 


Present series Aorist series Perfect series 
P -S DAT 
-i / Ø NOM 
S 
-i / Ø NOM 
A -ma | -m / -man ERG -S DAT 


Other South Caucasian languages show variations on this theme. In some Laz var- 
ieties, the ergative extended to A in the present, and in some others case marking of 
the core arguments was lost completely and the dative is used exclusively as a lo- 
cative marker. The Mingrelian “ergative” has extended to S in the aorist. This 
change affected the whole system: the aorist switched from the ergative to accus- 
ative construction, and -i (Ø) assumed the accusative function, opposite to what it 
conveys in the present. For a discussion see, for example, Harris (1985). 

Syntactic ergativity and the distribution of subject properties between S and O 
is another rather complicated issue and subject to variation. In Modern Georgian, 
conventional tests such as coordinate deletion and reflexivization demonstrate ac- 
cusative syntactic behavior. In West Caucasian, too, reflexivization most often fol- 
lows the accusative pattern; but other tests are more problematic. And especially 
problematic is the notion of subject in East Caucasian. Some studies suggest the 
complete absence of a syntactic subject or, if there is one, the presence of a “weak” 
subject (Kibrik 1997). Compare the following reflexive constructions in Lak, 
where, according to Kibrik (20032), possessive reflexivization is controlled by the 
linear order rather than the role of the NP: 
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(18) Lak: role-independent possessive reflexivization (Kibrik 2003a: 477) 
a. gWani-l ci-l-b-a su b-awtun-di 
she-ERG self-GEN-3-PTCL sister(NOM) 3-beat-3sc 
*She beat up her own sister'. 
b. su ci-l-Ø-a buta-l b-awtun-di 
sister(NOM) Self-GEN-1-pTcL father-ERG 3-beat-3sc 
"Ihe sister was beaten by her father’. 
(lit. ‘her own = sister's own father beat the sister’) 


The diversity of accounts of syntactic alignment comes from the fact that the situ- 
ation varies from language to language, from test to test (Kibrik 2003a) and pos- 
sibly from speaker to speaker. 

Finiteness. In South Caucasian finite forms bear personal agreement markers, 
while e.g. participles and action nominals do not; this is reminiscent of European pat- 
terns. In West Caucasian, finiteness is especially salient in Abkhaz and Abaza, where 
verbal forms clearly split into a finite and a non-finite sub-paradigm, while in Kabar- 
dian and especially in Adyghe the distinction is less pronounced. Formal differences 
are less obvious in East Caucasian: participles, action nominals, infinitives and 
converbs include the same class markers as finite forms do, although participles can 
also take class suffixes (typical of adjectives), and action nominals may attach case 
markers. In some East Caucasian languages participles (sometimes also other nonfi- 
nite forms) may in some contexts appear as predicates in independent clauses (see 
e.g. Kalinina and Sumbatova 2007). In Khinalug, the two main finite forms are most 
often formally identical to participles, which may be seen as the endpoint of the par- 
ticiples’ expansion into the finite domain. In some languages, general converbs can 
be used finitely (e.g. for unwitnessed past in Tsezic and Archi). This use is likely to 
originate from the auxiliary construction by omitting the finite auxiliary predicate. 

Focus constructions. By default, the focused element is placed in the prever- 
bal position. South Caucasian also makes a wide use of clefts (Boeder 2005). But 
many other languages of the Caucasus feature peculiar types of focus marking 
(Testelec 1998c). In some languages (Lak, Udi, Dargwa), the focused constituent 
may be followed by a personal agreement clitic (cf. Kazenin 2002, Harris 2002). 
The finite predicate in such clauses sometimes changes to a participle (Kalinina 
and Sumbatova 2007). In some other East Caucasian languages, e.g. Bagvalal and 
Agul, and in Armenian (see Comrie 1984), the focus is marked not by a per- 
sonal marker but by the position of the copula, which must immediately follow the 
focused item. 


(19) Itsari Dargwa: focus marking by personal clitic (Sumbatova and Mutalov 
2003: 141) 
ih sibij c’il-di-q’al w-a'r-un-ci u 
that Gibi later-2-but M-go.PFV-PRET-PTCP you.sg(NOM) 
‘But it was later on that you went to that Gibi!” 
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Where East Caucasian employs special material, West Caucasian may rely on 
syntax only. To focus an argument, a syntactic reorganization is required, which 
transforms the original structure into a clause structurally similar to equative predi- 
cations. 


(20) Focus in Adyghe 

a. Coyo-r  wone-m  2o-pashe je-t 
tree-NOM house-oBL 3sG.PR-front Loc-stand 
"The tree is in front of the house’. 

b. čəyə-r  zə-paşhe je-to-r wone-r a-ro 
tree-NOM REL.PR-front Loc-stand-Nom house-NoM that-PRED 
‘It is in front of the house that the tree is’. 
(lit. ‘in front of what the tree stands is the house’) 


Relativization. Georgian and Armenian have European-style relative clauses 
headed by interrogative-based relative pronouns; while Ossetic follows cor- 
relative, and Tat participial pattern of relativization. In Laz relative clauses, the 
predicate immediately precedes the head noun, as in the participial strategy com- 
mon in the Caucasus, but is finite, which makes it similar to European relative 
clauses. However, there is no relative pronoun, only a subordinating clitic hosted 
by one of the constituents inside the relative clause. 

Most East Caucasian languages lack dedicated relative pronouns (Udi and Bats 
are rare exceptions, probably contact-induced) and use a participial strategy. The 
most salient feature of relative clauses in East Caucasian is the extreme flexibility 
of relativization, which may be based on virtually any argument and adjunct as 
well as their possessors (see also Comrie and Polinsky 1999) and, in some rare 
cases, may not even be traceable to any syntactic position at all. In the following 
example, the noun ‘meat’ heads a relative clause it never belonged to, denoting an 
item which is “possessed by” a participant in the relative clause (‘cattle’): 


(21) Archi: ‘untraceable’ relativization (Kibrik et al. 1977b: 107) 
[hawan bu-r'u-tu-t ar lun Lo-t'u 
cattle(NoM,3) 3-slaughter.prv-ATR-4 meat(NoM,4) 4.give.PFV-NEG 
"They wouldn't give (us) the meat of the slaughtered sheep'. 


Unlike the relative pronoun strategy, the participial strategy does not indicate the 
role of the target of relativization within the relative clause. Some East Caucasian 
languages may use reflexive pronouns in resumptive function. These are mostly 
employed when the role of the target cannot be easily established: 


(22) Lezgian: reflexive pronoun used as a resumptive (Haspelmath 1993: 342) 
čun witi-kaj raxa-zwa-j kas 
we(NoM) self-suB.EL talk-iPFv-PTCP man 
*the man we're talking about 
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West Caucasian languages can be described as having relative pronouns, but these 
are represented by prefixes. They also feature what is called internally-headed 
relative clauses (23), a phenomenon widespread in the world but quite rare in Eu- 
rope. Typologically unique is multiple relativization, with several coreferent 
relative prefixes within the same clause (see Lander 2009 on Adyghe); cf. (24). 


(23) Abkhaz: internally-headed relative clause (Andrej Kibrik 1992: 148) 


[s-ab a-S$"no jo-jo-rgola-z] 
1sc.PR-father pEF-house ` REL.S/P-3sG.M.A-build-NFIN.PST 
(Ó-aajgvara s-golo-w-p' 


3sc.N.oBL-near  lsG.Ss/P-stand-PRES-FIN 
‘I stand near the house built by my father’. 


(24) Abkhaz: multiple relativization (Hewitt 1979: 37) 
z-an jo-lo-ca-m-noq’ Va(-z) a-C k'von 
REL.PR-mother REL.S/P-3SG.F.IO-COM-NEG-gO-NFIN.PST DEF-boy 
*the boy who did not go for a walk with his (lit. whose) mother' 


Clause combining. In European languages complex clauses are either coordinate 
or subordinate. South Caucasian and Armenian are roughly compliant with this 
model. In East and West Caucasian, however, the situation is different. Conjunc- 
tions, both coordinating and subordinating, are few. For mere sequencing of 
events, chains of general converbs are built, which combine properties of subordi- 
nation with properties of coordination (Kazenin and Testelets 2004). 


(25) Akhvakh: general converb in a coordinative construction (Denis Creissels, in 


press) 

molla-s"-e taxi-gunu igora b-ex-e jhora-ge ger-i 
Molla-oBL.M-ERG pocket-EL bread w-take-cvB.N lake-Ess inside-Ess 
tük-ini 

dip-EviD 


‘Molla took some bread from his pocket and dipped it into the lake’. 


To specify the adverbial relation linking the dependent predicate to the main clause 
(‘because of the main event’, ‘despite the main event’, ‘simultaneously with the 
main event’, etc.) special converbs are used. The inventory of these special con- 
verbs may count up to some dozen forms; sometimes, they contain nominal case 
markers also used in adverbs. Other dependent forms corresponding to European 
complementation include case forms of action nominals (e.g. for purposive and 
causal relations), infinitives (used under e.g. phasal and modal matrix verbs), 
sometimes also participles and general converbs. 

Semidirect speech. In general, reported speech in East Caucasian is very dif- 
ferent from what we find in (the prescriptive grammars of) European languages. In 
Standard Average European, reporting is either consistently direct (intended to re- 


The Caucasianlanguages 149 


produce the actual speech act) or consistently indirect (using special marking of re- 
porting, such as the use of reflexive pronouns in logophoric function or shifting the 
personal, temporal and spatial deictic perspective to the actual speaker's viewpoint). 
In East Caucasian, the same clause may combine direct and indirect elements. 
In the following Archi example, reported speech combines the use of indirect per- 
sonal deixis (agreement) with direct illocution (prohibitive) within the same word. 
Direct reporting meets indirect reporting within the same word form: 


(26) Archi: ‘semidirect word form’ (from the texts collected in Dirr 1908) 


OS iq-n-a jemib ummu 
one day-oBL-IN this.PL father.oBL(ERG) 
b-o«»q*-ir-gi bo-tu-t ` raq-m-a-8i o<b>gqia-li. 


HPL-<IPFV>gO-IPFV-PROH say-ATR-4 side-OBL-IN-ALL <HPL>gO.PFV-EVID 
“One day they went in the direction that the father said not to follow’. 
(lit. ‘to the father “do not go”-said side’) 


The actual prohibitive form being reported is org’irgi ‘you (plural) do not go", 
with a special 1st/2nd person plural zero agreement. When reported, the agreement 
switches to regular HPL borg‘irgi ‘they do not go’, because the actual addressee 
(the listener) is no longer identical with the addressee of the form. However, the 
narrator uses the form of the prohibitive mood, which may only be addressed to the 
second person. 

Patterns of semidirect speech reporting are an important and understudied phe- 
nomenon in East Caucasian. In some languages of the family, however, it may play 
a less important role, and seems to be closer to European in West and South Cau- 
casian (but see Hewitt and Crisp 1986) or Armenian. 


7. Perspectives 


While the tradition of grammatical description of literary Armenian and Georgian 
goes back to the early Middle Ages, the systematic linguistic study of East and 
West Caucasian languages started only in late 19th (Petr Uslar) and early 20th cen- 
tury (Adolf Dirr) and has been continued mainly by the Soviet scholars from 
Makhachkala, Tbilisi, Nalchik, Maikop and Baku. At that time, not many linguists 
from outside the Caucasus were interested in the area; an important player was the 
Georgian school. From the late 60s on, the Moscow school with its team fieldwork 
method started to play an important role in language documentation, producing a 
series of detailed up-to-date morphological and syntactic descriptions of Cau- 
casian (primarily East Caucasian) languages. Since the 70s and especially during 
the 90s, the interest in minority languages of the Caucasus has grown significantly 
in the West, and triggered what might be called a frontal attack on linguistic mi- 
norities of the Caucasus. Today, Caucasian studies form an impressive network of 
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groups and individuals in the Caucasus, Moscow, Europe and the USA. New ap- 
proaches with their shift from grammatical description to language documentation 
are coming to the Caucasus. Linguistic corpora are being compiled, both for major 
languages such as Eastern Armenian (www.eanc.net) and smaller languages in- 
cluding Archi, Khinalug, Agul, Udi, Dargwa, Bezhta, Ossetic. These corpora 
sometimes include digital archives of audio and even video recordings which are 
accessible via internet (cf. www.ossetic-studies.org and www.philol.msu.ru/ 
~languedoc/eng). 

By now, no language in the Caucasus is left undescribed. This, and the absence 
of an immediate threat of language death, makes this area look quite well-off in 
terms of description, as compared to Central Africa or Papua, where many lan- 
guages lack even primary accounts, or North America and Siberia, where lan- 
guages are on the verge of extinction. However, descriptions of Caucasian lan- 
guages, especially those produced in the early to mid-20th century, need revision 
from the perspective of the typological knowledge accumulated over the last few 
decades. The problem is especially acute in syntax, where older descriptions often 
followed the model of Russian grammar, which is drastically different from the 
syntactic patterns attested in the Caucasus. Syntactic research is thus a typological 
priority for Caucasian languages. 

Another important direction of research is the dialectal variation of major lan- 
guages. While modern research has focused on minor and previously undescribed 
languages, the dialect documentation of Armenian, Azerbaijani, Georgian has 
moved off-focus (partly for economic reasons) before the new typology-oriented 
approaches to language description have emerged. The documentation of Avar or 
Dargwa dialects never did well because of the sociopolitical pressure of written 
standards. Corpus building is a necessary prerequisite for both dialect documen- 
tation (because it shows differences from the literary language in a natural, spon- 
taneous environment) and syntactic research in minority languages. 

Even more than descriptive and comparative studies, the Caucasian languages 
lack approaches that traditionally develop after the first descriptive effort has been 
completed. Thus the field of psycholinguistics (color terms, language acquisition, 
etc.) is almost vacant. Given the multilingualism of some areas, the Caucasus is 
looking forward towards fundamental sociolinguistic studies. The issue of lan- 
guage contact in the Caucasus clearly deserves more attention, even more so be- 
cause Caucasian languages constitute one of the first Sprachbunds posited in the 
history of linguistics (Nikolai Trubetzkoy’s hypothesis, see Chirikba (2008) for a 
recent discussion); there are several non-endemic languages like Eastern Arme- 
nian, Tat and Ossetic (Indo-European), Azerbaijani and Karachay-Balkar (Turkic) 
that have been in contact with endemic languages for many centuries. 

A short overview chapter cannot represent the modern bibliography of Cau- 
casian linguistics — thousands of volumes and articles — in anything but a partial 
way. Here we provide the reader with references only to those general surveys 
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where further indications to books and papers on specific languages, language 
families and linguistic topics related to the Caucasus can be found. Recent publi- 
cations include first of all Hewitt (2004) and van den Berg (2005); earlier introduc- 
tions to the Caucasian field can be represented by Klimov ([1986] 1994). For an 
overview of East Caucasian grammars see Schulze (2005). Detailed reference de- 
scriptions of non-endemic languages in English include Abaev (1964) for Ossetic 
and Dum Tragut (2009) for Eastern Armenian. Linguistic diversity in the Caucasus 
is discussed in Catford (1977) and later in Comrie (2008); main sources on the 
Caucasian ethnolinguistic geography are Koryakov (2002) and Tsutsiev (2006). 
Bokarev and Lomtatidze (1965), Greppin (1991-1994), and Alekseev et al. (1999) 
are collections of sketches of Caucasian languages. 
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1. Introduction 


The Turkic language family comprises a group of genetically related languages 
spoken over a vast geographic area, from North-Siberia to Iran, and from North- 
West Europe to Northern China. The question of a genetic relatedness of the Turkic 
languages with Mongolic, Tungusic, and even Korean and Japanese, forming the 
so-called Altaic language family, is still debated (for a recent introduction to the 
discussion see Robbeets 2005: 18-29). 

The oldest primary sources for the Turkic languages are the so-called Köktürk 
runiform inscriptions of Mongolia from the late 7th century onwards. Old Turkic 
manuscripts from the 9th century onwards written in various scripts (Uyghur, 
Soghdian, Tibetan, etc.), have been found in North-Eastern China. As a result of 
the conversion to Islam of Turkic groups from the 10th century onwards, the im- 
pact of Arabic and Persian grew significantly, particularly among Turkic groups of 
the Central Asian region (Kipchak, Oghuz, Uzbek). In the 11th century, this cul- 
tural influence led to the introduction of the Arabic alphabet for writing Turkic. 
The varieties documented in various texts up to that time can, despite a certain lin- 
guistic variation they display, all be subsumed under the term Old Turkic (see Erdal 
2004: 6-22). 

Language-internal factors and geographic distribution as well as the growing 
impact of contact languages (Iranian, Mongolic, Uralic, and others) ultimately led 
to the formation of different branches of Turkic with their various subgroups and 
varieties. From the 15th century onwards up until the modern period in the 20th 
century with its nation building processes, two major literary varieties of Turkic — 
Ottoman and Chaghatay — were used as supra-regional literary languages. The use 
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of the Arabic alphabet provided for the intelligibility of written language material 
over a vast geographic area. Due to the nature of the Arabic alphabet and the con- 
servative writing system employed for writing Turkic, however, many questions 
about the development of the Turkic languages in the Middle and Pre-Modern Tur- 
kic periods (13th to 19th centuries) remain as yet unsolved. 

Currently, the Turkic language family consists of about 25 written and a 
number of non-written languages and varieties among which Turkish is the largest 
one, having the largest number of speakers and the broadest written language pro- 
duction. The modern languages are divided into the six subgroups: Chuvash, Kha- 
laj, Northwestern or Kipchak Turkic, Southeastern, Northeastern, and Southwest- 
ern or Oghuz Turkic. This division is based on linguistic criteria and geographic 
distribution (see Johanson 1998: 82). The two branches Chuvash and Khalaj con- 
sist of only one language each, whereas the remaining four branches contain sev- 
eral sub-branches and varieties. 


2. The Turkic languages of Europe 


Of the six sub-branches of the Turkic language family, three are represented in Eu- 
rope: Oghuz, Kipchak, and Chuvash. 


2.1: The Oghuz languages of Europe 


Oghuz or Southwest Turkic is the branch with most speakers and the largest single 
language Turkish (= Trk), which has approximately 70 million native speakers in 
Turkey, Cyprus, the Balkan countries, and North-Western Europe. The number of 
native speakers of Turkish can only be guessed, as statistics for Turkey, and also 
most of the Balkan countries as well as the Northwest European countries, do not 
exist. The very concept of native speaker is of course also a problem. Turkey has a 
population of about 70 million people, but the number of Turkish citizens having a 
first language other than Turkish is unknown. This uncertainty applies of course 
also to Turkish immigrants to Europe, who number about 3.5 million people. Tur- 
kish is the sole official language of the Republic of Turkey and one of the official 
languages of the entire Republic of Cyprus. In North-Western Europe it is the 
strongest immigrant minority language, with about 3 million speakers, and is 
included in the school curriculum in a variety of Northwestern European countries 
as well as in some Balkan states. The opinion, however, that Turkish language 
instruction should be removed from the curriculum because clinging to one’s eth- 
nic language supposedly hinders integration in mainstream society seems to gain 
ground lately, see the introduction in Extra and Yağmur 2004 and, as a case study, 
the documentation (Dokumentation 1993) on the discussion about a bilingual al- 
phabetization project for Turkish children in Berlin. 
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The second Oghuz language spoken in Europe is Gagauz (= Ggz), with a total of 
about 250,000 speakers. The largest group of Gagauz people lives in the Republic 
of Moldova (147,500 in 2004) (http://www.statistica.md/pageview.php?l=en&idc= 
334&id=2338 [24. 03. 2011].), where Gagauz has official language status in the 
autonomous region Gagauzia (or Gagauz Yeri). Smaller Gagauz communities live 
in the Ukraine (31,900 in 2001), Bulgaria, Romania, Greece, and Russia. Unlike the 
Turks of Turkey, who are almost all Muslims, the Gagauz are Orthodox Christians. 
Linguistically, Gagauz can be regarded as a dialect of Turkish that has developed in- 
dividual features due to intensive contact with Slavic languages. Gagauz was de- 
clared one of the official languages of the Soviet Union in 1957. Since then it was 
written in Cyrillic script until in 1995 an alphabet based on Latin script was intro- 
duced which is slightly different from the alphabet used for Turkish. Gagauz is part 
of the curriculum at schools in Gagauzia. 


2. The Kipchak languages of Europe 


The Kipchak (Northwest Turkic) languages spoken in Europe are the Northern 
Kipchak languages Tatar (2 Ttr) and Bashkir (2 Bsh), the Western Kipchak lan- 
guages Karaim (= Krm), Crimean Tatar (= CrTtr), Kumyk (= Kmk), and Karachay- 
Balkar (= Krch), and the Southern Kipchak language Noghay (= Nog). The major- 
ity of Kipchak language speakers are Muslims, with the exception of groups of 
Christian Tatars and the Karaim, who adhere to a form of Judaism. 

There are about 5.5 million Tatars living in the Russian Federation (from the 
2002 census, see http://demoscope.ru/weekly/ssp/rus_nac_02.php [24. 03. 2011]), 
about 2 million of whom are in the Republic of Tatarstan, followed by Bashkir with 
1.6 million people, the majority of them (1.2 million) living in the Republic of 
Bashkortostan. Tatar and Bashkir are both official languages alongside Russian in 
their respective republics. It is worth noting that the Bashkir are a minority in their 
own republic, where 36 % of the population are Russian, 30% Bashkir, and 24% 
Tatar (but see Gorenburg 1999 for the fluctuation between Bashkir and Tatar iden- 
tity in Bashkortostan). Tatar and Bashkir are linguistically closely related to each 
other, with differences mainly in phonetics and phonology. 

Crimean Tatar is an endangered Western Kipchak variety, originally spoken on 
the Crimea, from where the Crimean Tatars were deported to Central Asia during 
World War II. Since the mid 1980s Crimean Tatars have been allowed to return to 
the Crimea, where, according to the census of 2001 (http://www.ukrcensus.gov.ua/ 
results/general/nationality [24. 03. 2011]), about 250,000 Crimean Tatars live at 
present, 92 % of whom claim to have Crimean Tatar as their mother tongue. The 
number of Crimean Tatars in Uzbekistan is not certain. 

Ethnic Kumyks are about 422,409 people and the group of Karachay-Balkars, 
who form a linguistic unit, consists of 192,182 ethnic Karachays and 108,426 eth- 
nic Balkars. Kumyk and Karachay-Balkar are spoken mainly in the Caucasus re- 
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gions. They all have official language status alongside Russian and other languages 
in the republics where their majority is based. Kumyk has official status in the Re- 
public of Daghestan; Karachay is one of the official languages of the Karachay- 
Cherkess Republic; and Balkar is an official language in the Kabardino-Balkar Re- 
public. 

Karaim is a highly endangered language spoken in Lithuania by fewer than 100 
speakers. 

The ethnic group of the Noghays has 90,666 members. Noghay is spoken 
mainly in Daghestan, where it has official language status, and in its adjacent re- 
gions (Noghay also has official language status in the Karachay-Cherkess Repub- 
lic) and 1s the only European Turkic language belonging linguistically to the South 
Kipchak group (the South Kipchak languages spoken outside of Europe are Ka- 
zakh, Karakalpak, and Kirghiz). 


2.3. Chuvash 


Chuvash (2 Chu) constitutes a branch of its own and is linguistically quite distant 
from all the other Turkic languages because its ancestor language split away from 
Turkic before the Old Turkic period. It is an official language in the Republic of 
Chuvashia where about 900,000 Chuvash people live. The total number of Chu- 
vash in the Russian Federation is about 1.6 million. The Chuvash are predomi- 
nantly Christian. 


All Kipchak languages mentioned above as well as Chuvash are part of the cur- 
riculum in the regions where their speakers concentrate; Tatar is the medium of 
instruction in some schools and also in some branches of higher education in Tat- 
arstan. 

While the population figures are easily available for at least the ethnic groups 
of the Russian federation, estimating the number of native speakers is more com- 
plicated. In the 2002 census, knowledge of language was elicited only quanti- 
tatively, not qualitatively. The results cannot straightforwardly be connected with 
the figures for the ethnic groups. In any case, it is safe to say that a shift to Russian 
as the first language is not uncommon among all the minority groups in urban re- 
gions of the Russian Federation, while in rural areas the ethnic language has 
broader functionality and higher prestige and is thus not given up easily. On the 
other hand, we must also account for a certain influence of Turkic languages with 
higher prestige upon the less prestigious ones in areas where speakers are in close 
geographic contact. Thus, Tatar has a higher prestige than Bashkir and Chuvash, 
and speakers of those languages living in Tatarstan might give up their variety 
in favor of Tatar. Governmental influences in this area might even lead to a change 
in ethnic identity between linguistically closely related Turkic groups. As an 
example, see the study by Gorenburg (1999) for Bashkortostan. Turkish has gained 
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a certain influence on Gagauz in the course of the last 15 years that might event- 
ually lead to diglossia among ethnic Gagauz who have close connections to Turkey 
(see Menz 2003). 

Almost all Turkic languages of the former Soviet Union suffer from a certain 
degree of endangerment because of the overwhelming social dominance of Rus- 
sian during Soviet times. Russian was then and is still the language of the sciences, 
technology, political, legal, and administrative institutions and thus enjoys a much 
higher prestige than the languages of the minorities, however large they might be. 
This dominance led to an underdevelopment of the lexicon of the languages in 
question in certain domains. 

Chuvash, Tatar, Bashkir and their predecessors have been in a complex lan- 
guage contact situation with Finno-Ugric languages of the Volga region ever since 
speakers of Turkic arrived there (see Johanson 2000). 

Mutual intelligibility is high between Turkish and Gagauz and also among the 
Kipchak languages. Intelligibility between Oghuz and Kipchak languages is also 
possible on a basic level of communication. Chuvash, however, is not intelligible 
for any of the other Turkic languages. 


3. Typological characteristics of the Turkic languages of Europe 


3.1. Phonetics and phonology 


Vowel phonemes: The Turkic languages typically employ a set of 8 vowel pho- 
nemes, most clearly represented in Turkish (see Table 1). Each back vowel has its 
front counterpart. Tatar, Bashkir, and Noghay have two e phonemes, /£/ and /e/, and 
therefore 9 vowel phonemes. Tatar, Bashkir, Karaim, and Chuvash have central- 
ized shortened or reduced vowels (lax). Chuvash has no /o/ and /g/ phonemes, ex- 
cept for /o/ in Russian loanwords. 

The Turkic languages of Europe typically have no long vowels in native words 
and no diphthong phonemes. Long vowels do appear in Arabic-Persian loans and 
as a result of syllable contraction. In some Gagauz words of Turkic origin, Ancient 
Turkic long vowels have been preserved; this mainly concerns long /a:/ and does 
not involve all Turkic words that had primary long vowels (see Pokrovskaja 1964: 
29). 


Table 1: Turkish (and Gagauz) vowel phonemes 


front back 


high i y u u 
low € o a o 
illabial labial illabial labial 
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Consonant phonemes: In general, the plosives are pairs of voiced and unvoiced 
counterparts. Chuvash, however, has only unvoiced plosive phonemes that are pro- 
nounced fortis except between vowels or between sonorant and vowel, where they 
are pronounced lenis. All other languages have pairs of the voiced/unvoiced bilab- 
ial (/b/ and /p/), alveolar (/t/ and d/), and palatal/velar (/k/ and /g/) plosives. All 
have at least one labiodental (/f/ or /v/), one alveolar (/s/ or /Z/), and one postalveo- 
lar fricative (/f/ or /3/), and a postalveolar affricate (/tf/ or /d3/). In all these lan- 
guages the nasals /m/ and /n/, laterals /l/, /r/ and the approximant /y/ exist. Some 
Kipchak languages have a bilabial approximant /w/. The Kipchak languages with 
the exception of Karaim, also have a velar nasal phoneme /y/. 

The basic differences between Bashkir and Tatar concern the consonant corre- 
spondences shown in Table 2. 


Table 2: Tatar and Bashkir consonant correspondences 


Tatar Bashkir Tatar Bashkir 
S- h- sakal hakal ‘beard’ 
{+s} {+h bul-sa bul-ha “if it is’ 
é S alf- as- *to open' 
S 0 bas- bað- ‘to press’ 
Z 5 boz boð ‘ice’ 


Secondary phonemes due to language contact are the unvoiced labiodental frica- 
tive /f/, which appears only in native interjections and in Arabic-Persian, French, 
and Russian loan-words, e.g. Trk öf ‘uff’ and Ttr fonétika ‘phonetics’, fikér ‘idea’ 
< Arabic, the alveo-palatal affricate /ts/ in Russian and Romanian loans, e.g. Ggz 
furkulitsa ‘fork’ < Romanian, as well as the alveolar fricative [3] in Persian, French 
and Russian loans, e.g. Ttr Zaloba ‘complaint’ < Russian. 

Syllable structure: preferred syllables are (C)V(C); vowel sequences and con- 
sonant clusters are avoided with exceptions in final clusters consisting of a liquid 
or nasal (in Turkish and Gagauz also a sibilant) and a stop, e.g. Ttr /dyrt/ ‘four’, but 
/dus/ ‘friend’ « Persian dost, in Turkish /dost/. In recent loanwords from Russian 
and English, final clusters with other combinations, as e.g. Ttr /nerv/ ‘nerve’ and 
even initial consonant clusters become more and more acceptable in all European 
Turkic languages, e.g. modern Trk /spor/ vs. the older pronunciation /suipor/ 
‘sport’ and Ttr /spektakl/ ‘theatre play’. Pronunciation in these cases often depends 
on the speaker's foreign language knowledge. 

All Turkic languages have inter-syllabic harmony. A syllable is either back 
or front, each following syllable adhering in this respect to the previous one, e.g. 
Bsh [c#/#] ‘person’ and Ttr [ayak] ‘foot’. This rule also applies to most suffixes, 
e.g. Bsh [cif#ler] ‘persons’, Ttr [ayaktar] ‘feet’. As can be seen from the above 
examples, the syllable harmony affects not only vowels but also those consonants 
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that have front and back allophones. In Karaim, due to the development of a set of 
palatalized consonants under the influence of non-Turkic languages, this system 
spread over all consonants: e.g. [b'er'd'i| ‘he gave’ as opposed to [bardy] ‘he 
went', (see Csató 1999). Back and front allophones of /k/ and /g/ have different 
letters in some of the Cyrillic-based alphabets (Bashkir, Noghay) but not in the 
Latin-based ones (Gagauz and Turkish). 

Loanwords do not always take part in harmonizing processes; suffixes attached 
to them, however, in general adhere to front:back-harmony, e.g. CrTtr minare-ge 
[minaret-DAT] ‘to the minaret’. Only a limited number of suffixes are invariable 
and accordingly do not adhere to vowel harmony. 

Turkish additionally has labial harmony in suffixes that contain a high vowel. 
These suffixes display a fourfold harmony according to frontness and roundedness 
of the vowel in the preceding syllable, e.g. ev ‘house’ > evi [house-Acc], ot ‘gras’ > 
otu [gras-Acc]. 

Baskhir has labial harmony after /ce/ and /o/ in suffixes that contain a mid 
vowel, e.g. bor ‘milk’ > hoet-tae [milk-Acc]. 

Consonant assimilation: Suffix-initial consonants may assimilate to the final 
consonant of the word they are attached to. The degree of assimilation differs in the 
various languages. The plural suffix!, for instance, is -/Ar in Turkish, in Kumyk 
and Karachay-Balkar -/A(r), but -LAr in all the other Turkic languages of Europe, 
displaying individual kinds of assimilation: In Gagauz the initial L is assimilated 
only after n, i.e. gün ‘day’ > giinndr ‘days’, in Tatar, Karaim, Crimean Tatar, and 
Noghay after all nasals, thus alym ‘method’> alymnar ‘method’, in Bashkir it is 
-lAr after vowels, -tAr after plosives, fricatives except /Ó/, and affricates, -dAr after 
nasals, / and Z, and -SAr after y, w, r, and d. 

Suffix-initial plosives assimilate in all languages after voiceless consonants, e.g. 
dative -GA > Krm uvul-ga [son-DAT] ‘to the son’ but it-ke [dog-pat] ‘to the dog’. 

Suffix-initial vowels may drop after vowel stems, e.g. Nog ana-m [mother- 
Poss1sc] ‘my mother’ but is-im [matter-Poss1sc] ‘my concern’. 

Stress is regularly on the last syllable. Suffixes attached to a stem take over the 
stress, e.g. Bsh urdm ‘street’ uramdár ‘streets’ uramdaróán ‘from the streets’. 
Some suffixes and clitics cannot carry stress, e.g. negation, copula elements, the 
question particle, etc. Names and loanwords may have different stress patterns. 


3.2. Morphology 


The Turkic lexicon basically consists of verbs and nouns, formally divided by the 
ability of the verbs to take a negation suffix in -mA added directly to the stem. The 
nouns can be subdivided into nouns proper, numerals, adjectives (with some diffi- 
culties, see Braun and Haig 2000 for Turkish), and pronouns. The Turkic languages 
have no definite article. A group of indeclinable words consists of adverbs, post- 
positions, conjunctions (mostly of non-Turkic origin), and interjections. 
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Turkic languages belong to the agglutinative language type. Words take suf- 
fixes that are attached to the stem and to each other, being rather clear-cut and stay- 
ing relatively stable in the process. Suffix boundaries generally stay quite clear; 
variation is predictable according to vowel and consonant harmony, e.g. Trk kapak 
> kapak-lar > kapak-lar-ı > kapak-lar-in-da and kapak > kapag-ı, see below. 

All Turkic languages have derivational and inflectional suffixes. The order of 
suffixes is not arbitrary: Derivational suffixes precede inflectional ones; the two 
groups also have an internally fixed order, and although combinability is high, it is 
rule-bound. 


3.2.1. Derivation 


Denominal and deverbal suffixes for verbs and nouns form new stems, e.g. 


denominal nouns: Chu yurd ‘song’ > yurd-sä ‘singer’ ; 
denominal verbs: Bsh es ‘work’ > es-le- ‘to work’; 
deverbal nouns: Krm yalvar- ‘to request? > yalvaris ‘request’; 
deverbal verbs: Ttr kurk- ‘to fear’ > kurkit- ‘to frighten’. 


All Turkic languages have suffixes for passive, reflexive, causative, and cooper- 
ative that precede the negation suffix, i.e. they are in the slot for derivational suf- 
fixes. 

Besides synthetic derivation, nominal and verbal compounding is used to form 
new words. The most frequent type of nominal compounds consists of two (or 
more) nouns with a third person possessive suffix at the head noun, e.g. Ggz yemiş 
aacı [fruit tree-Poss3sc] ‘fruit tree’. (In cases where personal possession of a com- 
pound must be expressed, the compounding suffix is replaced by the necessary 
possessive suffix, thus yemiş aacim [fruit tree-poss1sG] ‘my fruit tree’ yemiş aac- 
lari [fruit tree-poss3PL] ‘their fruit tree’, etc.) Juxtaposition of two nouns without 
possessive suffix is also possible, especially when the attribute denotes a material, 
e.g. Chu Cugun sul [iron road] ‘railroad’. 

Verbal compounds may consist of a noun (of foreign origin) and an auxiliary 
verb, e.g. Krch sabir ‘patience’ > sabir et- "be patient’, or a postverb construction, 
e.g. Nog usip bar- [fly-cvB go] ‘to fly away’. 


3.2.2. Verb inflection 


Closest to the verbal stem comes the negation suffix, -mA, which is followed either 
by finite or non-finite verb forms. -mA does not carry stress. 

Finite forms are built with various aspecto-temporal markers followed by 
agreement markers. In all the languages two basic types of agreement markers (see 
Table 3) combine with the aspecto-temporal forms. The so-called copulative agree- 
ment markers combine with present tense, habitual present, perfect, and the future; 
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the so-called possessive type combines with past tense and conditional. The im- 
perative and the optative mood have their own paradigms of personal markers. 


Table 3: Verbal agreement markers 


Copulative agreement marker Possessive agreement marker 
Ttr? Krm Nog Chu Trk Ttr Krm Nog Chu Trk 
1.sg -mEn -m -MAn -(A)p -(ylIm| -m -m -m Am -m 
2.sg -sEn -S ln -(A)n -sIn Bu -y 0 -Àn -n 
3sg Ø (-t) (-DD Ø Ø Ø Ø Ø -é Ø 
L.pl -bEz -bIz Miz -pÁr  «y)Mz| -K  -K -K -Ámár -K 
2.pl -sÉz -sIz -SIz -(A)r -sInIz | -GEz -ylz -plz -Ăr -nlz 
3.pl (-LAr) -LAr, -dlAr CDA -$(€) (-LAr)|(-LAr) (-LAr) CIAr) -é$ (-LAr) 


The aspecto-temporal system of the Turkic languages is rather complex, consisting 
of primary suffixes and secondary combinations with auxiliary forms. The primary 
forms can roughly be divided into a tripartite system of markers for past, present, 
and future; for the various suffixes see Table 4. To these aspecto-temporal suffixes 
personal markers are added when used as predicators. Note that some of the suf- 
fixes in question serve also as markers for participles. 


Table 4: Basic aspecto-temporal markers 


Ttr Kmk Krm Nog Chu Trk Ggz 
Past I -DE -DI -DI -DI -R -DI -DI 
Past II -GAn -GAn  -GAn -GAn  -nÁ -mls -mls 
Present I -A -A -A -A -At -(O)Iyor* -er 
Present II -(V)r -(V)r -(A)r Ø -(V)r -(V)r 
Future -y)AÉAk -(A)JZAk -(V)r -(A)yAk A -(y)Acak ` -(y)AcAk 


Although Table 4 looks as if the aspecto-temporal systems in the Turkic languages 
were quite similar, matters are a bit more complicated. The individual languages 
display semantic differences despite the fact that forms look very much alike and 
the building mechanisms for combined forms are structurally the same in all the 
languages except Chuvash. 

Past: The basic past tense forms are the simple past (past I), which in all lan- 
guages in question goes back to an Old Turkic formant -Di and an indirective past 
form (past II) conveying post-terminality and modal shades of evidential, inferen- 
tial, hearsay, or surprise. This second past is marked by -GAn in the Kipchak lan- 
guages, -mls in the Oghuz Turkic languages, and by -nA, which cannot take per- 
sonal markers, in Chuvash. 
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Present: In almost all the languages there are at least two present tense forms, 
differing semantically in that one is used to express an ongoing event while the 
other is a broader intra-terminal present form, conveying also intention and habit- 
uality. 

As focal present tense forms tend to lose focality, they were regularly renewed 
throughout all the Turkic languages, most often by the help of a combination 
of a converb form and an auxiliary verb (mostly tur- ‘to stand’, but yorı- ‘to walk’ 
in Turkish) that later became de-lexicalized and subsequently turned into a 
suffix. The second present tense in -(V)r in some languages developed into a 
mere modal prospective form. Noghay, for instance, has a tripartite system con- 
sisting of non-focal, low-focal, and high focal forms, bar-ar-man [go-PRsII-1sc] 
‘I will (intend to) go’, bar-a-man [go-PRsI-1sc], ‘I go’ bar-ayatir-man [go-PRs- 
lsc] ‘I am going’, alongside other intra-terminal forms (see Karakoç 2005: 
152-161). 

Future: Besides conveying prospectivity the future tense has modal conno- 
tations in all Turkic languages in question. The Karaim future in -(V)r developed 
from the former present form that lost its focality (see Csató 1998: 48). 

In addition to these basic simple suffixes, all languages have combinations of 
participles or converbs and forms of the copula or certain auxiliary verbs that bear 
aspecto-temporal markers, thus forming various new aspecto-temporal markers, 
e.g. habitual past forms as Krm koy-ul-ur e-di [place-PAss-PRsII Aux-PsTI3sG] 
‘used to be placed’, Nog üret-etagan bol-gan [teach-HABIT AUx-PsTII3sc] ‘used to 
teach’; pluperfect in -GAn edi, CrTtr algan edi-m [take-PsTII pstcop-1sg] ‘I had 
taken’, or focal present tense as in Krch isle-y tur-a-ma [work-cvB Aux-PnsI-1sc] 
‘I am working’. 

Modal forms: The imperative is the bare stem of the verb in the singular and an 
unstressed personal suffix -(y)In in the Oghuz languages, -[gIz/-InIz/-InIz in the 
Kipchak languages and -(0)Àr in Chuvash in the plural. Negation is regular, i.e. Ttr 
kayt ‘go back" kaytma [go-back.NEG] ‘don’t go back! ; except in Chuvash, where 
the negated imperative is formed with a prepositive element an, which is most 
probably of Finno-Ugric origin: sir ‘write!’, an sir ‘don’t write". Besides the im- 
perative, all languages have optative, voluntative, and necessitative forms. 

Non-finite forms are converbs, verbal nouns, and participles. The main func- 
tion of these non-finite forms is that of predicators in subordinated clauses (see the 
section on syntax). Moreover, they are used as elements in aspecto-temporal com- 
binations. 

Converbs: All languages have a coordinative converb in -(X)p or -(y)Xp, except 
Chuvash, which has -sA. Widespread use is also made of an intra-terminal converb 
in -(y)A. Besides primary non-analyzable converb forms, most languages make 
use of combinations of participle and case suffixes and/or postpositions to build 
converb forms with various semantic shades, e.g. CrTtr -GAndA ‘when ...' < par- 
ticiple in -GAn + Loc, Trk -DIk-tAn sonra ‘after ...’ < participle in DIE + ABL + 
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postposition. The conditional suffix is -sA (+personal marker) in most cases (Trk, 
Ggz, Ttr, Kmk, CrTtr, Krm, and Krch), -hA in Bashkir; for remote condition the 
past copula is added, e.g. CrTtr -sA edi, with the personal marker added to the 
copula. Chuvash forms an invariable real conditional with the converb -sAn that 
can also denote temporal relations and a remote conditional with the suffix -(A)ttA 
(+ personal marker). 

Verbal nouns: Various verbal nouns, most prominently forms in -(V)w in the 
Kipchak languages, -mA in the Oghuz languages, and -ni in Chuvash serve as 
predicators in complement clauses. 

Participles: Participle forms are -D/k, -(y)An, -(V)r, and -(yJAcAK in the Oghuz 
languages; -(O)AkAn, -nÀ and -(0)As, -GAn, -(V)r, and -(yJACAK etc. in the Kip- 
chak languages. Besides other functions, participle forms are used as predicators in 
dependent clauses, especially in relative clauses. Some participles can also appear 
in combination with auxiliary copula elements in complex finite predicates. Verbal 
nouns as well as most participles can take possessive suffixes as agreement 
markers. 


3.2.8. Nominal inflection 


Turkic languages have no grammatical gender. Nouns take plural, possessive, and 
case marking suffixes. The plural suffix is -LAr in all the languages but Chuvash, 
where it is -sem with the allomorph -sen before case suffixes. Possessive suffixes 
follow the plural in all languages but Chuvash, which has the reverse order noun- 
POSS-PL, e.g. Bsh qul-dar-im [hand-PL-Posslsc] but Chu al-ém-sem [hand- 
poss1sG-PL] ‘my hands’. Last come the case markers, e.g. Bsh qul-dar-im-a [hand- 
PL-POSS1SG-DAT] and Chu al-ém-sen-e [hand-poss 1sG-PL-oBJ] ‘to my hands’. The 
plural suffix is normally not used after quantifiers, e.g. Ggz dokuz usak [9 child] 
*nine children.’ 

The basic case markers are suffixes for genitive, accusative, dative, locative, 
and ablative (see Table 5). The nominative has no marker. Direct objects are 
unmarked if they are non-specific and in immediately prepredicative position, 
and marked for accusative if they are specific/definite or separated from the 
predicate. See the Chuvash example in (1), which is taken from Krueger (1961: 
186). 


(1) Ivan kéneke-ne vul-a-t vs. lvan kéneke vul-a-t 
I. book-Acc read-PRs-3scG I. book  read-Pns-3scG 
‘Ivan reads the book.’ ‘Ivan reads books.’ 


Chuvash has only one object-marking case suffix, both for specific/definite direct 
and indirect objects. In Kumyk and Karachay-Balkar the markers for genitive and 
accusative coincide. 
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Table 5: | Case markers 


Ttr Kmk Krm Nog Chu Trk + Ggz 
Gen -nÉg -NI -nIn -DIn -(n/y) End -(n)In 
Acc -nÉ -NI -nI -DI -(n/y)A -(y)I 
Dat -GA -GA -GA -GA -(y)A 
Loc -DA -DA -DA -DA -RA -DA 
Abl -DAn -DAn -DAn -DAn -RAn -DAn 


All languages have additional case-like suffixes for instrumental, directive, etc. 


Personal and demonstrative pronouns often exhibit changes of the stem in ob- 
lique forms. Most languages have a tripartite system of demonstratives denoting 
semantic differences that could be roughly described as different spatial reference. 
The reflexive pronoun is kendi in Turkish and Gagauz, öz, iiz, and üö in the Kip- 
chak languages, and ya in Chuvash. 


3.3. Syntax 


Turkic languages are head-final languages. All modifying elements precede the 
modified ones, e.g. in a noun phrase indefinite article or demonstrative precedes 
adjective precedes noun, as in Ttr bër matur žir [INDEF.ART beautiful song] ‘a 
beautiful song’ or bu matur žir [DEM beautiful song] ‘this beautiful song’. Plurality, 
possession, and case are marked at the head of the noun phrase; there is no agree- 
ment between attributive and head noun matur Zir-lar-i-na [beautiful song-PL- 
POSS3SG-DAT] ‘for her/his beautiful songs’. 

In genitive constructions the possessor marked with genitive precedes the head 
noun marked with possessive suffix, e.g. Nog kinaz-din yurt-i [prince-GEN land- 
Poss3sc] ‘the land of a/the prince’. This word order can normally only be inverted 
in postverbal position. Only Gagauz and, to a much greater extent, Karaim make 
use of the inverted order also in preverbal position. In Karaim the head noun might 
even lack a possessive suffix (see Csató 2001: 14). 

Postpositional phrases consist of a postposition as the syntactical head and a 
noun phrase. There are several types of postpositions: some developed from spatial 
nouns that might require genitive marking on the attribute Ggz daa-yin ic-in-dä 
[wood-GEN in-Poss3sG-Loc] ‘in the wood’ and daa ic-in-ä [wood in-Poss3sG-DAT] 
‘into a wood’ < ic ‘underside, bottom side’, some from verbal stems that require 
case marking on the attribute depending on the valency patterns of the underlying 
verb, e.g. nog üy-in-e karap [house-poss3sG-DAT towards] “towards her house’, 
and some are constructed with a noun in the nominative but a pronoun in the geni- 
tive, e.g Ttr ukugi-lar bélen [student-pL with] “with the student’ but an-in bélen 
[(s)he GEN with] ‘with her’. 
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3.3.1. Simple sentences 


Turkic languages in principle have predominantly SOV order in a neutral decla- 
rative sentence; word order variation serves semantic purposes. In terms of prag- 
matics, the topic takes the sentence-initial position with the focus in immediate 
preverbal position. The postverbal position serves for given or de-focused el- 
ements (see Erdal 1999). Thus, in the following example sentences (cited after Er- 
guvanh Taylan 1984: 11-12) capitals indicate focus position while underlining in- 
dicates topic and italics given elements. 


(2) a. Ahmet yumurta-yı ye-di. vs. 
A. €gg-ACC eat-PsT3sG 
Ahmet ate THE EGG. 
b. Yumurta-yı Ahmet ye-di. vs. 
egg-ACC A. eat-PsT3sG 
AHMET ate the egg. 
c. Yumurta-yı ye-di Ahmet vs. 


egg-ACC eat-PsT3sG A. 
Ahmet ate THE EGG. 

d. Ahmet ye-di yumurta-yi 
A. eat-PST3SG egg-ACC 
AHMET ate the egg. 


Gagauz and Karaim, however, deviate from this pattern. Both languages employ 
SVO as the neutral word order in declarative sentences. With this word order 
change, which is due to the heavy influence of the surrounding socially dominant 
Indo-European languages, the position for the focus has changed to the postverbal 
position, see example (3) from Gagauz (Menz 1999: 188). 


(3) Q: kim-i cair-di-lar? A: car-ar-dı-lar yakın senselä-lär-i 
who-Acc invite-PST.3PL invite-AOR-PSTCOP-3PL near relative-PL.ACC 
Q: “Who did they invite?’ A: ‘One normally invited close relatives.’ 


Nominal sentences have no copula in the present tense, the subject is expressed 
either by a personal markers attached to the nominal predicate or by a pronoun, e.g. 
Kmk isci-sen [worker-2sG] ‘you are a worker’, Ttr min student [I student] ‘Iam a 
student’. Negative existence is expressed by Trk degil, Ggz diil, Ttr and Bsh rügel, 
Kmk tügül, CrTtr degil, etc. as the predicate, e.g. Ttr kiizlégém zur tiigél [glasses- 
Poss1sc big not] ‘my glasses are not big’. 

Constructions expressing possession either have a predicate bar/var Chu pur 
‘existing’ or the negative in yok/yuk Chu suk ‘not existing’. The possessor is ex- 
pressed by a possessive suffix in the required form on the possessed, which pre- 
cedes the predicate, e.g. Chu lasa-m pur [horse-Poss1sc existing] ‘I have a horse’, 
Ttr minem segatém yuk [my watch-Possl1sc not existing] ‘I don't have a watch’. 
Bar/var and yok/yuk are also used together with a locative adverbial in expressions 
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a 


denoting ‘there is ... 
“there are trees there’. 

All languages in question have an interrogative particle m/ for yes/no-ques- 
tions that follows the element asked for; compare the examples in (4) for Turkish: 


, e.g. Nog onda (...) terek-ler bar [there tree-PL existing] 


(4) a. Ali bugiin gel-di mi? 

A. today come-PsT3sG Q 
‘Did Ali come today?’ 

b. Ali bugün mü geldi? 
A. today Q come-PST3sG 
‘Did Ali come today?’ 

c. Ali mi bugün geldi? 
A. Q today come-PsT3sG 
"Did Ali come today?' 


3.3.2. Complex clauses 


Subordinated clauses are typically left-branching non-finite clauses, i.e. the subor- 
dinated clause precedes its head. They are generally constructed on predicates with 
non-finite verbal suffixes that function as subordinators. Besides this genuine Tur- 
kic syntactic pattern, virtually all languages employ structurally copied right- 
branching clauses built on models copied from Indo-European languages. While 
this is subject to certain restrictions (see Johanson 1977) and has relatively low fre- 
quency in most of the languages, Karaim and Gagauz have shifted to use right- 
branching subordinated clauses almost exclusively. 

Constituent clauses are built on verbal nouns and participles that bear case 
marking according to the valency pattern of the governing predicate. 


(5) Israil' radio-si memleket-te  fiat-lar-i-nit) kene 
I. radio-poss3sG land-Loc price-PL-POSS3SG-GEN again 
art-kan-i-ni bildir-di. 


rise-PTCP-POSS3SG-ACC announce-pstl3sg 
“Radio Israel has announced that the prices in the country have risen 
again.’ CrTtr, Jankowski (1992: 383) 


Karaim and Gagauz use finite right-branching clauses as constituent clauses which 
are introduced by a conjunction, see example (6) from Gagauz, where the conjunc- 
tion used in a variety of subordinated clauses — but most prominently in constituent 
clauses — is ani. 


(6) Hepsi sevin-ärdi ani kolxoz-a gir-di-k 
everybody be pleased-HABiT.PsT3sG ANI kolkhoz-par enter-PsT-lPL 
“Everybody was pleased that we joined the kolkhoz.’ 

Ggz, Menz (1999: 192) 
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Karaim makes use of the junctor ki (< Persian ke) in constituent clauses, an element 
employed in most of the Turkic languages, which had at some point in time been in 
direct or indirect contact with Persian. 

Relative clauses are built on participles, most prominently -(y)An and -DIK in 
Turkish, -GAn in the Kipchak languages, and -nA in Chuvash. Besides these, each 
language has a variety of participles that can be used as predicators in relative 
clauses conveying several aspecto-temporal nuances. 

The main difference between the Turkish type of relative clauses and the other 
Turkic languages in question is that Turkish makes a formal distinction according 
to whether the first actant of the relative clause is co-referential with the head noun 
or not. In case it is, Turkish uses the participle in -(y)An. If it is not co-referential, it 
makes use of the participle in -D/K. The Kipchak languages and Chuvash use the 
same participle in both cases, thus: 


(7) Trk gör-en adam Trk  gór-düg-üm adam 
[see-PTCP man] [see-PTCP-POsslsc man] 
‘the man who sees’ ‘the man whom I see’ 
Ttr küren adem Chu kurnd Sin 
[see-PTCP man] [see-PTCP man] 


‘the man who sees / the man whom [subject] sees’ 


In the Kipchak languages and in Chuvash the non-referential subject of the relative 
clause might be unexpressed if it is known or can be recovered from the context. It 
can be expressed in form of a pronoun, see the following example: 


(8) Min de hin kil-gen yul-dan ` kil-dé-m 
I PTCL you come-PTCP road-ABL come-PsTI-1sc 
‘I also took the road you took.’ Bsh, Heß 2008: 356 


Gagauz and Karaim as well as some Turkish dialects on the Balkans again employ a 
right-branching finite model as their main device for relative clauses. Karaim and 
Gagauz have developed relative pronouns making use of the interrogative adjectives 
Krm kaysi ‘which’ and Ggz angi ‘which’ that agree in number with the head noun 
and bear case morphology according to their semantic role within the relative clause 
(see example (9)). Among the Balkan Turkish dialects, the interrogative ne ‘what’ 
functions as junctor in right-branching relative clauses (see Friedman 2006: 39). 


(9  Ggz adam angi-si-ni gór-dü-m 
man rel-Poss3sG-Acc see-PsTI-1sc 
‘the man whom I saw’ 


adamlar angı-ları-nı gör-dü-m 
man-PL  rel-Poss3PL-ACC Ssee-PsTI-1sc 
‘the men whom I saw’ 
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All Turkic languages employ a variety of converb suffixes as predicators in adverb- 
ial clauses. Several adverbial relations can be expressed with converbs: time, 
cause, purpose, and condition. Some of the converbs do not allow for different sub- 
jects in main and dependent clause. 

Most prominent is the coordinating converb -Xp in all languages in question 
but Chuvash, where its functional counterpart is -sA. 


(10) Men skola-ga ` bar-ip, senig belgi-ler-ig-di kara-p 
I school-DAT go-cvB your result-PL-POss2sG-ACC look-cvB 
kel-eyatir-man 
come-PRS-1SG 
‘I went to the school, looked at your results and am coming [back] now.’ Nog 
Karakoç (2005: 155) 


Although both Karaim and Gagauz have developed right-branching patterns for 
adverbial clauses, too, the syntactic shift is not as profound as with complement 
and relative clauses. Both still make use of several converbs, notably for adverbial 
clauses with temporal semantics. 


4. Linguistic changes due to language contact 


All the Turkic languages in question show various traces of contact-induced lin- 
guistic changes. For the varieties of the Muslim speakers, the prestige languages 
Arabic and Persian had been a source for lexical borrowing but also for copying on 
other linguistic levels from the 11th century until modern times. Chuvash shows 
profound traces of Finno-Ugric influence, which in the first decade of its explora- 
tion even led to the belief that Chuvash was a Finno-Ugric language. The character- 
istics of this influence still need much further investigation. In modern times Rus- 
sian was and still is the socially dominant language for all Kipchak languages of 
Europe as well as for Chuvash and Gagauz. Turkish, whose speakers tried to repel 
the Arabo-Persian influence in the first half of the 20th century, has recently bor- 
rowed a lot from English, especially in the lexicon. Besides these influences from 
typologically different languages, some of the Turkic languages in question are in- 
fluenced by other, more prestigious Turkic languages. Crimean Tatar has been in- 
fluenced by Ottoman Turkish, while Chuvash and Bashkir have been and are still in- 
fluenced by Tatar, and Gagauz nowadays shows traces of the influence of Turkish. 

Apart from obvious influences on the lexical level, all other linguistic levels 
are affected by language contact-induced changes. Due to the insertion of copied 
lexical items, foreign sounds are implemented in the systems. Intonation patterns are 
also affected by surrounding dominant languages. Morphology is the level where 
changes occur last and hesitantly, but cases like the Chuvash plural suffix or the bor- 
rowing of Slavonic feminine suffixes in -(y)ka, etc. into Gagauz and Karaim do occur. 
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Syntactic changes resulting from language contact can be observed particularly 
in the smaller languages spoken in an area they share with a typologically dissimi- 
lar dominant language. Among the Turkic languages of Europe, Karaim and Ga- 
gauz exhibit the most substantial syntactic changes, as has been exemplified in the 
section on complex sentences. Turkish in Germany might be on the way to a simi- 


lar development. 


Table 6: 


Typological features 


Shared typological features 


exceptions 


sound harmony 
agglutinative structure 
suffix initial consonant 
assimilation 
word-final stress 
genera verbi 


possessive construction 


adpositions 
gender 


verbal negation 
tense 
indirective 


habitual present 
mood 


non-finite verb forms 


head-final 
genitive construction 


topic position 
focus position 


present tense copula 
subordinated clauses 


yes 
yes 
yes 


yes 
synthetic 

my xx exists 
postposition 
no distinction 


synthetic 

3 partite (past, present, future) 
evidential, inferential, hear- 
Say, surprise 

yes 

indicative vs. non-indicative, 
synthetic 

imperative, optative, volun- 
tative, necessitative, condi- 
tional, irrealis, possibility 

= converbs, verbal nouns, 
participles 

yes 

genitive precedes head noun 


sentence-initial 
immediate preverbal 


no 
non-finite, left-branching 


Karaim uses also prepositions 
Gagauz has a derivational suffix 
for female members of national- 
ities and occupations as well as 
kinship terms 


Gagauz has developed an analytic 
modal form to express possibility 


Gagauz and Karaim can have in- 
verted order 


focus postverbal in Gagauz and 
Karaim 


finite, right-branching in Gagauz 
and Karaim 


See also the following chapters in this volume: 25 by Extra and 41 by Johanson. 
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Notes 


] In the following, suffixes are presented in a standardized form using capital letters for 
those elements that change according to the various phonological rules of Turkic and 
brackets for elements that may drop due to the nature of the stem they are attached to. In 
principle I follow the suffix notation of Johanson and Csató (1998). Regarding the trans- 
literation of language examples: Gagauz and Turkish are written in a Latin-based alpha- 
bet; Tatar, Bashkir, Noghay, Chuvash, and Karaim are written in a Cyrillic based one. In 
what follows Turkish and Gagauz examples are given in their respective written language 
variety; for the languages in Cyrillic script I use a simplified transliteration system where 
i stands for bi = /ui/, 6 for e = /ø/, ü for y =/y/, ë for e and ë = /V, e for a=/e/, y for u=/j/, 
r for p 2 /t/, k for x, K and x =/k/, g for 2 and e = /g/, § for w=/[/, ž for xc = /3/, č for u = 
/t{/, Y for x and xx —/ds/. This has the disadvantage that similar phonemes are sometimes 
transcribed differently in the various languages but the advantage that Turkish and 
Gagauz examples do not differ from their original. Since examples from the remaining 
languages had to be transliterated, I opted for a system based on what is already found in 
the literature with a moderate adaptation to the Turkish spelling in order to make the 
examples more easily comparable. 

2 Bashkir personal markers are essentially the same as the Tatar ones, displaying the typical 
sound correspondences described above. 

3 Gagauz personal markers are the same as the Turkish ones. 

4 Zeroin parentheses (0) in a suffix notation means that a final vowel of a stem drops before 
this suffix, e.g. ekle- ‘to add’ + -(O)iyor > ekliyor ‘(s)he adds’. 

5 Some Chuvash case suffixes are not as transparent as is usual in Turkic languages; thus, 
they may have different suffix-initial consonants according to the nature of the preceding 
stem-final vowel, a feature that is not found in any other Turkic language. Thus genitive 
and accusative start with an n after a/e and with a y after i, e.g. Sdsi-ye [mouse-oBij], 
lasa-na [horse-oBJ]. After reduced d/é and the labial vowels u/ü the situation is even more 
complicated (for details see Krueger 1961). 
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1. Introduction 


The Uralic language family consists of the following branches: 


1. Finnic and Saami 
a. Finnic (a.k.a. Fennic or Baltic Finnic): Finnish, Estonian, and 5-7 minority 
languages: Karelian (divisible in 2-3 sister varieties or subdialects), Veps, 
Ingrian and Vote in Northwest Russia, Livonian in Latvia. 
b. Saami (Sámi, ‘Lapp’): a chain of sister varieties stretching from Norway 
and Sweden through Finnish Lapland to the Kola Peninsula in Russia. 
2. Mordvin, spoken in the Volga region in European Russia. Two main dialect 
groups and literary standards, Erzya and Moksha. 
3. Mari (a.k.a. Cheremis), spoken in the Volga region. Two main dialect groups 
and literary standards: Hill (Western) Mari and Meadow (Eastern) Mari. 
4. Permic 
a. Komi (a.k.a. Zyryan), spoken mainly in the far Northeast of European Rus- 
sia. Two literary standards: Komi-Zyryan and Komi-Permyak. 
b. Udmurt (a.k.a. Votyak), spoken in the eastern parts of the Volga region. 
5. Ugric 
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a. Hungarian, spoken in Hungary and adjacent areas. 
b. Ob-Ugric, two languages (both with deep internal dialect divisions) of 
Western Siberia: Mansi (a.k.a. Vogul) and Khanty (a.k.a. Ostyak). 

6. Samoyed, a group of indigenous languages in Western Siberia (Nenets has also 
spread to the Far North of European Russia), usually divided into two sub- 
groups: 

a. North Samoyed: Nenets (a.k.a. Yurak), Enets (a.k.a. Yenisey Samoyed), 
Nganasan (a.k.a. Tawgy). 

b. South Samoyed: Selkup (a.k.a. Ostyak Samoyed) and some extinct (cor- 
pus) languages, the last to die was Kamass in 1988. 


The internal structure of the language family has been subject to some debate (for 
a summary, see e.g. Janhunen 2001). Most handbooks depart from a Stammbaum 
with binary splits, beginning with the split of Proto-Uralic into Proto-Finno-Ugric 
and Proto-Samoyed. The competing comb-like model, in which Samoyed is 
merely one of the main branches, would imply that the terms Finno-Ugric and 
Uralic could be used as synonyms. 

All Uralic languages except Finnish, Estonian and Hungarian (in their nation 
states) are endangered. The languages of European Russia (Mordvin, Mari, Komi, 
Udmurt) have hundreds of thousands of speakers, literary standards and titular 
republics with national cultural institutions and are in principle acknowledged 
by law. In most domains of language use outside home, however, the Russian lan- 
guage dominates, and the transmission of the heritage language to the youngest 
generations is threatened especially in towns and cities. Karelian, Vepsian and Ne- 
nets have thousands of speakers and some literary cultivation; most other Uralic 
minority languages are very weakly cultivated, seriously endangered (numbers of 
speakers ranging from a few dozen to a few thousands) or almost extinct. Of the 
Saami varieties, only North Saami (‘Fjell Lapp’, ‘Norwegian Lapp’), with ca. 
30,000 speakers in Norway, Sweden and Finland, appears to be outside imminent 
danger. 

According to most mainstream researchers, Proto-Uralic was spoken during 
the Mesolithic age somewhere in the forest zone of Northern Eurasia (for a survey 
of theories and a somewhat non-mainstream reappraisal, see Janhunen 2001). Dur- 
ing the following six millennia, the ethnolinguistic developments leading to 
today’s daughter languages were conditioned by diverse factors: changes in sys- 
tems of subsistence (agriculture), contacts with neighbouring languages (Indo- 
European and other, probably also extinct and unknown ones), finally the political 
borders and, in particular, the expansion of East Slavic into today’s Central and 
Northern Russia (previously populated by speakers of Finno-Ugric and other lan- 
guages) and the emergence of the Russian empire. The Hungarian nation came into 
being as the half-nomadic Hungarians settled down in the Carpathian Basin at the 
end of the first millennium AD, while Finnish and Estonian in the present sense 
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came into being from the Middle Ages on in political processes which settled the 
borders between Swedish-dominated Finland, German-dominated Estonia and 
other (mostly Russian-dominated) parts of the Finnic area. 

The Uralic languages thus represent a time depth comparable to that of Indo- 
European and, correspondingly, a great variety of typological characteristics, of 
ecological and socio-political circumstances. Similarly, the grade of literarisation, 
language planning and research varies greatly, from unwritten and scantily docu- 
mented varieties to full-scale cultivated and thoroughly researched European 
nation-state languages. However, there is basic documentation (grammars, diction- 
aries, text collections) available for practically all Uralic language varieties. Due to 
the dominance of the Neogrammarian tradition in the first Golden Age of fieldwork 
(until World War I), (historical) phonology, lexicology and, to some extent, mor- 
phology are strongly represented in traditional Uralistics, while there is much less 
research into the syntax of the minor Uralic languages. 

Most Uralic languages have standard orthographies based on the Cyrillic (all 
Uralic languages of Russia except the Finnic group) or Latin alphabet; in Uralic 
studies, the Finno-Ugric transcription (FUPA) based on the use of diacritics (e.g. 
macron for length, acute accent for palatalization) is also used. 


2; Phonology 


2.1. Vowels 


There is no consensus as to the reconstruction of the Proto-Uralic vowel system; in 
particular, questions of quantity correlation or the possibility of mid vowels (mid/ 
back i and/or e) are not completely settled. Sammallahti (1988) postulates an eight- 
vowel system with three tiers and front-back correlation (u o a/ie ä, plus a labial ü 
and an illabial back/mid i), which resembles the Finnic vowel system stripped of its 
obvious innovations such as /ó/. Today's Uralic vowel inventories are often 
relatively large: 8-9 vowel phonemes, or even more, if there is a quantity corre- 
lation (see below) or an opposition between series of full and reduced vowels (as in 
Mari or Khanty). However, Erzya Mordvin has a simple five-vowel system, and 
most Permic varieties have seven vowel phonemes (i e a o u plus a high and a non- 
high illabial mid vowel). 

Many Uralic languages (Finnic, Saami, some Mansi and Hungarian varieties) 
have diphthongs originating from long vowels, reanalysed combinations of vowel 
and glide, or introduced by loanwords (for example, the old Baltic loanwords in 
Finnic). In other languages such as Standard Hungarian, there are no unambiguous 
diphthongs (cf. Fin voi *butter', in which oi is traditionally interpreted as a diph- 
thong, and its cognate, Hng vaj, in which the phonetically fairly similar aj is tradi- 
tionally interpreted as a combination of vowel and glide). Some Finnic varieties — 
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the Vepsian-based Lude variety of Karelian, and Livonian — even have triphthongs 
(Lude rebuoi ‘fox’, d’iäi ‘(s/he) remained"). 

For non-first (that is, non-stressed) syllables in Proto-Uralic, a heavily reduced 
system (only non-labial vowels, only two tiers) is reconstructed. In today’s Uralic 
languages, labial vowels in non-first syllables can often be traced back to a combi- 
nation of vowel plus (suffixal) labial consonant. 

Vowel harmony (only either front or back vowels are allowed in a non-com- 
pound word, which means that suffixes typically have front and back allomorphs) 
or traces of it appear in some Finnic (Standard Estonian and Livonian, for instance, 
have lost it), Mari, Ob-Ugric and Samoyed varieties and in Hungarian. Some Mari 
varieties and Hungarian also know a restricted labial harmony (cf. Hng hdz-hoz ‘to 
the house’: nép-hez ‘to the people’: tuz-höz ‘to the fire"). 


2.2; Consonants 


The consonant phoneme inventory of Proto-Uralic/Proto-Finno-Ugric was fairly 
poor (leading to a wide range of possible consonant substitutions in possible early 
IE loanwords, which has given rise to a heated debate on the limits of the use of 
sound substitutions in etymological explanations). The mainstream reconstruction 
comprises three voiceless stops (p t k), voiced dental spirants, two or three sibilants 
(s, Sand perhaps 5), at least one affricate, two liquidae (l r, perhaps even a palatal- 
ized l’), four nasals (m n ń p) and two glides (w j). Following the suggestion of Juha 
Janhunen, Sammallahti (1988) also reconstructs a mystery consonant *x, function- 
ally somewhat similar to the PIE laryngeals. 

Hungarian, Permic and Mordvin have enhanced this repertoire by introducing a 
voice opposition (voiced stops, sibilants and affricates) and extending the palatal- 
ization opposition to the stops. In Proto-Finnic, in contrast, phoneme mergers (loss 
of palatal opposition) and losses (vocalization of intervocalic 9) took place; how- 
ever, many Finnic varieties have introduced a new palatalization (often triggered 
by a postconsonantal i/j) and new voiced stops and sibilants. 

In addition to voiced stops and sibilants, Uralic consonant repertoires may have 
been enhanced with f, x (often in loanwords), sometimes even other voiceless con- 
sonants (e.g. voiceless liquids appear in Moksha and — in expressive vocabulary — 
in some Saami varieties). 

For the Uralic languages of Russia, the presence and strength of palatalization 
and voice opposition in the consonant system often seems to correlate with the 
strength of Russian influences. 
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2.3. Quantity and stress 


The question of quantity opposition in Proto-Uralic is subject to some controversy, 
but at least geminate stops can be reconstructed. A quantity opposition for vowels 
appears in most of Finnic, in Mansi and in Hungarian; geminate consonants as op- 
posed to non-geminate ones are even more widely spread. In most (Latin-script) 
orthographies, long quantity is marked by doubling the letter, except for the vowels 
in Hungarian (where length is marked with acute accent: á é í ó ú ű ő) and Livonian 

In Standard Finnish and Hungarian, quantity is basically independent of stress. 
In Estonian and Livonian, by contrast, the centralization (erosion of unstressed syl- 
lables) has restricted the long quantity to the stressed (initial) syllable and given 
rise to even more complex quantity alternations. In Estonian, original long sounds 
or syllables have two morphologically conditioned variants. The so-called 3rd 
quantity grade (a.k.a. ‘heavy accent’), alternating with the 2nd quantity grade, can 
in many cases be historically explained as a compensatory lengthening connected 
with the reduction of the suffixal syllables in certain forms: cf. Soome = /2soome/ 
Gen.Sg. ‘(of) Finland’ (< *soome-n) vs. Soome = /3soome/ Illative Sg. ‘to Finland’ 
(« *soome-hen). 

The stress system in Proto-Uralic was probably similar to that of Finnic-Saami 
and Hungarian: main stress always on the first syllable, secondary stresses in prin- 
ciple on odd non-last syllables. Subject to heavy Turkic influence, most notably in 
the languages of Central Russia, the stress may have shifted towards the end of the 
word, to the last syllable (as in Udmurt) or the last full vowel (as in some Mari var- 
ieties); the stress system in Mordvin is particularly complicated (Estill 2004). 


2.4. Word architecture and phonotactics 


The canonic structure of underived lexical word stems in Proto-Uralic was disyl- 
labic (as opposed to monosyllabic deictic or grammatical stems). In Finnic, where 
the original disyllabic structure has been retained fairly well, most word stems of 
three or more syllables are still transparent derivatives or loanwords. In the central 
branches of the Uralic language family in particular, reductive changes have af- 
fected original stem-final and suffixal elements, so that in Komi or Hungarian, for 
instance, underived inherited words are typically monosyllabic. 

Word-initial consonant clusters (of two or more consonants) or word-internal 
consonant clusters (of three or more consonants) were typically avoided in Uralic. 
In older loanwords, word-initial consonant clusters were often broken by dropping 
initial consonants or inserting vowels (cf. Fin koulu, Hng iskola ‘school’; Fin risti, 
Hng kereszt ‘cross’ < [reflexes of] Slavic *kristif). Newer consonant clusters may 
appear in (more recent) loanwords or may be due to loss of (unstressed) vowels, as 
in Mrd (Erzya) pra ‘head, end’ (< *perä), ksi "bread (< *kürsä). 
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2.5. Morphophonology 


Many Uralic languages display morphophonological (stem or suffix) alternations 
resulting in inflection types or 'classes'. In Finnic, where the original stem-final 
elements are well preserved, these are often caused by the loss or change of the 
stem-final vowel before certain suffixes (e.g. Fin kala ‘fish’: Allative Pl. kalo-i-lle 
*to/for the fishes’; tuuli ‘wind’: Gen Se, tuule-n : Partitive Sg. tuul-ta: Allative PI. 
tuul-i-lle ‘to the winds’). In languages characterized by reduction and restructur- 
ation of original suffixal elements such as Hungarian or Permic, there are inflection 
types based on stem alternations or the choice of ‘linking vowel’ between stem and 
suffix (possibly reflecting the original stem-final vowel), e.g. Hng hdz ‘house’: 
Mom DI haz-a-k but lány ‘girl’: Nom.Pl. lany-o-k. 

A peculiar phenomenon causing morphophonological alternations is consonant 
gradation, which affects word- or stem-internal stops in (most of) Finnic and all 
stem-internal consonants in (most of) Saami: depending on whether the following 
(second) syllable was originally open or closed, the stem-internal consonants ap- 
pear in the 'strong' or in the ‘weak’ grade. Similar phenomena also appear in some 
Samoyedic varieties, and Helimski (1995) even traces consonant gradation back to 
Proto-Uralic (but cf. Koivulehto and Vennemann 1996). 


3. Morphology 


Most bound morphs in Uralic are suffixes; exceptions include prefixes or prefix- 
oids used in adnominal negation (Fin epd-, Est eba-, mitte-, Hng nem-) and verbal 
prefixes (or *pre-adverbs") carrying aspectual or Aktionsart functions in some lan- 
guages, most notably in Hungarian (for an overview of these elements and their de- 
batable status, see Kiefer and Honti 2003). 

Typologically, all Uralic languages, although often called 'agglutinative', dis- 
play a fair amount of stem or suffix alternations and even morpheme fusion, and 
are thus rather far away from the ideal agglutinative type (for calculated indices see 
Korhonen 1969). Famous examples of a typological shift towards the fusional type 
are Livonian, Estonian and Saami. (Claims of Estonian being a fusional language 
are exaggerated, however; cf. Grünthal 2000.) 


3.1. Inflectional morphology 


By morphological criteria, words in Uralic can be fairly neatly divided into three 
classes: nouns (inflecting for case and number), verbs (inflecting for person, tense 
and mood) and a residue class of uninflected or only partly inflected ‘particles’. An 
example of the latter are adpositions (sometimes forming series of local-case-suf- 
fixed forms for different spatial relations, as in Fin auto-n alla ‘(be) under the car’: 
auto-n alle ‘(get) under the car’: auto-n alta “from under the car’). 
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In many Uralic languages there are a few word stems which are ambiguous 
concerning the distinction between nouns and verbs, and some languages (notably 
Mordvin and, to some extent, Samoyed) allow the conjugation of predicate nouns 
(as in Nnts ríe-dm? ‘woman-1SG’ ‘I am a woman’). However, these cases are 
either too few or too specific to question the meaningfulness of the noun/verb divi- 
sion in general. (Mordvin and Samoyed, for example, allow the conjugation of 
predicate nouns as a form of subject agreement but not, for instance, deriving infi- 
nite verb forms from noun stems; cf. Turunen 2010.) 


3.1.1. The verb 


The verbs in all Uralic languages inflect for tense, mood, and person/number (1st, 
2nd and 3rd person in singular and plural, in Saami, Ob-Ugric and Samoyed lan- 
guages also dual). 


3.1.1.1. Tense and mood 


Numbers and types of tense and mood categories vary greatly across Uralic; tense 
and mood markers typically occupy one slot in the morphological structure of the 
word and go back to a Proto-Uralic ‘pool’ of verbal suffixes with diverse functions, 
but no clearly structured original system of tense and mood inflections can be 
reconstructed. 

Most Uralic languages have at least one present (non-past) and one past tense, 
but usually (with the exception of Permic) no morphologically expressed future 
tense. The Finnic languages have developed two analytic past tenses, a BE perfect 
(note that there is no HAVE verb in most of Uralic) and a pluperfect, resulting in a 
four-tense system of the ‘Germanic’ type; in Modern Hungarian, only one past 
tense based on person-marked past participles (‘the Slavic type’) is used. Parti- 
ciple-based past tenses occur elsewhere in Uralic, too. Some Samoyed languages 
know the aorist, a temporally ambivalent tense category sometimes traced back to 
Proto-Uralic (e.g. by Janhunen 1981). 


3.1.1.2. Object conjugation 


In Samoyed, Ugric and Mordvin, there are systems, of varying complexity, for 
object conjugation: the absence or presence of the (definite) object or even object 
person can be morphologically marked on the verb, as in Hng /dt-ok ‘I see [some- 
thing, I can see, I am not blind]’ - lat-om ‘I see (ity — lat-lak ‘I see you’. (For the 
morphological structure of these systems, see Trosterud 2006.) 
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3.1.1.3. Negation 


In most branches of Uralic, negation can be expressed with the help of a negative 
auxiliary (PU *e-). While the negative auxiliary carries the person (sometimes even 
tense) marking, the negated verb typically appears in an impersonal ‘connegative’ 
form, as in Fin e-n elä — Nnts ni-dm? jil’e-? (NEG-1SG live-CNG) ‘I do not live’. 

Some languages have generalised the 3rd singular form of the negative auxili- 
ary for all persons, as in Estonian or Terminal Kamass, or developed a special ne- 
gation particle to be used together with normally inflected main verbs, as is typi- 
cally the case in Ugric (cf. Honti 1997a, Miestamo 2005: 218-219). Many Uralic 
languages also know a specific negative counterpart for the ‘existential’ (e.g. Hng 
nincs ‘there is not, does not exist’ as opposed to van ‘(there) is, exists’). 

Non-finite verb forms often have negative counterparts formed with abessive 
or caritive case or derivational suffixes: 


(1) näk-y-mä-tön mies (Fin) 
see-REFL-INF-CAR man 


2) a lát-hat-atlan ember (Hng) 
DEF see-POT-CAR man 
‘the invisible man’ 


(3) hän läht-i sano-ma-tta  sana-a-kaan (Fin) 
s/he leave-3SG.PST say-INF-ABE word-PART-POL.P 
‘s/he left without saying a word’ 


3.1.1.4. Voice 


Passive voice of the SAE type (promotion of non-agents to grammatical subjects) 
is rather rare in Uralic but does appear in Samoyed, Saami and Ob-Ugric (Kulonen 
1989). Reflexive and causative formation, on the other hand, is productive in most 
Uralic languages but is regarded as part of deverbal word formation in most gram- 
mar traditions. True reflexive conjugation appears in Eastern Finnic languages. 


3.1.1.5. Non-finite forms 


The Uralic languages typically display ‘a richness of non-finites that are diachroni- 
cally intermediate between transparent case forms of action nominals and fully 
opaque infinitives or converbs' (Ylikoski 2003: 211). By morphological criteria, 
the non-finite verb forms are traditionally classified into participles and infinitives; 
the border between these classes and other deverbal (agent and action) nouns is 
often unclear. Participles are forms with a full nominal inflection, infinitives may 
inflect in a few cases, as is the case with some of the 4—5 ‘infinitives’ in traditional 
Finnish grammar. 
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Non-finites (infinitives, participles) and other deverbal nouns (e.g. agent 
nouns) may develop into tense or mood-like forms, as the so-called “modus obli- 
quus’ or quotative in Estonian or Livonian. 


(4) ta  tund-vat sind (Est) 
s/he know-QUOT you.PART 
‘(they say ~ s/he says that) s/he knows you’ (“s/he of.the.one.who.knows 
you") 


(5) ta  kitd-b ku ta | sinda tundi-ji (Liv) 
s/he say-3SG that s/he you.PART know-AGT 
“s/he says that s/he knows you’ (“ ... that s/he you the.knower") 


3.1.2. The noun 


In Uralic, there are typically no sharp morphological borders between the sub- 
classes of the noun category: adjectives can be converted to nouns and nouns, 
if semantically and pragmatically appropriate, can sometimes take comparative or 
superlative endings, as in Fin syksy-mmä-llä (‘autumn-COMPR-ADE’) ‘later in 
the autumn’. In principle, all nouns inflect for case and number, and in most of 
Uralic suffixal marking of possessor person (possessive suffixes) is possible as 
well. There are no grammatical genders. 


3.1.2.1. Case 


Most Uralic languages are characterized by large case systems (Finnish has, by most 
accounts, 15 cases, Estonian has 14, Komi has 16, and the number of cases in Hun- 
garian in different grammars ranges between 16 and 28). However, for Proto-Uralic 
only 6 cases can be reconstructed, and in some Khanty varieties there are only three 
productive cases. The number of cases is sometimes difficult to determine, as the 
border between ‘adverbial’ or local cases and derivational suffixes forming adverbs 
is not always clear (see, for example, the discussion in Kiefer 1987 for Hungarian). 
The case endings may consist of agglutinated postpositions (sometimes transpar- 
ently going back to relational nouns, as the bV element in the Hungarian ‘internal’ 
local case suffixes, etymologically related to bel ‘intestine’) or reanalysed suffixal 
elements. (For case systems in Uralic, see especially Korhonen 1992.) 
Traditionally, the case systems in Uralic are divided into grammatical (un- 
marked nominative, genitive, possibly accusative, in Finnic also partitive) and 
local or adverbial cases. The latter group can be divided into (1) tripartite series for 
spatial relations in terms of ‘into/onto’, *in/on' and ‘from’ and (ii) further spatial 
and other adverbial functions such as prolative (‘through, by way of X’), causal/ 
final (‘for the sake of X, in order to get X"), comitative/instrumental (‘with X"), 
essive/translative (‘as X’, ‘becoming X’) or terminative (‘until X, up to X"). There 
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may be more than one tripartite series of type (1): Finnic has two series of local 
cases (‘internal’ and ‘external’), Hungarian has three (or dialectally even four). 

A characteristic innovation in Finnic is the development of the historical 
*from' case (separative in *-/A) into the partitive case. The function of the partitive 
is, generally speaking, to express low transitivity (partial object, imperfective as- 
pect, unfinished or negated action), unboundedness or indefiniteness. 


3.1.2.2. Number 


In most of Uralic, there are two number categories, singular (unmarked) and plural. 
An absolute dual (that is, not connected with person marking) is only preserved in 
Ob-Ugric and Samoyed, while Saami languages only know a relative dual (com- 
bined with the person category, i.e. in person and possessive marking). 

According to most mainstream reconstructions, Proto-Uralic had two plural 
suffixes, *t for absolute plural and *j (*i) for non-nominative plural forms. In the 
languages which preserve (traces of) these suffixes, such as Finnic, morphological 
and morphophonological complexities in the plural paradigms may reflect the fact 
that the plural paradigms were originally deficient, perhaps consisting merely of an 
‘absolute plural’ and a ‘relative plural’ (casus componens) form. 

In the central branches of the language family, the system of plural marking has 
been restructured: the same plural markers, either the old *r (as in Mordvin or 
Ob-Ugric) or innovative suffixes (as in Hungarian, Mari and Permic) appear across 
the whole paradigm. The new plural markers obviously go back to (collective or 
local) derivational suffixes or even lexical morphs: lexical etymologies have been 
suggested for Komi -jas ~ Udmurt -(j)os, Mari -wlak or -Samac. In addition to this 
‘absolute’ plural marking, there is often also a relative plural (or dual) marking 
connected with personal or possessive suffixes (i.e. marking of plural possession). 
(A detailed survey on number in Uralic is presented by Honti 1997b.) 


3.1.2.3. Possessive suffixes 


Most Uralic languages can use possessive suffixes to mark the possessor, in 
attributive possession (Hng autó-m, Fin auto-ni ‘my car’) and in some languages 
also predicative possession (Hng van autó-m ‘I have a car’, lit. ‘there.is my.car’). 
In languages with two different sets of personal suffixes for subject and object con- 
jugation, the possessive suffixes typically coincide with or resemble the personal 
suffixes for the object conjugation (cf. Hng autó-m ‘my car’ ~ lát-o-m ‘I see (it)’). 

In Finnic, the system of possessive suffixes seems to be decaying: there are no 
separate suffix series for plural possession as in many other branches of Uralic, and 
in Southern Finnic languages (such as Estonian and Livonian) in particular, there 
are only historical relicts of possessive suffixes. In most other Uralic languages, 
there are complete possessive paradigms with plural possessor and possession 
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marking, sometimes involving fusion of possessive and case suffixes, as in Komi 
gort-as ‘in(to) his/her house’ (cf. gort-yn ‘in the house’, gort-ys ‘his/her house’). 

Possessive suffixes are often used for the marking of agent person with nomi- 
nalizations and infinites, as in Fin tulle-ssa-ni ~ Hng jött-ö-m-ben ‘as Icome/came' 
(‘in my coming’, note the different order of possessive and inessive case suffix). 


3.1.2.4. Definiteness 


In Permic, possessive suffixes can also act as markers of definiteness (Komi vör-as 
‘in(to) the forest’, lit. ‘in(to) his/her forest’). In Mordvin, definiteness markers 
have developed from suffixed demonstrative pronouns (kudo ‘house’ — kudo-s ‘the 
house’). 


3.1.2.5. Comparison 


Although Finnic-Saamic and Hungarian share a comparative suffix (*mpV > Hng 
-bb), this is generally considered a convergent development rather than ancient 
heritage, as other Uralic languages employ comparative markers of different ori- 
gins (e.g. Komi -dZyk) or analytic/periphrastic comparatives. 


3.2. Derivational morphology 


Uralic languages typically have rich systems of morphological word-formation, 
which is almost exclusively suffixal (or fusional). Unless conversion within the 
noun category (e.g. adjectives to nouns) is considered, conversion or zero deri- 
vation is fairly rare and may represent a very recent development, as in Estonian, 
where the coincidence of the i suffix for denominal verbs and the i stem vowel typi- 
cally added to recent loanwords has resulted in what can synchronically be inter- 
preted as conversion: cf. arst(i-) ‘physician’ [« Grm Arzt] with arsti- ‘to heal’. 

The derivational suffixes are usually divided according to part-of-speech cat- 
egories into noun, verb and adverb suffixes as well as into denominal and deverbal 
ones. Typically, however, these boundaries are far from clearcut, and the same or 
similar suffixes often appear in more than one of these subclasses (for instance, the 
same verb-forming suffix can be attached to both noun and verb stems). 


4. Syntax 


Compared with the traditional core areas of comparative Uralic studies — historical 
phonology, morphology and lexicology - research into comparative Uralic syntax 
is still a relatively weakly cultivated area. This is partly due to great differences be- 
tween research traditions in terminologies; accordingly, there is very little research 
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available on minor Uralic languages that would enable cross-language studies on 
Uralic syntax. 

The question of contact-induced areal features in the syntax of different Uralic 
subgroups often emerges in literature. For instance, it seems obvious that while 
there are signs of ‘Europeanization’ and the impact of written language in the syn- 
tax of the westernmost and *major' Uralic languages (Hungarian, Finnish, Esto- 
nian), the syntax of the Uralic languages of Central Russia and the Volga region is 
characterized by strong Turkic influences. Beyond obvious statements of this kind, 
however, very few solid and fine-grained studies have been undertaken. 


4.1. Word order and sentence types 


The word order of neutral affirmative sentences in most Uralic languages is usually 
described as SOV/SXV, with the notable exception of SVO/SVX in Finnic-Saami 
(and, possibly, Komi). In fact, the word order in Uralic is largely determined by in- 
formation structure (topic/focus, ‘discourse configurationality’), although there 
are considerable differences between individual languages — for instance, in Esto- 
nian the word order, with its ‘Germanic’ V2 tendency, seems to be a little more 
fixed than in Finnish. (For word order in the Uralic languages of Europe, see es- 
pecially Vilkuna 1998.) 

A further challenge to definitions based on the order of S, V and O is posed by 
the fact that sentence types without a canonical subject NP are fairly usual in many 
Uralic languages. Together with the emergence of the partitive case for ‘partial’ 
(irresultative, imperfective aspect, unbounded, negated) objects in Finnic, a type of 
existential sentences (cf. example (8) below) has developed which challenges the 
notions of syntactic subject and object. Meteorological verbs usually do not need a 
dummy subject (Fin Sataa. '[It] is raining.’), and modal constructions with the 
agent or experiencer in a non-nominative case or a non-subject role are usual. 


(6) Men-ne-m kell. (Hng) 
go-INF-PXISG must 
‘I must go.’ 

(7) Minu-n pitä-ä men-nd. (Fin) 
I-GEN keep-3SG go-INF 
‘I must go.’ 


Questions are formed with question words (interrogative pronouns/adverbs), 
which typically appear in sentence-initial position (as in Finnic) or possibly also in 
other positions (as in the pre-verbal focus position in Hungarian), polarity ques- 
tions with intonation or with question particles (as typically in Finnic; Finnish uses 
a question clitic -ko/kó attached to the focalized word or constituent at the begin- 
ning of the sentence, Estonian and Livonian sentence-initial particles). 
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4.2. Alignment 


Most Uralic languages can be classified as belonging to the nominative/accusative 
alignment type, despite some morphosyntactic 'grey zones'. However, all Uralic 
languages do not have a completely productive accusative case which would be 
obligatorily used for all objects; in Finnic, the situation is particularly complicated 
due to the emergence of the partitive case (for 'irresultative' or quantitatively 
indefinite objects) and a phonologically conditioned coincidence of the original 
accusative (in *-m > -n) with the genitive (in -n). Unmarked (nominative) objects 
appear in many Uralic languages, and this possibility of using nominative case for 
(inanimate or indefinite) objects is often considered an ancient feature. Second, the 
Finnic languages know sentence types with object-like case-marking for argu- 
ments which correspond to SAE syntactic subjects, as in the following examples: 
in existential sentences (8) or experiencer-causative sentences (10), compare with 
transitive sentences (9) and (11). 


(8) Piha-lla | juokse-e poik-i-a. (Fin) 
yard-ADE run-3SG boy-PL-PART 
“There are boys running on the yard.’ 


(9) Liisa inhoa-a poik-i-a. (Fin) 
L. despise-3SG boy-PL-PART 
‘Lisa hates boys.’ 


(10) Poik-i-a pelotta-a. (Fin) 
boy-PL-PART frighten-38G 
“The boys are afraid.’ 


(11) Kummitus pelotta-a poik-i-a. (Fin) 
ghost frighten-3SG boy-PL-PART 
"The ghost frightens the boys./The boys are afraid of the ghost.’ 


However, as argued in more detail by Havas (2006), these ergative-like construc- 
tions (appearing alongside ‘normal’ nominative/accusative alignment) do not suf- 
fice to mark the whole language as an ergative one, and genuine ergativity in Uralic 
can only be found in Eastern Khanty, as in example (12), with the transitive verb in 
an object-conjugation form (Havas 2006: 107): 


(12) dpa-m-na tu tdyi  on(t)altd (Khanty E) 
father-PXSGI-LOC that place know.PRF.3SG.SGOBJ 
*my father knew that place’. 
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4.3. The noun phrase 


Modifiers (adjectives, demonstratives, possessors) typically appear before their 
heads. Usually, Uralic adpositions are postpositions (often diachronically going 
back to a possessed relational noun, which means that the head of the postposition 
typically appears in the genitive case in languages which have preserved the old in- 
herited genitive); however, in Finnic and Saami there is also a small group of prep- 
ositions. 

Only Hungarian has developed indubitable articles (indefinite egy from egy ‘1’, 
definite a(z) from the demonstrative pronoun az); besides, Mordvin has suffixal 
definiteness marking (suffixes historically deriving from agglutinated demon- 
strative pronouns) which is traditionally called *the definite declension". Whether 
the demonstrative pronoun determiner in colloquial varieties of Finnish and Esto- 
nian (Fin se, Est. see) can already be called ‘article’ has recently been subject to 
some discussion. 

In Finnic, adjective and pronominal modifiers usually show a complete agree- 
ment in case and number, as in (13a); in other branches of Uralic agreement is oc- 
casional or restricted to pronominal modifiers, as in (13b). 


(13a) nen-de-s suur-te-s maja-de-s (Est) 
DEM.PL-PL-INE big-PL-INE house-PL-INE 
‘in these big houses’ 


(13b) eze-k-ben a nagy háza-k-ban (Hng) 
DEM-PL-INE DEF big  house-PL-INE 
‘in these big houses’ 


Pronominal subject constituents may be omitted (PRO-DROP), unless empha- 
sized, in most Uralic languages. In many Finnic varieties, however, 3SG subject 
pronouns cannot be dropped, as the zero subject does not carry an anaphoric refer- 
ence but refers to a generic subject (‘zero person’). In languages with an object 
conjugation, object pronoun dropping is also possible. 

With numerals and many quantifiers, the head word typically stands in the sin- 
gular; the quantifier often behaves syntactically like an adjective modifier, but in 
Finnic it may also be the syntactic head of the NP (Finnish kolme auto-a ‘three 
cars’ with auto in the partitive singular, literally ‘[a set of] three of/from [the 
species] car"). 
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4.4. Predicate nominals 


In most branches of Uralic, predicate nouns and adjectives do not require the use of 
a copula, except as carrier of person and TAM marking. Consider (14a-c): 


(14a) Peter tanar. (Hng) 
P. teacher 
‘Peter is a teacher.’ 


(14b) Peter tanár ^ vol-t. (Hng) 
P. teacher be-PST.3SG.SUBJ 
‘Peter was a teacher.’ 


(14c) Tanár — vagyo-k. (Hng) 
teacher be-1SG.SUBJ 
‘I am a teacher.’ 


In Mordvin and partly in Samoyed, predicate nouns (even inflected ones) or even 
nominal verb forms can take verbal person or TAM marking (examples from 
Bartens 1999: 169-170): 


(15) tarka-s bolota-l' (Mrd Erzya) 
place-DEF swamp-PST.3SG 
‘the place was a swamp’ 


(16) kuckordo-ms-at (Mrd Erzya) 
kick-INF-2SG 
“you deserve to be kicked’ (‘you are the one to be kicked’) 


On the other hand, Finnic and most Saami varieties regularly use the copula with 
all nominal predicates. 

The case marking of predicate nominals in Finnic is particularly complex. 
In addition to the characteristically Circum-Baltic (cf. Stassen 2001) ‘double 
encoding’ (either nominative or, for expressing a temporary or accidental function, 
quality or occupation, the essive or translative case), there is — particularly in 
Finnish — the option of the partitive case for quantitatively unbounded predicate 
nominals. 

The ‘dynamic predicates’ (which denote becoming something or changing into 
something) show a variety of encoding strategies, employing case marking and/or 
dynamic “copula” verbs. 


(17) Son universit ete-nt’ prado-ma-do 
s/he | university-GEN.DEF finish-NMLZ-ABL 
mejl'e karma-s tonavtica-ks. (Mrd Erzya) 
after begin-PST.3SG teacher-TRANSL 


* After graduating from university, s/he became a teacher.’ 
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(18a) Yliopisto-sta valmistu-ttu-a-an 
university-ELA  graduate-PTC-PART-PX3 
hän ryhty-i opettaja-ksi. (Fin) 
s/he begin-PST.3SG teacher-TRANSL 
"After graduating from university, s/he became a teacher.’ 


(18b) Yliopisto-sta valmistu-ttu-a-an 
university-ELA  graduate-PTC-PART-PX3 
hüne-stü — tul-i opettaja. (Fin) 
s/he-ELA come-PST.3SG teacher 
"After graduating from university, s/he became a teacher.’ 


(19) Miután elvégez-te az X egyeteme-t, 
after finish-PST.3SG.OBJ DEF university-ACC 
tanár le-tt. (Hng) 


teacher become-PST.3SG.SUBJ 
‘After graduating from university, s/he became a teacher.’ 


4.5. Possessive constructions 


For adnominal possession, Uralic languages use one or more of the following three 
strategies: (1) case-marking of possessor, (ii) head-marking with possessive suffix, 
or, marginally, (iii) simple juxtaposition. 

The genitive case in *-n probably goes back to Proto-Uralic and is widely used 
for marking the possessor (in a wide sense), for instance, in Finnic-Saami and 
Mordvin (example (20) from Estonian). Ugric and Permic have lost the original 
genitive and use simple head-marking with possessive suffix or a combination of 
possessive suffix and a dative or secondary 'genitive' case (of local origin). 


(20) Peetri auto (Est) 
Peter. GEN car 
*Peter's car’ 


(21a) Péter autó-ja (Hng) 
Peter car-PX3SG 
*Peter's car’ 


(21b) Péter-nek az autó-ja (Hng) 
Peter-GEN DEF car-PX3SG 
*Peter's car’ 


(22) Petyr-lón | masina-ys (Komi) 
Peter-GEN car-PX3SG 
*Peter's car' 
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Simple juxtaposition of possessor and possession may also be marginally possible, 
at least in those languages which have lost the old genitive case. However, in cases 
of non-canonical or non-referential possession (as in Komi vóv jur ‘a horse['s] 
head’), the boundary between juxtaposition and incorporating composition may 
be difficult to draw (cf. Komi lym mort ‘snow man’ with Est lume-memm, lit. 
*snow.GEN-old.woman'). 

For predicative possession, most Uralic languages use possessive suffixes 
(usually combined with existential or BE verbs) and/or local cases (a construction 
similar to the existential construction, cf. example (8) above) or the genitive 
(in Ugric and Permic, with a dative or habitive background). HAVE verbs are un- 
known or very marginal in most of Uralic except Ob-Ugric. 


(23) Péter-nek van autó-ja (Hng) 
P.-GEN  thereis car-PX3SG 
‘Peter has a car.’ 


(24) Peetri-l on auto. (Est) 
P.-ADE be.3SG car 
‘Peter has a car.’ 


(25) Van auto-m. (Hng) 
thereis car-PX1SG 
‘I have a car’ 


4.6. The complex clause 


According to most mainstream reconstructions, Proto-Uralic knew no conjunctions 
(and probably no relative pronouns either). Most conjunctions in modern Uralic 
languages are either transparently derived from pronominal stems (e.g. Finnish 
ku(i)n ‘as, when’ or the Hungarian complementizer hogy from the pronoun stem 
*ku-) or borrowed from neighbouring languages (for instance, Finnic ja ‘and’ from 
Germanic, Livonian bet ‘but’ from Latvian, Russian jesli ~ koli ‘if’ in many Uralic 
languages of Russia). 

Corresponding to subordinate clauses, there were left-branching construc- 
tions involving verbal nominals (with complements), for instance, participles 
(‘I saw a by-the-river fishing man’ instead of ‘I saw a man who was fishing by the 
river’), infinitives (“I him-to-go-fishing thought’ for ‘I thought he would go fish- 
ing’) or action nominals (‘his-by-the-river fish-catching surprised me’ for ‘I was 
surprised (to hear) that he caught fish by the river’), with possessive suffixes for 
the marking of subject person. These construction types are still present in all 
branches of Uralic (in the Finnish grammar tradition, they are called ‘sentence 
equivalents’), although there are also subordinate clauses. In many Uralic lan- 
guages of Russia, borrowed Russian conjunctions (such as ¢ctoby ‘in order to, so 
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that’ or jesli ‘if’) have contributed to a radical restructuring of the syntax of com- 
plex clauses. 


See also the following chapters in this volume: 1 by Holvoet, 3 by Askedal, 5 by 
Hansen, 7 by Menz, 15 by van der Auwera, 17 by Wälchli, 19 by Ostman, 20 by 
Stern, 22 by Hill, 23 by Breu, 37 by Mailhammer, 40 by Fellerer and 42 by Stern. 


Acknowledgments. My thanks to Beáta Wagner-Nagy, Lotti Viola and Nikolay 
Kuznetsov for discussing and checking the Samoyedic, Hungarian and Komi 
examples, and to Bernhard Wälchli for some very constructive remarks and sug- 
gestions. 


Bibliography 


Bartens, Raija 
1999 Mordvalaiskielten rakenne ja kehitys. [Structure and development of the 
Mordvin languages.] Helsinki: Suomalais-Ugrilainen Seura. 
Estill, Dennis 
2004 Diachronic Change in Erzya Word Stress. Helsinki: Suomalais-Ugrilainen 


Seura. 
Grünthal, Riho 
2000 Typological characteristics of the Finnic languages: a reappraisal. In: Johanna 


Laakso (ed.), Facing Finnic: Some Challenges to Historical and Contact Lin- 
guistics, 31—63. Helsinki: Helsingin yliopisto, suomalais-ugrilainen laitos. 
Havas, Ferenc 
2006 Die Ergativität und die uralischen Sprachen. Finnisch-Ugrische Forschungen 


59: 81-131. 
Helimski, Eugen 
1995 Proto-Uralic gradation: continuation and traces. In: Heikki Leskinen et al. 


(eds.), Congressus Octavus Internationalis Fenno-Ugristarum. Pars I, 17—51. 
Helsinki: Suomalais-ugrilainen seura. 
Honti, László 
1997a Die Negation im Uralischen I-II. Linguistica Uralica 1997: 81-96 and 
161-176. 
Honti, László 
1997b | Numerusprobleme. (Ein Erkundungszug durch den Dschungel der uralischen 
Numeri.) Finnisch-Ugrische Forschungen 54: 1-126. 
Janhunen, Juha 
1981 On the structure of Proto-Uralic. Finnisch-Ugrische Forschungen 44: 23-42. 
Janhunen, Juha 
2001 On the paradigms of Uralic comparative studies. Finnisch-Ugrische Forschun- 
gen 56: 29-41. 
Kiefer, Ferenc 
1987 The cases of Hungarian nouns. Acta Linguistica Hungarica 37: 93-101. 


The Uralic languages 197 


Kiefer, Ferenc and László Honti 
2003 Verbal ‘prefixation’ in Uralic languages. Acta Linguistica Hungarica 50: 


137-153. 
Koivulehto, Jorma and Theo Vennemann 
1996 Der finnische Stufenwechsel und das Vernersche Gesetz. Beitrüge zur Ge- 


schichte der deutschen Sprache und Literatur 118: 163—182. 
Korhonen, Mikko 
1969 Die Entwicklung der morphologischen Methode im Lappischen. Finnisch- 
Ugrische Forschungen 37: 203—362. [Reprinted as "Die Entwicklung der mor- 
phologischen Technik im Lappischen' in: Tapani Salminen (ed.), 1996, Typo- 
logical and Historical Studies in Language by Mikko Korhonen. Helsinki: 
Suomalais-Ugrilainen Seura.] 
Korhonen, Mikko 
1992 Remarks on the structure and history of the Uralic case system. Journal de la 
Société Finno-Ougrienne 82: 163-180. [Reprinted in: Tapani Salminen (ed.), 
1996, Typological and Historical Studies in Language by Mikko Korhonen. 
Helsinki: Suomalais-Ugrilainen Seura.] 
Kulonen, Ulla-Maija 
1989 The Passive in Ob-Ugrian. Helsinki: Suomalais-Ugrilainen Seura. 
Miestamo, Matti 
2005 Standard Negation: The Negation of Declarative Verbal Main Clauses in a Ty- 
pological Perspective. Berlin/ New York: Mouton de Gruyter. 
Sammallahti, Pekka 
1988 Historical phonology of the Uralic languages with special reference to 
Samoyed, Ugric, and Permic. In: Denis Sinor (ed.), The Uralic Languages, 
478—554. Leiden/New York: Brill. 
Stassen, Leon 
2001 Nonverbal predication in the Circum-Baltic languages. In: Osten Dahl and 
Maria Koptjevskaja-Tamm (eds.), The Circum-Baltic Languages. Typology 
and Contact. 2: Grammar and Typology, 569—590. Amsterdam/Philadelphia: 
Benjamins. 
Tapani Salminen (ed.) 
1996 Typological and Historical Studies in Language by Mikko Korhonen. Helsinki: 
Suomalais-Ugrilainen Seura. 
Trosterud, Trond 
2006 Homonymy in the Uralic Two-Argument Agreement Paradigms. Helsinki: Suo- 
malais-Ugrilainen Seura. 
Turunen, Rigina 
2010 Nonverbal predication in Erzya: Studies on morphosyntactic variation and 
part of speech distinctions. (PhD thesis, University of Helsinki. http://urn.fi/ 
URN:ISBN:978-952-10-6277-3) 
Vilkuna, Maria 
1998 Word order in European Uralic. In: Anna Siewierska (ed.), Constituent Order 
in the Languages of Europe, 173—234. Berlin: Walter de Gruyter. 
Ylikoski, Jussi 
2003 Defining non-finites: action nominals, converbs and infinitives. SKY Journal of 
Linguistics 16: 185—237. (http://cc.oulu.fi/~jylikosk/defining.pdf, 25 Nov. 2008) 


9 Albanian 


Dalina Kallulli 


1. Introduction 

2. Phonetics and phonology 
3. Morphology 

3.1. The verb 

3.2. The noun 

3.3. Adjectives 

4. Syntax 


1. Introduction 


A separate branch of the Indo-European language family, the Albanian language 
is spoken by nearly 6 million people, primarily in Albania and Kosovo but also 
in other areas of the Balkans in which there is an Albanian population. It is 
also spoken in isolated locations of Southern Italy. There are two major dialectal 
varieties, Gheg (spoken in the north) and Tosk (spoken in the south), which differ 
from each other both with regard to phonological and morpho-syntactic proper- 
ties. Standard Albanian is based on Tosk. Two features distinguishing these two 
varieties from each other are rhotacism (Gheg n for Tosk r) and the existence 
of a true infinitive in Gheg (comparable in syntactic function and distribution 
to the infinitive in Germanic and Romance), which is different from the one in 
Tosk not only in form but also in function. Both of these properties set Gheg 
apart from the Balkan Sprachbund, despite the fact that most studies recognise 
Albanian as a central member of it (Tomié 2004, 2006; see also Tomié in this 
volume). The first written documents stem from 1462 (for Gheg) and 1500 
(for Tosk). The oldest known printed book in Albanian, Meshari (the so-called 
"The Missal’) by Gjon Buzuku, dates back to 1555. The modern Albanian alpha- 
bet is based on the Latin alphabet, with the addition of the letters &, c, and nine 
digraphs. 


2. Phonetics and phonology 


Albanian has an inventory of 36 distinctive sound units (i.e., phonemes), 7 vowels 
and 29 consonants. These are (grapheme first, phonetic symbol second): a [a], 5 
[b], c [ts]. c [tf], d [d], dh [ð], e [e], ë [9], f [f£]. g [g], gj L3, ^ [b], i [i], j Li), k [k], 7 
[1], 1 [£], m [m], n [n], nj [n]. o [o]. p [p], q [c], r [e], rr [r], s [s], sh Uf], # [t], th [0], 
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u [u], v [v], x [dz], xh [d3], y [y]. z [z], zh [3]. (For a detailed classification of pho- 
nemes, see Newmark et al. 1982.) 

In addition to the simple vowels, Albanian has many words with vowel 
clusters. Of these, the clusters ie, ua, ye and ue are diphthongs. 

One peculiarity is the mid-central vowel ë, which is normally dropped at the 
end of the word, but can occur word-internally in stressed syllables. The pronun- 
ciation of stressed ë often reflects the regional origin of the speaker. It ranges from 
a nasalized, rounded, mid-back vowel (like in the French word bon) in Gheg- 
speaking regions, to an oral unrounded, low front vowel (like in the English word 
pat) in some southeast Tosk-speaking regions. As noted in Newmark et al. (1982), 
in Standard Albanian it is an oral unrounded (or very slightly rounded) mid-central 
vowel. Another peculiarity is the phonemic distinction between r [r] (voiced apical 
flap liquid) and rr [r] (voiced apical trill liquid), as in ruaj ‘I guard’ versus rruaj 
‘I shave’. 

In Albanian, uninflected word units (e.g. nominative indefinite nouns) have 
final stress if the final syllable is closed or ends in a non-reduced non-mid vowel, 
while stress falls on the penultimate syllable if the final syllable ends in a mid- 
vowel (Bevington 1974). However, there are exceptions like bukur (‘beautifully’), 
where the stress is on the first syllable. 


3. Morphology 


In terms of morphological type, Albanian is an analytical language (Demiraj 
1986). Adverbs, prepositions, conjunctions and interjections are non-inflecting. 
Nouns, pronouns, articles, adjectives and verbs are inflecting. There is also rich 
derivational morphology. 


3.1. The verb 


Albanian has developed an analytical verbal structure instead of the earlier syn- 
thetic system inherited from Proto-Indo-European. The finite verb forms are 
marked for voice, mood, tense, person and number. With the exception of the 
participle, the non-finite forms are marked only for voice. Thus voice is a major 
category of the Albanian verb system. Specifically, similar to Latin and Greek, AI- 
banian has two distinct conjugational paradigms, ACTIVE and NON-ACTIVE, corre- 
sponding roughly to the distinction unergative/transitive versus unaccusative, 
where the latter category includes passive, reflexive and deponent (Kallulli 1999, 
2006, 2007, 2009). In other words, while the active form of a verb such as thaj 
‘I dry’ means ‘I dry stuff’, its non-active form may have several meanings, includ- 
ing the following: ‘I dry (myself)’, ‘I am being dried (by someone else)’, ‘I am 
dryable’/‘I am able to dry’, ‘I dry (e.g. in the sun)’. (The citation form of the Al- 
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banian verb is its first person, singular number, active voice, indicative mood, 
(simple) present tense form.) For the purpose of illustration, the active and non-ac- 
tive paradigms for the verb thaj ‘I dry’ in present tense indicative mood are given 
in Table 1. (The segment -h- in the non-active paradigm of thaj is epenthetic and is 
inserted in order to prevent hiatus. It does not occur with verbs whose stem ends in 
a consonant, such as hap ‘I open’.) 


Table 1: The voice paradigms for the verb thaj ‘I dry’ in present tense indicative mood 


Active Non-active 
Ist Sg tha-j tha-h-em 
2nd Sg tha-n tha-h-esh 
3rd Sg tha-n tha-h-et 
Ist Pl tha-jmé tha-h-emi 
2nd Pl tha-ni tha-h-eni 
3rd Pl tha-jné tha-h-en 


The Albanian non-active paradigm is built by employing three distinct linguistic 
means with a well-defined distribution: (1) auxiliary selection (namely the Alban- 
ian counterpart of the verb ‘to be’) if the clause contains perfect tense; (ii) an 
inflectional affix if the clause contains a simple tense form other than the aorist, but 
not admirative, optative or imperative mood; and (iii) the reflexive clitic u (derived 
from the Indo-European sue, Demiraj 1986) in all other contexts. 

A crucial characteristic of the Albanian non-active paradigm is that it does not 
accept noun phrases that bear accusative case (Kallulli 1999). In other words, non- 
active verbs cannot combine with accusative objects, unlike for instance deponents 
in Latin or Greek. 

Many descriptive grammars of Albanian describe a few dozen conjugational 
classes within the active voice. This classification is based entirely on the morpho- 
phonemic properties of verbs and includes many subclasses in addition to excep- 
tions (cf. Newmark et al. 1982, Buchholz and Fiedler 1987). 

Albanian has a complex system of tenses and moods. There are five moods, 
namely indicative, subjunctive, imperative, optative and admirative.! Grammatical 
tense distinctions are made in the indicative, subjunctive, admirative and optative 
moods. Abstracting away from the future tenses as these are built on the basis of 
subjunctive forms, the indicative mood has three simple tenses (namely simple 
present, imperfect and aorist) and three complex or periphrastic tenses (namely 
present perfect, past perfect and pluperfect). Tenses also carry aspectual as well as 
modal meaning. For instance, the difference between the simple past tenses imper- 
fect and aorist is aspectual; the former is aspectually imperfective, whereas the 
latter is perfective (see Kallulli 1995). The particle po combined with simple pres- 
ent tense signals progressive aspect (e.g. po punoj ‘I am working’), but progressive 
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aspect can also be expressed covertly by the simple present and imperfect tenses, 
as well as periphrastically. The present perfect, past perfect and pluperfect tenses 
are formed by combining the inflected forms of the auxiliary verbs kam ‘I have’ 
(for the active conjugation) and jam ‘I am’ (for the non-active conjugation) either 
in the present or in the past tense (both imperfect and aorist) respectively, with 
the participle form of the main verb. The future tenses in Albanian are formed by 
combining the particle do, which historically speaking is the 3rd person singular 
indicative form of the verb dua ‘I want’, with the (relevant) tenses of the subjunc- 
tive.? There are two 'simple' future tenses (future and past future) and two peri- 
phrastic ones (future perfect and past future perfect). 

The subjunctive mood is characterized by the presence of the particle të, which 
precedes the verb stem, but may be split from it by clitics and the negation particle. 
The verb is also marked by special inflectional morphology. This however is the case 
only for the present tense, since in all other tenses the inflectional morphology is the 
same as for the indicative. There are four tenses of the subjunctive: two simple ones 
(present and imperfect) and two periphrastic ones (present perfect and past perfect). 

The admirative mood in Albanian is used to express a (speaker's) sense of sur- 
prise, doubt or dissent concerning an event that has taken place in the past or is tak- 
ing place at the moment of speaking. The admirative mood has the following tenses: 
simple present, imperfect, present perfect, past perfect, and future. The simple pres- 
ent tense of the admirative is a synthetic form created by compounding in inverted 
order the two formative elements of the present perfect tense. The admirative can 
occur in embedded clauses and is compatible with a factive proposition. Thus, the 
Albanian admirative is different from the evidential mood in some languages. 

The optative in Albanian is restricted to main clauses and expresses the modal- 
ity of desirability in the form of a wish or a curse. It has two tenses, simple present 
and present perfect. 

The imperative mood has forms only for the second person singular and plural. 
For the active conjugation, for all verbs the imperative plural has the same form as 
the second person plural of the present indicative. These two homonymous forms 
are distinguished from one another by the contexts in which they appear. Only 
a small number of verbs have the same form in the second person singular of the 
present indicative. 

The participle is the non-finite form of the verb in Albanian (that is, the verb 
form that does not reflect the person and number of its subject). As already men- 
tioned, one type of constructions using the participial form are the periphrastic 
(or compound) tenses. The second type includes the participle itself and those 
non-finite constructions using it as their base: these are the privatives (e.g. pa tharé 
‘without drying’), gerundives (e.g. duke tharé ‘drying’), infinitives (e.g. pér té 
tharé ‘to dry’), and absolutives (e.g. me té tharé *by drying, having dried’). 

As mentioned earlier, all non-finite forms except the participle have non-active 
forms as well. These are formed by the clitic u appearing immediately before the 
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verbal stem: pa u tharé ‘without drying oneself’, duke u tharé ‘drying’, pér t'u 
tharé ‘to dry oneself’, me t'u tharé ‘having dried oneself’. 


3.2. The noun 


The citation form of nouns is normally the nominative, singular indefinite form. As 
in other Balkan languages, the definite article in Albanian is a suffix on the noun. It 
is a so-called “portmanteau” morpheme; that is, the definite article simultaneously 
marks definiteness, case, gender and number. (The indefinite article, on the other 
hand, is prenominal, proclitic and non-declinable has the same form as the cardinal 
numeral ‘one’.) 

There are two main declensions, feminine and masculine, but there are also 
some nouns that appear in the neuter gender. Hence three genders can be distin- 
guished. The neuter gender is in fact productive in forming deverbal nouns (from 
participles). Neuter nouns never designate animate beings. Even though gender is 
not fully predictable from phonetic form, as a rule of thumb, most nouns whose ci- 
tation form ends in a consonant are masculine and most nouns whose citation form 
ends in a vowel are feminine (cf. Newmark et al. 1982). However, there are many 
exceptions, even to the exceptions. The definite article for the masculine nomi- 
native case singular declension has the form -u when the noun stem ends in k/g and 
-i in all other cases (e.g. shok-u ‘the friend’ vs. mal-i ‘the mountain’). The corre- 
sponding feminine gender has the form -a (e.g. vajz-a ‘the girl’). The majority of 
nouns in Albanian have the same gender for singular and plural. However, in cer- 
tain types of plurals a shift of gender from masculine to feminine takes place, as 
evidenced by the agreement marker on the adjective. 

Traditional grammars of Albanian distinguish five morphological cases, 
namely nominative, accusative, genitive, dative, and ablative, with the latter three 
being mostly syncretic — the equivalent of a genitive is formed by using the pre- 
posed articles i/e/té/sé with the dative form. There is, however, a special ablative 
form in the plural. In addition, the vocative case is still existent, although it has a 
very restricted use. It is signalled by the special pre-nominal (interjection) marker 
o (e.g. o cun ‘hey/you boy’), which is sometimes ‘doubled’ by an enclitic post- 
nominal segment (e.g. o cuno or o cuni). 


3.3. Adjectives 


Adjectives, including ordinal numerals, agree with the head noun in gender, 
number and case. So do other noun modifiers such as possessive noun phrases. Al- 
banian adjectives fall into two classes: those with and those without the so-called 
“adjectival article”. This is a preposed agreement marker (e.g. i/e miré ‘good’ vs. 
besnik ‘loyal’) which also accompanies genitives (e.g. libri i Anës ‘Anna’s book’, 
fletorja e Anës * Anna's exercise book"). Notably, the differences in the form of the 
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adjectival article do not just depend on the definiteness, gender, case and number of 
the noun that the adjective modifies, but also on immediate precedence (see New- 
mark et al. 1982). 


4. Syntax 


Albanian is a so-called "free word-order" language. That is, the order of major syn- 
tactic constituents such as subject, verb and object is relatively free both in matrix 
and subordinate clauses. In spite of this, SVO is traditionally recognised as the 
basic, or “unmarked”, word order in a declarative transitive sentence, both in ma- 
trix and subordinate contexts. Albanian is also a so-called “null-subject” language. 
There is free inversion in that, when overt, subjects may appear either pre-verbally 
or post-verbally, both in active and non-active constructions. In double-object con- 
structions either the indirect (dative) object precedes the direct object, or vice 
versa. Both orders are equally unmarked. Similarly, adverbials may occupy differ- 
ent positions in the clause. 

Albanian is a wh-fronting language, though there is also the option of wh-words 
remaining in situ. Albanian is not a multiple wh-fronting language though. Wh-el- 
ements are case-marked and partially also animacy-marked. Because of case-mark- 
ing, so-called superiority effects (e.g. Mary asked who read what vs. *Mary asked 
what who read) do not obtain. One peculiarity of wh-movement in Albanian is the 
possibility of wh-phrases to land in an ‘intermediate’ position. For instance, in (1c), 
the wh-phrase has ‘moved’ from the object position of the embedded verb, in which 
it occurs in (1a), to a more left-peripheral position within the embedded clause, thus 
contrasting also with (1b), where the wh-phrase occurs sentence-initially. 


(1) a. Ana tha se Beni bleu çfarë? 

Ana.the(nom) said that Ben.the(nom) bought what 
‘Ana said that Ben bought what?’ 

b. Çfarë tha Ana se bleu Beni? 
what(acc) said Ana.the(nom) that bought Ben.the(nom) 
‘What did Ana say (that) Ben bought?’ 

c. Ana tha se çfarë bleu Beni? 
Ana.the(nom) said that what bought Ben.the(nom) 
‘Ana said that Ben bought what?’ 


Albanian has an optional question particle for yes-no questions, as in (2), which 
also illustrates the phenomenon of clitic doubling:? 


(2) (A) *(e) pe Janin? 


[+Q] cl,3s saw Jan.the(acc) 
‘Have you seen John?’ 
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In spite of the fact that Albanian is a free word order language, functional cat- 
egories such as negation, modal and aspectual particles, auxiliaries, clitics, tense 
and agreement, etc. are rigidly ordered (for details, see Kallulli 1995). As a general 
rule, tense and person agreement go with the first verb in a finite clause. Tense and 
person agreement are realized as a portmanteau morpheme suffixed to the verb. 

In Albanian, dative objects are invariably doubled by clitics. As is obvious 
from the example in (2), Albanian violates the so-called “Kayne’s Generalization” 
(Jaeggli 1982), which informally stated, says that clitic doubling is possible when- 
ever a noun phrase is preceded by a preposition. In fact, prepositional objects may 
not be clitic doubled in Albanian. 

Both direct and indirect (i.e. dative) objects may be realized by way of a clitic. 
In other words, both the direct and the indirect object may be dropped in the pres- 
ence of a corresponding clitic, as illustrated in (3a) and (3b) below. There are no 
subject clitics. Except in imperatives, clitics always immediately precede the verb 
(Rivero 1994, Kallulli 1995). 


(3) a. (A) *(e) pe? 
[+Q] cl,3s saw 
“Have you seen him/her/it?’ 
b. U dhashé libra. 
them, par gave.I books 
‘I gave them books.’ 


In a clause containing a verb in the imperative mood, an unusual order of mor- 
phemes may obtain; the verb stem may precede all other morphemes, such as 
clitic(s) and inflection, as illustrated in (4a) and (4b). 


(4) a. Hap-e-ni (derén)! 
open-cl, „-2pl door.the, ec 
‘Open it (the door)!’ 
b. Hap-ni-m-a rrugen! 
open-2pl-cl, 1.156" Clscc Sen way.the,.. 


‘Get off my way! 


Both the ‘mesoclisis’ pattern illustrated in (4a), as well as the one in (4b), where 
the clitic follows the inflected verb, are interesting theoretically because they pose 
non-trivial questions about how these orders are derived. 

The canonical order that obtains between a noun and a modifying adjective, in- 
cluding ordinal numerals, is noun followed by adjective. However, the modifying 
adjective may precede the noun, in which case the enclitic definite determiner 
(when present) invariably attaches to the adjective, not to the noun, as it does under 
the canonical order. Demonstrative pronouns, cardinal numerals and quantifiers fol- 
low the noun. This also holds for quantifiers like secil-i/a (‘each’), which in Alban- 
ian always bear the enclitic definite determiner (e.g. secil-a vajzé ‘each-the girl’). 
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A rather productive phenomenon of Albanian syntax and its interfaces involves 
so-called non-selected dative arguments in a morphologically non-active (i.e. pass- 
1ve-like) construction (Kallulli 2006, 2007). For instance, a sentence like (5a), in 
which a dative argument combines with a non-active verb, yields among several 
possible interpretations, a so-called ‘dispositional’ reading, in which the dative ar- 
gument is interpreted as the experiencer of a certain disposition, or state. Crucially, 
under this reading no actuality entailments obtain. That is, the sentence is true even 
if no breaking event has taken place. In other words, the sentence is truly stative 
under this reading. This reading is however not available for the minimally differ- 
ent (5b), which differs from (5a) with respect to tense only; (5b) contains aorist 
whereas (5a) imperfect tense. Instead, (5b) has the flavour of a so-called 'adver- 
sative' passive familiar from languages like Japanese, a reading that is also poss- 
ible for (5a) albeit in the latter case only under a habituality construal of the event. 
So, crucially (5b) lacks the 'feel-like' reading of (5a). 


(5) a. Benit i thy-hej njé vazo. 
Ben.the,,r him, break-Nact,P,Imp3S a vaseyom 
‘Ben felt like breaking a vase’, or ‘Ben was in a vase-breaking mood’ 
b. Benit i-u thye njé vazo. 
Ben.the,,r himg-Nact break.Aor3S a vaseyom 
(i) ‘A vase broke on Ben.’ 
(ii) *'Ben felt like breaking a vase’ 


Interestingly, the pattern described for (5a) and (5b) does not replicate with all 
verbs. For instance, the sentences in (6a) and (6b) containing the verb ha ‘eat’ 
differ from each other morphologically in exactly the same way that (5a) and (5b) 
differ, namely only with respect to their tense specifications. That is, (6a) is mor- 
phologically identical to (5a), and (6b) is morphologically identical to (5b). Yet, 
the ‘feel-like’ reading is available both for (6a) and (6b). Crucially, unlike in (5a), 
no 'adversative' or unintentional causation reading is available for either of the 
sentences in (6). 


(6) a. Benit i ha-hej njé molle. 
Ben.they,; hima, eat-Nact,PImp,3S an appleyoy 
(i) ‘Ben felt like eating an apple’ 
(ii) **Ben unintentionally ate an apple’ 

b. Benit i-u héngér njé mollé. 

Ben.they,r him(-Nact ate.Aor3S an appleyoy 
(i) ‘Ben felt like eating an apple’ 
(ii)* ‘Ben unintentionally ate an apple’ 


Thus, it seems that the unintentional causation reading only obtains with verbs 
that denote events that are externally caused; so-called “externally caused” verbs 
(cf. Levin and Rappaport-Hovav 1995, chapter 3).^ In turn, such data highlight the 
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importance of systematic and detailed cross-linguistic comparative research in 
reaching insightful generalisations concerning the building blocks of syntactic and 
semantic computation. 

Some major reference works on Albanian including issues deserving further 
research are Bevington (1974), Newmark et al. (1982), Hubbard (1985), Buchholz 
and Fiedler (1987), Massey (1991). 


See also chapter 16 by Tomié in this volume. 


Notes 


1 Some traditional grammars of Albanian additionally recognise the so-called “condi- 
tional" and “lessive” (or “permissive”) moods. However, these do not involve special 
marking. 

2 Future time relations may also be instantiated through the use of so-called deontic kam 
‘Ihave’ combined with the (respective) infinitive form in both varieties (Gheg and Tosk); 
see e.g. Newmark et al. (1982). 

3 Clitic doubling is a phenomenon whereby a clitic pronoun ‘doubles’ (or ‘reduplicates’ ) 
a verbal argument inside the same clause, in the sense that the verbal argument and the 
clitic pronoun have the same referent, or are co-referent. On the obligatoriness of clitic 
doubling the direct object in contexts such as (2) and others, see Kallulli (2000). 

4 According to Levin and Rappaport-Hovav (1995), unlike so-called "internally caused" 
verbs (e.g. eat, laugh), externally caused verbs (e.g. break, melt, dry) imply the existence 
of an EXTERNAL cause with immediate control over bringing about the eventuality de- 
scribed by the verb; this can be an agent, an instrument, a natural force, or a circumstance. 
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Basque (Euskara or Euskera) is the only Pre-Indo-European language which has 
survived the Indo-European invasion of Europe. It is a language isolate, i.e. it is 
unrelated to other existing languages and is typologically very different from the 
surrounding language families. It is spoken by about 600,000 speakers (Aizpurua 
1995), called euskaldunak in Basque, in an area called Euskal Herria (“Basque 
Country’) along the Atlantic Ocean on both sides of the Pyrenees (cf. Map 1). 
The frontier between France and Spain divides the Basque Country into a North- 
Eastern and a South-Western part, commonly called North (Iparralde) and 
South (Hegoalde). Moreover, the South is subdivided into the autonomous regions 
(communities) Euskadi and Navarre (Nafarroa), the latter having a mostly Span- 
ish-speaking population. In spite of a certain autonomy, these regions belong to 
the kingdom of Spain, although the Basque people do not consider themselves to 
be Spanish. Almost all speakers of Basque are bilingual: in the South, Castilian 
Spanish and Navarro-Aragonese (a Romance dialect) are the contact languages; in 
the North the traditional Romance contact language is Gascon, but nowadays all 
speakers of this area speak French as well. 

The dialects of Basque differ very much from each other, but mutual com- 
prehensibility is granted by a standard variety, called Euskara Batua (‘unified 
Basque"). Standard Basque (as described in de Rijk 2008) will be the basis of this 
typological analysis, which is inspired by Ernst Lewy's language-immanent ap- 
proach to typology (Lewy 1931). 
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I Biscayen dialect 
II Gipuzkoan dialect 

III High Navarrese dialect 
IV  Lapurdian 

V Low Navarrese dialect 
VI Suletin dialect 

dotted line: dialect boundaries 
stroked line: frontier between France (F) and Spain (S) 


Figure 1: Map of the Basque country (Haase 1992) 


1. General classification 


Basque is a representative of the agglutinating type with a tendency towards fusion 
and a high degree of synthesis, but it is rather difficult to be classified other than 
morphologically. As a language isolate it cannot be affiliated to any known lan- 
guage family, despite the fact that since the 19th century many scholars have 
thought of an affiliation with the Caucasian language group, as Georgian has a 
similar case system and a comparatively complex verbal inflexion. In the mean- 
time, however, quite a number of unrelated languages have been found which have 
an ergative case system (cf. below) and a complex conjugation. In order to prove a 
genetic relationship, languages must show systematic grammatical and lexical 
similarities. Even hypotheses which are geographically motivated (highlighting 
the relationships between Basque and Iberian, as Wilhelm von Humboldt did in the 
19th century (Humboldt 1963), or between Basque and Berber) are falsified by the 
lack of such similarities (Trask 1995, 1997). Thus, Basque remains a language iso- 
late, with Aquitanian as a possible ancestral language (Gorrochategui 1995). 
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From a typological point of view Basque differs very much from the surround- 
ing Indo-European languages. Although language contact has brought about gram- 
matical changes, the language type has not been affected. As can be seen from the 
examples throughout this article, the morphology of Basque is agglutinating, with 
a tendency towards fusion. Formal synthesis is high. In the following sections, 
only the most important characteristics of Basque will be discussed. 


2. Remarks on phonology 


Basque has the vowel phonemes: a, e, i, o, u, which are pronounced as in Spanish. 
Souletin and some dialects of Navarre have a front round vowel /y/, which came up 
through contact with Gascon. The consonant system of Standard Basque is as fol- 
lows (graphemes are added in pointed brackets where they differ from the pho- 
nemic transcription): 


Table 1: The consonant system of Standard Basque 


p t (ti) <tt> k 
b d (dj) <dd> g 
(f) S <z> e «s» J «x h 
ts «tz» te «ts» tf «tx» 
m n (n) <fi> 
1 (A) «ll» 
ES 
r <rr> 
y/1/x <j> w <u> 


The plosives /p/, /t/, /k/ are slightly aspirated in most contexts. The pronunciation 
of the semi-vowel /j/ differs strongly between dialects (all variants are permitted in 
the standard). Some dialects partly neutralize the distinction between sibilants. 
Moreover, many speakers of Basque (especially in the South-West of the Basque 
Country) do not pronounce /h/, although it is phonologically relevant in the stan- 
dard. The bracketed palatal consonants appear phonemically only in forms of fam- 
iliar address (allocutive, cf. section 4.2), diminutives and hypocorisms; /f/ is li- 
mited to modern loan words (telefonoa ‘telephone’) and substituted by /p/ in older 
loan vocabulary (pagoa *beech tree' from Latin fagus). Phonemically, there are no 
voiced fricatives. 

Since the grammar of Basque has as its basic item not the lexical entity, but 
the phrase (as will be shown in section 3), it is not surprising that Basque lacks 
word stress. Word groups in focus position are stressed, but there is no fixed ac- 
cent position (‘floating accent’). Within the phrase, sandhi phenomena can be ob- 
served: 
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(1 Ez da. [esta] 
NEG be.3SG.PRS 
‘It is not.’ 
(2) Gusta-tze-n zait. [gustatsen tsajt] as if written: tzait 


please-VN-INE be.PRS.3SG.1SG.DAT 
‘It pleases me./I like it.’ 


3. Word order and focus 


The major constituents of a sentence (i.e. phrases) can be moved around freely, i.e. 
according to discourse-pragmatic considerations (“scrambling word order"). A 
highlighted noun phrase or adverbial phrase — often containing new information — 
is located before the verbal complex. In other words, Basque has a preverbal focus 
position. The unmarked situation is the following: 


(3) constituents: NP + NP + verbal complex 
information: given new 
topic comment 
focus: low high 


Here is a typical example (Lafitte 1979: § 117): 


(4) Aita-k untzi-a aurdiki du. 
father-ERG container-IDV throw:PTCP PRS.3SG<3SG 
‘Father has thrown the vase.’ 


If *father' is emphasized, e.g. when answering the question *Who has thrown the 
vase?", the emphasized (inquired) element is preverbal: 


(5) Untzia aitak aurdiki du. 
‘Father has thrown the vase.’ 


In normal conversation untzia (‘vase’) would not be repeated in an answer to a 
question relating specifically to the vase. The explicit mentioning of untzia is more 
common if the topic has to be fixed, e.g. when contrasting different topics (‘As far 
as the vase is concerned, it was thrown by father’). In a less marked way, however, 
the topic can be mentioned in an afterthought position: 


(6) Aitak aurdiki du untzia. 
‘Father has thrown it — the vase.’ 


The preverbal focus position must always be filled, even in thetical clauses, i.e. in 
clauses which do not have a bipartite (given — new, topic — comment) information 
structure: 
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(7) Ba-da bala. 
ENC-PRS.3SG ball 
"There was a ball.’ 


The dummy focus element ba- in (7) is called enunciative and etymologically 
related to bai ‘yes’. It is used in sentences which contain only a synthetically con- 
structed verb, such as the one in (8): 


(8) Ba-dakit. 
ENC-know.PRS.3SG<1SG 
‘I know.’ 


The focus position of Basque thus appears to be grammaticalized, whereas in 
many subject-prominent languages, i.e. languages with an obligatory subject 
position and reflexive constructions (cf. Sasse 1982 and Wehr 1995), focus is 
achieved by extracting the emphasized constituent (topicalization, dislocation). 
Basque can be called a focus-prominent language; it does not have a grammati- 
calized subject. 

Phrase-internal word order is very rigid, as illustrated in (9) to (14) below: 


adjectives (including demonstrative adjectives) follow nouns: 
(9) etxe haundi hori 

house big DEM, 

‘that big house’ 


nouns (in genitive or possessive case) precede nouns: 
(10) aita-ren etxe-a 

father-POSS house-IDV 

‘father’s house’ 


relative clauses precede the relativized noun: 


(11) aita-k erosi du-en etxe-a 
father-ERG buy.PTCP PRS.3SG«3SG- house-IDV 
SUB 


‘the house father has bought’ 


Basque has postpositions (no prepositions): 
(12) iri barne-ko etxe-a 

town inside-DEL house-IDV 

‘the house inside town (i.e. in the town center)’ 


in affirmative sentences, the finite (auxiliary) verb follows the non-finite 
verb: 
(13) Ikusi dut. 
see.PTCP PRS.3SG<1SG 
‘I have seen him/her.’ 
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in negative clauses, the order within the verbal complex is inverted: 
(14) Ez dut ikusi. 

NEG PRS.3SG<1SG see.PTCP 

‘I haven't seen him/her.’ 


4. Syntagmatic inflexion 


The noun phrase always ends in an inflected form, i.e. the basis of inflexion is not 
the word, but the phrase (phrasal or syntagmatic inflexion): 


(15) a. etxe-ak ‘houses’ 
b. etxe haundi-ak ‘big houses’ 
c. etxe haundi haiek ‘those big houses’ 


Syntagmatic relations are not marked by agreement, but are implied by the lack of 
an ending. The importance of the phrase is also reflected by the accentual system of 
Basque: not the word, but the phrase is its basic unit, i.e. the accentual emphasis in- 
creases towards the end of the phrase. 

In the verbal complex, the inflected auxiliary (which contains the inflexion for 
person, number, tense, aspect and mood) follows a non-finite verb form (e.g. the 
participle, as in (16)). The verbal complex can be modalized by the insertion of an 
operator before the inflected auxiliary, as in (16b and c); modalization is not 
achieved by special modal auxiliaries like can, may, must in English: 


(16) a. Jakin dut. 

know.PTCP PRS.3SG<1SG 
‘I have known (it/him/her).' 

b. Jakin nahi dut. 
know.PTCP VOLITIVE PRS.3SG<1SG 
‘I want to know (it/him/her).’ 

c. Jakin behar dut. 
know.PTCP OBLIGATIVE PRS.3SG<1SG 
‘I must know (it/him/her).* 


As shown in (14) above, the order is inverted in negative clauses: 


(17) Ez dut Jakin. 
NEG PRS.3SG<1SG know.PTCP 
‘I haven't known (it/him/her).’ 
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4.1. Ergative case inflexion 


Table 2 shows the case system of Basque. On formal grounds grammatical and 
local cases can be distinguished. Grammatical cases (in the upper part of the table) 
mark grammatical relations, local cases (in the lower part of the table) are used for 
spatial orientation, similar to postpositions, with which they are formally and ety- 
mologically related. 


Table 2: The Basque case system 


Case Transnumeral Singular Plural 
Absolutive -a -ak 
Ergative -(e)k -a-k -e-k 

Dative OH -a-ri -e-i/-e-r 
Genitive -(r)en -a-ren -en 
Comitative -(r)ekin -a-rekin -ekin 
Partitive -(r)ik — — 

Prolative -(e)tzat -a-ren-tzat -en-tzat 
Instrumental -(e)z -a-Z -e-z/-eta-z 
Inessive -(e)ta-n -(e)an -eta-n 
Delimitative -(e)ta-ko -(e)ko -eta-ko 
Ablative -(e)ta-tik -(e)tik -eta-tik 
Directive -(e)ta-ra(t) -(e)ra(t) -eta-ra(t) 
Terminative -(e)ta-ra(d)ino -(e)ra(d)ino -eta-ra(d)ino 
Destinative -(e)ta-ru/antz -(e)ru/antz -eta-ru/antz 


Basque does not only distinguish between plural and singular, the latter being 
marked by a special morphological ending - the so-called singulative or individ- 
ualizer. It has special transnumeral forms (unmarked in grammatical cases, marked 
in local cases) and is used in the following environments: 


a. with indefinite or interrogative pronouns/adjectives: 
(18) zenbat etxe? ‘how many houses?’ 


In this environment, the question does not imply a number, which is why the nu- 
meral distinction is neutralized and the transnumeral is used here. 


b. with numerals: 
(19) hiru etxe ‘three houses’ 


The plural is implied by the numeral already. The plural can be marked if the 
numeral doesn't contain any new information, but functions as an appositive con- 
stituent: 


(20) hiru etxe-ak ‘the three houses (I mentioned before)’ 
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c. With partitive case: 
(21) Ez da etxe-rik. ‘There is no house.’ or: etxe-rik gabe ‘without house(s)' 


Here again, the numeral distinction is neutralized: There is no reference to the ab- 
sence of a specific number of houses (in the ‘without’ construction this becomes 
obvious even in the English translation). 


d. with instrumental case: 
(22) oine-z ‘on foot’ 


Here only the general concept of the instrument is implied, not a specific number. 
There is no distinction of gender in Basque; with the exception of the allocutive, 
where special forms are used for addressing men or women (cf. (28) in section 4.2 
below), gender has not been grammaticalized in Basque. 

Basque has an ergative case system. Languages with a nominative-accusative 
system treat the only participant of an intransitive clause like the agent of a transi- 
tive clause, leaving both in the typically unmarked nominative (both are con- 
sidered "subjects" of the clauses), whereas the patient of the transitive sentence 
(“object”) is marked by accusative case. Ergative systems work the other way 
round: The agent of a transitive clause gets special ergative case marking, whereas 
the patient remains unmarked (absolutive case) just like the only participant of an 
intransitive clause. 


nominative-accusative system 
transitive clause agent (NOM) patient (ACC) 
intransitive clause only participant (NOM) 


ergative-absolutive system 
transitive clause agent (ERG) patient (ABS) 
intransitive clause only participant (ABS) 


Compare the following examples: 


(23) Txori-a | zuhaitz-ean dago. 
bird-IDV  tree-INE be.3SG.PRS 
“The bird is sitting on the tree.’ 


In this intransitive sentence, the only participant is txoria ‘the bird’. In a corre- 
sponding transitive sentence, another participant can be added who does some- 
thing to ‘the bird’. In Basque this agent appears in ergative case, whereas the pa- 
tient ‘the bird’ remains unchanged (txoria): 


(24) Gatu-a-k txori-a | arrapa-tze-n du. 
cat-IDV-ERG bird-IDV catch-VN-INE PRS.3SG<3SG 
"Ihe cat is catching the bird.’ 
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4.2. Polypersonal verbal inflexion 


The personal conjugation of Basque consists of pronominal personal affixes which 
form the conjugated verb together with a lexically meaningful element: 


(25) Ba- d- "XE E 
ENC- ABS.3.PRS- -PRS- -know- -ERG.1SG 
INDICATIVE root 


‘I know (it/him/her).’ 


Only a small class of verbs can be inflected directly, i.e. they can have personal 
affixes. Here is a list of these so-called synthetical verbs in modern Basque: 


izan (intransitive conjugation auxiliary, ‘to be’) 
ukan (transitive conjugation auxiliary, ‘to have’) 
egon ‘to stay, to be’ 

eduki ‘to hold, to have’ 

Jakin ‘to know’ 

ekarri ‘to bring’ 

eraman ‘to take (to)’ 

Joan ‘to go (directed)’ 

ibili ‘to go (undirected), to wander’ 

etorri ‘to come’ 

etzan ‘to lie’ 

eritzi ‘to think’ 

irudi ‘to seem’ (third person only) 

(jaugin ‘to come’, in the imperative only) 

(very rare: ikusi ‘to see’, erabili ‘to conduct, use’) 


In comparison with early records (16th century) the number of synthetically con- 
strued verbs has significantly decreased. 

All other verbs are used in analytical constructions according to the scheme in 
(26) and exemplified in (27): 


(26) infinitival form (optional operator) conjugation auxiliary 


(27) ema(i)-te-n d- -io- -t. 
give-VN-INE ABS.3.PRS- -DAT.3SG- -ERG.1SG 
[infinitival form] INDICATIVE 
‘I am giving it to him/her.’ 


In (27) note that the verb form in the interlinear morphemic translation has been 
morphologically analysed. Due to its high degree of synthesis and a tendency 
towards fusion, this can only be a tentative analysis, though. That is the reason why 
such a close analysis is not given in other examples. A hyphen before and after a 
morpheme indicates that it can only be found within a verb form: 
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In (27) the conjugation auxiliary contains personal affixes for three partici- 
pants. However, it is even possible to find a fourth personal affix in such construc- 
tions for reference to the hearer: 


(28) Ema(i)-te-n z- -io- -na- -t. 
give-VN-INE ABS.3.FAM- -DAT.3SG- -FAM.F- -ERG.1SG(PRS) 
‘I give it to him/her, woman.’ 


In these familiar allocutive forms, the sex of the person spoken to is marked. Such 
familiar forms vary quite a lot between dialects (and even within them). They are not 
standardized. The verb form in (28) is typical of Navarre (Bonaparte 1991: I, 437). 

Even if reflexives are excluded, the combination of up to four person markers 
in two tenses and four moods leaves us with a huge amount of combinations (about 
3000 verb forms). More temporal categories (as well as modal and aspectual ones) 
are formed by adding case markers to the infinitival form. A detailed account is 
given in Haase (1994). 


5. Word classes 


Lexical entities can be attributed to word classes only according to the syntactic 
(syntagmatic) context in which they appear. 


(29) Ezti da. 
honey | be.3SG.PRS 
‘It is sweet.’ lit. ‘It honeys.’ 


In (29) ezti which means ‘honey’ or ‘sweet’, seems to be an adjective or an intran- 
sitive verb (‘to honey, to sweet’). In (30) it is a noun (‘honey’): 


(30) Ezti-a gusta-tzen zait. 
honey-IDV  please.PRS3SG.1SG.DAT 
‘I like honey.’ 


The distinction between verbs and adjectives is purely morphological: 


(31) Eri da. 
ill be.3SG.PRS 
*(S)he is ill.’ 


(32) Eri-a da. 
ill-IDV — be.3SG.PRS 
‘(S)he is invalid/sickish.’ 


In (31) eri functions as a verb (‘to be ill’), whereas in (32) eri comes with the in- 
dividualizer -a which makes it more noun-like and denotes a permanent character- 
istic. 
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The previously mentioned verbal or modal operators are particularly difficult 
to categorize. Take, for example, the operators nahi and ohi. They can function as 
verbs, like nahi in (33): 


(33) nahi-ko du. 
want-FUT have.3SG<3SG.PRS 
*(S)he will want it.’ 


But with the individualizer, nahia means *wish' or *wanted', as in nere nahia *my 
wish’ or erantzun nahia ‘the wanted answer’. Likewise, ohi expresses habitual as- 
pect as in (34): 


(34) Euskara-z | mintza-tze-n ohi zen. 
Basque-INS speak-VN-INE HAB be.3SG.PRT 
‘(S)he used to speak (in) Basque.’ 


As an attributive adjective, ohi means ‘dead’ (e.g. in apez ohia ‘the dead priest’). 
This clearly shows how much word classes are determined by the syntagmatic con- 
text in Basque, which comes as no surprise in a language whose fundamental gram- 
matical unit is the phrase. 


6. Conclusion 


The key entity of Basque grammar is the phrase. Noun phrases can be moved 
around within the clause, depending on discourse-pragmatic considerations. The 
main focus is placed immediately before the verb. Informationally less important 
constituents can be added as afterthoughts following the verb, which is otherwise 
clause-final (for example in subordinate clauses). The syntax of Basque can there- 
fore be characterized as focus-prominent and phrase-based (syntagmatic). Within 
the phrase the constituent order is fixed. Case endings are positioned at the end of 
the noun phrase. Most verbs are inflected by special tense auxiliaries which contain 
person markers for up to four case roles (absolutive, ergative, dative and allocu- 
tive, polypersonal inflection). 
The major typological features of Basque are summarized in Table 3. 
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Table 3: — Typological features of Basque 


Structural features 


realisation in Basque 


Inflexion 


Gender 


Definiteness/Specificity 


Number 


Case 


Possessive 
Aspect 
Tense 


Mood 


Passive/Antipassive 


Adpositions 
Adverbs 
Adjectives 


Word Order 


Alignment 
Stress 


Phonemes 


syntagma-final, agglutinating; polypersonal verbal inflexion 


no gender or sex distinction, with the exception of the 
allocutive (familiar address) where forms are different when 
speaking to single male or female interlocutor 


singular and plural (as opposed to transnumeral) contain 
a marker for individualization which covers aspects of 
definiteness/specificity 


singular, plural, transnumeral 


grammatical and local case inflexion; concord between noun 
phrase and inflected verb (for primary grammatical cases 
only); non-split ergativity 


possessive case (sometimes called possessive genitive) 
imperfective vs. perfective 
present vs. preterite; analytical future 


morphological: indicative vs. subjunctive/imperative and 
factive (unmarked) vs. potential; analytical: conditional/ 
irrealis 


ergativity > no passive, but antipassive constructions 
marginally possible (within progressive or resultative 
analytical constructions) 


postpositions 
status of primary adverbs unclear, derived adverbs 


distinction between some verbal predicates and adjectives 
depends on context 


free scrambling clause constituents (depending on pragmatics, 
with preverbal focus, resulting in a tendency towards 
verb-final constructions), strict word order inside constituents: 
N Adj, Gen N, N Postp, N Dem, Num N and Rel N (postponed 
Rel are marginally possible) 


based on prominence (focus, animacy); topic continuity 
floating, no word stress 


aspirated plosives; no voiced fricatives; consonant clusters 
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1. Introduction 


Greek refers to the Standard Modern Greek variety, the official language of Greece, 
arguably the sole descendent of Classical Greek. Holton, Mackridge and Philip- 
paki-Warburton (1997), Joseph and Philippaki-Warburton (1987) and Mackridge 
(1985) probably constitute the most often cited descriptions of the language. A re- 
lated variety (Cypriot Greek) is the official language of the Republic of Cyprus; 
some smaller Greek speaking communities are also found in various parts of 
the world (mainly, USA, Australia and Germany). There have been attempts to de- 
scribe dialectal variation within Greece, the most notable distinctions being the 
one between mainland and island varieties and the one between the Northern and 
the Southern varieties. These mainly involve minor phonological and lexical dif- 
ferences and generally share the same grammatical core; however, grammatical 
differences do exist and some of the varieties have even been approached as dis- 
tinct languages (e.g. Pontic Greek in works such as Drettas 1997; cf. the relevant 
discussions in Trudgill 2003 and Joseph 2009). 

Compared to Classical Greek, the modern varieties appear to have lost a 
number of inflectional categories and / or distinctions (e.g. the optative mood, the 
dative case and the dual number, among others) as well as various distinctions in 
the sound system (e.g. the diphthongs, the length contrasts, the tones, not to forget 
a number of rearrangements within the phonological system). However, such com- 
parisons are not historically accurate, nor even relevant, as a large number of 
intermediate stages succeeded Classical Greek before the modern language was 
formed (cf. Joseph 1987 and Horrocks 1997). 
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Indeed, the relation between the various historical stages and the high status 
universally attributed to Classical Greek were probably the fundamental causes of 
the diglossic situation, which officially ended in the latter decades of the 20th cen- 
tury, after serving as a textbook case of diglossia. In terms of the split between the 
High (‘katharevousa’) and Low (‘demotic’) varieties, Standard Modern Greek is 
now a mixed variety mainly based on Demotic, with many features of Kathare- 
vousa (see Moschonas 2004 for a number of issues concerning language ideology 
in present-day Greece). 

The Greek alphabet is used (and lexical stress is marked on stressed vowels); 
there are also some idiosyncratic punctuation conventions (e.g. the semi-colon is 
used as a question mark, and a raised full stop is used as a semi-colon). Various 
forms of transliteration into Roman script are used especially by younger Greeks in 
situations of fast, informal writing (sms, chat, web-based communication) and also 
whenever necessary (e.g. in transactions with foreign officials or machines which 
can only accept Roman script). 


2. Phonetics and phonology 


Greek has an inventory of five vowels (/i/, /e/, /a/, /o/, /u/) and a number of conson- 
antal segments, the phonemic status of which has been debated (the number of 
phonemes ranges from 15 to over 25, depending on the analysis). Probably most 
analysts would agree on a number around 22 (though Holton, Mackridge and Phil- 
ippaki- Warburton (1997: 3) list 15 consonantal phonemes only). For example, the 
status of the voiced stops could be reduced to predictable allophonic realizations of 
[N + (voiceless) stop] as far as native Greek vocabulary is concerned (but would 
have to be different in the case of recent or not-recent borrowings; cf. the dis- 
cussion in Malikouti-Drachman 2001). 

Sounds like [f] and [3] do not occur in the standard variety, but are very com- 
mon products of palatalizations before front vowels in many dialects. Similarly 
there are no geminates apart from island dialects, including Cypriot Greek; the use 
of glides ([j] and [w]) has also been reported in some northern dialects. The velar 
nasal is present, but it is never contrastive, it is always phonologically predictable, 
and never appears in word initial position. Nasal place-assimilation is very com- 
mon, as are various palatalization processes. The following is a quite regular se- 
quence of phonological processes, available for all nasal+/p, t, k/ sequences (see, 
e.g., Efstathiadis 1974). A nasal first assimilates to the following voiceless stop, 
then that stop assimilates in terms of voicing and the original nasal may be 
dropped. Any one of the following sound sequences may be actually used (depend- 
ing on situational factors). Indeed the four forms below arguably constitute a con- 
tinuum from the most careful / formal pronunciation to the least careful and most 
informal one: 
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(1) mv TÓTA 
tin porta 
the.F.acc door. F.ACC 
tin porta > tim porta > tim borta > ti borta 


There is no fixed stress location. Stress is lexical and therefore unpredictable. 
There is room for a 'three-syllable rule', though this has been questioned (cf. 
Joseph 1987). Being lexical, stress can be distinctive, e.g. /'jeros anOropos/ (‘old 
man") contrasts with /je'ros anOropos/ (“strong man’). There are no tone distinc- 
tions. Rhythm is often said to be trochaic, but there are good reasons for consider- 
ing it mixed. Syllable structure is rather complex, especially in the onset position. 
Arvaniti (2007) discusses a number of these and related issues. 

Normally words end in a vowel; in the case of native vocabulary only /s/, /n/ 
(and perhaps /r/) are possible consonants in word-final position. Words of foreign 
origin may of course end in other consonants, but these often get regularized and 
domesticated. 


3. Morphology 


Typologically, Greek is quite mixed, having inherited a number of complex inflec- 
tional features from its source systems, but also having developed a number of quite 
analytical features throughout its history (some of them, arguably, as the result of 
contact within the Balkan Sprachbund — cf. Joseph 1983). However, its rich mor- 
phology clearly places it closer to the inflectional end of any relevant continuum. 

Inflectional morphemes are mostly suffixes which generally involve a ‘port- 
manteau' realization of the various categories involved. A circumfix can be seen to 
be involved in the formation of the active past, but there are interesting morpho- 
phonological complications as well; see, e.g., Ralli (2003). 

There are no infixes, though there are various vowel and consonant changes 
(which encode either productive or fossilized processes). There are many word- 
formation processes and types of compounding (though polysynthesis is at best ex- 
ceptional). 

Most parts of speech inflect; nouns and adjectives, articles, numerals, demon- 
stratives, and pronouns inflect for gender, number and case; verbs, auxiliary and 
modal verbs inflect for tense, aspect, mood, voice, person and number (and may 
belong to one of three verb-classes / conjugations); adverbs, prepositions, conjunc- 
tions, particles and interjections are generally invariant (though most adverbs can 
inflect for comparative and superlative. In fact, inflectional comparative and super- 
lative forms (available for adjectives and adverbs) are generally interchangeable 
with corresponding periphrastic formations, e.g.: orea ‘nice’, oreotera ‘nicer’, 
oreotata ‘nicest’ (or pco orea ‘nicer’, poli orea ‘nicest’ ). 
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It is very hard to identify monomorphemic words in Greek (mainly some func- 
tion words and some foreign borrowings). Most words involve at least one inflec- 
tional suffix added on to a root, which may also be a complex of a stem and some 
derivational element. 

There is no systematic morphological reduplication — only in relics from earlier 
stages in cases like ÓóeÓouéva (data < 'give-PARTIC.N.PL'), yeyovota (‘facts’ 
< ‘happen-PARTIC.N.PL’). However, there are various cases of whole word redupli- 
cation, some of which are quite productive, e.g. in the case of distributive nu- 
merals, as in (2): 


(2) Avéßnxe Tota vota Ta oxaMá 
anevice tria tria ta skalia 
climb-PsT.PFv.3s three-N-PL three-N-PL the-N-PL stairs-N-PL 
‘He climbed the stairs three by three’ 


3.1. Inflectional morphology 


3.1.1. The verb 


Verbal morphology is centred around the basic perfective / imperfective distinc- 
tion. All verbs (with very few exceptions, such as prototypical stative verbs) 
involve a simple imperfective stem and a perfective counterpart (often marked by 
affixation of {-s} onto the imperfective stem, but also marked in various other 
ways, including sound changes, suppletion and, very rarely, zero-affixation). Both 
of these stems can then inflect for practically any tense, mood, voice, person (see, 
e.g. Joseph 1983: 80), using more or less the same sets of affixes, thus leading to a 
highly symmetrical system (NB: the English glosses are only meant as approxi- 
mate interpretations of the Greek data in (3)-(7)): 


(3) yoad-w / vocnp-o 
yraf-o / yraps-o 
write-IPFV.NPST.1s / write-PFV.NPST.1S 
‘I write (or I am writing)’ / ‘I write’ (in subordination only) 


(4) &yoad-a./ é-yoonj-o 
e-yraf-a / e-yraps-a 
write-IPFV.PsT.ls / write-PrFv.PsT.1s 
‘I was writing (or I used to write)’ / “I wrote’ 


(5) yoad-e / yoayp-e 
yraf-e / yraps-e 
write-IPFV.IMP.2s / write-PFV.IMP.2s 
“Write! (or Keep writing!)’ / 'Write" 
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Moreover, a periphrastic perfect construction is formed using the auxiliary exo 
(‘have’) and the ‘perfect formant’, which is formally identical with a perfective 3rd 
singular, active or non-active. The perfect construction itself can inflect for past, fu- 
ture, past future, subjunctive, and past subjunctive in six persons. These are all 
marked on the auxiliary — whereas the perfect formant remains unmarked (and has 
thus been variously described as an ‘infinitive’, a ‘participle’ or a ‘non-finite’ form). 


(6) (8a/ va) exo youwet/ yoa pret 
(0a / na) exo vrapsi / yrafti 
(FUT / sBjV) have-1s write-PFV.ACT.3s / write-PFV.NACT.3s 
‘I will have written’ / ‘I will have been written’ 
*(that) I have written’ / ‘(that) I have been written’ 


(7) (8a/ va) eiya voanper / yoadtei 
(0a / na) ixa vrapsi / yrafti 
(FUT / sBjV) have-PsT.ls write-PFV.ACT.3s / write-PFV.NACT.3S 
‘I would have written’ / ‘I would have been written’ 
*(that) I had written’ / ‘(that) I had been written’ 


Alternative perfect formations based on the auxiliaries ‘be’ or ‘have’ and the non- 
active participle are also available, mostly used as a perfect of result (e.g. ine 
yrameno ‘it is written / it has been written’ or exo yrameno ‘I have written’). 


Subject agreement is marked on all verb forms (other than the perfect formant, par- 
ticiples and gerunds); in effect there are no infinitives of the type familiar in the Ro- 
mance and Germanic languages (or Latin and Classical Greek for that matter). The 
loss of the infinitive is a well-established feature of the Balkan Sprachbund (e.g. Jo- 
seph 1983). Thus, for example, there is agreement on all verbs in the following: 


(8) Oëim va düyaıs 
Belo na Dis 
want-1s that go-2s 
‘I want you to go’ 


The active participle has been lost as a nominal element and only survives as a kind 
of invariable adverbial / gerundive form (cf. Tsimpli 2000 on the synchronic facts 
and Manolessou 2005 on the development of ‘gerunds’ out of participles). Non- 
active participles are still available, fully inflecting for aspect, gender, number and 
case. 

The only tense distinction which is marked morphologically is the [+/-past] 
distinction (available for perfective, imperfective and perfect). There is a standard 
periphrastic future marked by the particle 0a (of volitional origin — see Tsangalidis 
1999), also available for all aspectually distinct forms, and various other future- 
oriented periphrases. 
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A morphological imperative (which does not negate) can be identified for both 
perfective and imperfective in both 2nd singular and plural (in the general case; 
however, 2nd singular imperfective non-active is very rare and 2nd plural imper- 
fective active and non-active forms are not distinct from their indicative counter- 
parts). Moreover, a quite distinct prohibitive can be identified (which formally re- 
sembles the subjunctive rather than the imperative). 

In the general case, person, number, tense, voice and mood do not involve 
suppletive forms at all (though different classes of verbs use different sets of af- 
fixes). Only perfective stem formation can be seen to be unpredictable. The voice 
distinction is generally available and there are many verbs which appear in active 
and non-active forms. The latter can be interpreted as a true passive, but also as re- 
flexive or reciprocal depending on a number of factors (e.g. Tsimpli 2006). There 
are, however, many deponent verbs (which inflect according to the non-active 
paradigm but have active meanings — and, of course, do not contrast with any ac- 
tive form). 

At the morphological level the verb will be either imperative or non-impera- 
tive. The non-imperative forms can further combine with modal particles. These 
constitute the periphrastic future, the periphrastic subjunctive (with na) and a peri- 
phrastic hortative / optative construction (with as). The simple non-imperative 
forms and the future formations share the same negation marker (den, at the left- 
most position in the verb group), whereas na and as forms use another negator 
(min, immediately after the modal particle). This has formed the basis of the most 
widespread analysis of the former forms as ‘indicative’ and the latter as ‘subjunc- 
tive’ (see 4.1 below). All these periphrases involve various modal interpretations; 
however, none of the periphrases is a dedicated marker of either epistemic modal- 
ity or evidentiality. 


3.1.2. The noun 


Nouns, pronouns, articles, adjectives, quantifiers, demonstratives as well as most 
numerals inflect in more or less the same manner for gender and number and can all 
be grouped to one of many distinct noun classes (though these are not always easily 
identifiable). Gender-number-case and noun class appear as portmanteau suffixes 
and it is very difficult to make generalizations without exceptions. For example, 
most, though not all, masculine nouns end in [s] in nominative singular; con- 
versely, most, but not all, nominative singular nouns ending in [s] are masculine, 
and so on (see Ralli 2003). The definite article does not inflect for vocative, but 
may accompany any noun (including proper names) in all other cases (nominative, 
genitive, accusative) and in all three genders. 

There are three genders, which may be argued to constitute a sex-based system, 
to the extent that male and female entities will be normally described as masculine 
or feminine. This, however, does not account for arbitrary gender assignment in the 
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case of nouns corresponding to sex-less entities (e.g., “winter’ is masculine, 
‘spring’ is feminine, ‘summer’ and ‘autumn’ are neuter; the word for ‘week’ is 
feminine, a ‘fortnight’ is neuter, a ‘month’ is masculine). Moreover, there are cases 
like to koritsi (the girl), to ayori (the boy) etc. which appear in the neuter, despite 
their unambiguous reference to female and male entities (see Anastasiadi-Symeo- 
nidi, Ralli and Chila-Markopoulou 2005). 

Case affixes are always suffixes, and they are at most four distinct forms in 
each number (nominative, genitive, accusative, vocative), at least in some of the 
masculine and feminine singular paradigms (though there is some syncretism in 
most masculine and feminine cases and in all neuter cases: neuter nouns only dis- 
tinguish between genitive and non-genitive forms). 


Table 1: Nominal cases in the singular: some examples of the inflectional paradigm 


good-M good-F good-n ` Man-M woman-F child-N 
nominative 40). 0G xah Karo ávioac yvvaixa tout 
genitive «0).00 rang  xañoŭv Oto v'ovatxac TLALOLOV 
accusative Karo Ram Karo óáv1oa yuvatxa tout 
vocative «Q0. € xah Karo óáviQa yuvatxa tout 


3:2: Derivational morphology 


Borrowings and acronyms are usually regularized and incorporated into the mor- 
phophonological system. There are many derivational affıxes (more suffixes than 
prefixes) which are used in various word formation processes. Nouns, verbs, ad- 
jectives and adverbs may all be the result of productive derivational processes from 
various sources. There are also various morphophonological processes (usually de- 
scribed as ablaut) which may also relate two lexemes. Unsurprisingly, itis not easy 
to identify all processes that are fully productive synchronically. 

Various types of compounding are available, including some processes that 
seem to resemble Adv and N ‘incorporation’, in that they involve a certain amount 
of regularity; clearly however, these cannot be considered syntactic processes 
(Smirniotopoulos and Joseph 1998). P incorporation may also be seen in cases 
which apparently involve items that have descended from Classical Greek preposi- 
tions, but whose status in the present stage of the language is not always clear. The 
status of neo-classical compounds (such as avOowmsodoyia ‘anthropology’) is also 
usually debated in that some of them have always been part of the language 
whereas others were clearly borrowed from other languages (or were modelled on 
them) in recent times. 
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4. Syntax 


Greek is considered a basically VSO language with an alternative SVO order (e.g. 
Philippaki-Warburton 1985; cf. Alexiadou and Anagnostopoulou 2000); however, 
virtually any order is possible, depending on information structure. Indeed there 
are not many adjacency requirements (strict adjacency is rather exceptional, and 
mainly concerns clitics and clitic-like elements which are the exponents of particu- 
lar grammatical categories — see below). The preferred relative order of V and O is 
VO but again this may be different — and may, anyway, be interrupted by various 
other categories. 

Greek is a ‘pro-drop’ language and overt pronouns normally occur in marked 
topic/focus positions only. It is a nominative-accusative language (of the standard 
type, for both nouns and pronouns) and pronominal subjects appear in the verbal 
ending (whereas the pronominal special clitics which, arguably, indicate object 
agreement, precede the verb in the general case). 

Word order is relatively free as far as the order of major constituents is con- 
cerned. Lexical categories may appear in various configurations, and only 'func- 
tional heads' are subject to strict ordering requirements (e.g. prepositions and ar- 
ticles have to precede NPs and nouns, respectively, and complementizers and 
clause typing conjunctions normally appear in sentence-initial positions). Indeed 
there are more complex rules concerning the internal structure of the VP and the 
NP, as will be discussed immediately below. 


4.1. The verb phrase 


On the basis of the two distinct negation possibilities, two major types of verbal 
formations can be identified, namely the ‘indicative’ and the ‘subjunctive’ pattern, 
respectively (e.g. in Veloudis and Philippaki-Warburton 1983). As shown in the 
examples below, the presence of all clitics is optional. However, if any of them do 
appear, they will have to occupy particular slots — all of them preverbal (in the case 
of finite verb forms) and strictly ordered: 


Realis/indicative: 
(9) (dev) (8a) (vov) (10) EEL 
den 0a tu to lei 


(NEG1) (fut) (IndirectObject) (DirectObject) say-3s 
“S/he will not say it to him’ 


Irrealis/subjunctive: 
(10 Na (unv) (tov) (To) NEEL 
na min tu to lei 


SBJV (NEG2) (INDIRECTOBJECT) (DiRECTOBJECT) say-3s 
“Let him/her not say it to him / S/he should not say it to him’ 
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In (11) a related prohibitive is exemplified, which is usually analyzed as omission 
of the subjunctive marker (cf. 10): 


(11) Mm (tov) (tO) AEG 
Min (tu) (to) les 
(NEG2) (INDIRECTOBJECT) (DIRECTOBJECT) say-2s 
‘Don’t tell (him) (that)’ 


Imperative verbs do not negate at all and may only be followed by pronominal 
(en)clitics; pronominal clitics also follow gerundive forms, which, however, can 
be negated (using mi). In all cases, the strict ordering of all clitics (be they pro- 
nominals or markers of negation, future, subjunctive) within the verb group is 
obviously in marked contrast with the relative freedom in the case of lexical 
items. 

One obvious peculiarity involves the availability of two distinct negation 
markers, one dedicated to the indicative paradigm and one available in the sub- 
junctive. This latter form is also used in the prohibitive (which itself is regarded as 
a reduced subjunctive in most approaches) and the (adverbial) gerundive construc- 
tions. The imperative does not negate at all, and therefore the relevance of negation 
to the definition of different *moods' is empirically supported. 

Ditransitive constructions may be expressed through either a V NPgen NPacc 
construction or a V NPacc PP (or simply through the use of pronominal clitics). 
Again, the two lexical types appear to be in free variation, whereas there is strict 
ordering of the particular (genitive and accusative) pronominals in the case of the 
clitic construction (cf. Anagnostopoulou 2003). 


4.2. The noun phrase 


Gender and number agreement is required between major elements in a noun 
phrase (articles, adjectives, nouns, many numerals and most pronouns extending 
over noun classes) 


(12) o «aAóc domme 
o kalos  fititis 
“The good student’ (masculine singular) 


(13) n xa dovuto 
1 kali fititria 
“The good student’ (feminine singular) 


(14) ot xaXo( pontés 
i kali fitites 
"Ihe good students’ (masculine plural) 
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(15) ov xaXég potes 
i kales ` fititries 
“The good students’ (feminine plural) 


Within the NP the order is not strictly determined, especially in the case of indefi- 
nite NPs: 


(16) a. oe o X0).0G Potts 


ire o kalos fititis 

came the good student 
b. *node o owtyntys xoAóc 

*irde o fititis kalos 


*came the student good 
“The good student came’ 


VS 


(17) a. node vac xadrdc porns 


irðe enas kalos fititis 

came  a/one good student 
b. oe evas domme xoAóc 

ire enas fititis kalos 


came  a/one student good 
*A/One good student came’ 


There is also ‘determiner spreading’ in the case of the definite article, yielding pe- 
culiar structures like (18) and (19): 


(18) a. o xaddg o domme 


o  kalos o fititis 

the good the student 
b. o domme o  xaAóc 

o fititis o  kalos 


the student the good 
“The good student’ 


(19 a o xardg o EEumvog o domme 
o kalos o  eksipnos o fititis 
the good the intelligent the student 
b. o xaÀóg o domme o  é&umvog 
o kalos o  fititis o  eksipnos 
the good the student the intelligent 
“The good, intelligent student’ 
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Indeed, the definite article must be used together with demonstratives, again 
in various possible orders. Further to (20) all the alternatives in (21) are equally 
possible: 


(20) avtdg o  xaXóg domme 
aftos o kalos  fititis 
this the good student 
“This good student’ 


(21) avróc o xaAóc o domme 
AUTOS O ports o xaAóc 
O XAOS AUTOS o domme 
o POLTHTNS AUTOS o xaAóc 
O XAOS O POTTS AUTOS 
O POLTHTHS o xoAóc avtóc 


The order of genitive modifying a noun is generally noun-genitive, but this is also 
subject to information structuring restrictions. Similarly, the order of adjective and 
noun is predominantly A-N, but again the reverse is not ungrammatical. In the case 
of numerals, the unmarked order is numeral-noun, but even sequences like (22) are 
not uncommon: 


(22) o pontés o TEEL 
1 fitites 1 tris 
the students the three 
"The three students’ 


Normally a relative clause follows the noun it modifies, but various other combi- 
nations are possible. 


4.3. The simple clause 


As mentioned above, word order is relatively free. Various operations may be ar- 
gued to effect various dislocations (again of lexical categories only; functional cat- 
egories, Le modal particles, future and negation markers as well as pronominal 
clitics, maintain their designated positions): 


(23) Tov EWTNOA TOV Tiavvn, | óvav uov 
ton rotisa ton jani otan mu 
him-acc asked-1s the-acc John-Acc, when me-GEN 
TOV OVOTNOES 
ton sistises 


him-Acc introduced-2s 
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(24) Tov Tıavvn TOV EWTNOA, OTAV [LOU 
the-Acc John-Acc him-Acc asked-Is, when me-GEN 
TOV OVOTNOES 


him-Acc introduced-2s 


(25) Tov Tıavvn, OTAV uov TOV OVOTNOEG, 
the-Acc John-Acc, when me-GEN him-Acc introduced-2s, 
TOV Qotrnoo 


him-Acc asked-1s 


(26) Tov EWTNOR, 6Tav uov TOV OVOTHOES, 
him-Acc asked-Is, when me-GEN him-Acc introduced-2s, 
TOV Tıavvn 


the-acc John-Acc 
‘I asked John when you introduced him to me’ 


However, crucially, if any one of the pronominal clitics (such as wou or Tov in the 
examples above) changes place, the result will be ungrammatical. 

Interrogative intonation is the primary (and often the sole) indication that a sen- 
tence is interrogative (of the *yes-no' type). Given the freedom of word order, it is in- 
tonation that determines whether a V-S (or even S-V) sequence is affirmative or not: 


(27) ‘Eovye o Tıavvns / 
efije o janis / 
Leave-PsT.PFV.3s the-M.NoM.s John-Nom.s / Leave-PsT.PFV.3s 
'EQvys o Ttdvvys; 
efije o janis? 
the-M.NOM.S John-NoM.s? 

28) O Tıavvns ebuye / O Tıavvns 
the-M.NoM.s John-NoM.s leave-Psr.PFv.3s / the-M.NoM.s John-NoM.s 
ÉQvYys; 


leave-PsT.PFv.3s? 
‘John left / Did John leave?’ 


Constituent questions normally involve ‘wh-movement’, but they need not: 


(29) nov 0a To Boets; 
pu 0a to vris 
where fut itacc find-npast.2s 


(30) Oa to Boeg OU; 
0a to vris pu 
fut itacc find-npast.2s where 
“Where are you going to find it?’ 


Greek 235 


Similarly, ‘subject-verb inversion’ appears to be the norm, but there are also cases 
which suggest that it may be an optional feature. 


4.4. The complex clause 


Complement clauses are introduced by one of a set of complementizers, depending 
on the indicative / subjunctive distinction (cf. Roussou 2000) and normally follow 
the main clause. 

Adverbial clauses can be distinguished into those which can introduce a 
‘dependent’ verb form (signalling [+PFv][-psT]) and those which cannot. There is 
room for arguing for a distinction between ‘balanced’ and ‘deranked’ verb forms, 
as well as between clauses that behave more like main clauses and others that are 
*more subordinate'. 

The perfective non-past form (also known as the dependent verb form, after 
Holton, Mackridge and Philippaki-Warburton 1997) can only appear in some 
(though not all) subordinate clauses and, crucially, it cannot appear in a main 
clause (unless it is introduced by one of the modal particles — or the negator min 
in the prohibitive). Its inability to function as a main clause verb may be attributed 
to its historical development or to general semantic considerations (e.g. the contra- 
diction involved in combining the features [+pfv] and [-pst] — see, e.g., Tsangalidis 
1999) or both. Interestingly, this is matched by its unavailability in certain kinds 
of subordinate clauses which also seem to require assertion of their content 
(e.g. clauses introduced by because, while, and similar items). 

In the general case, it is always possible to alternate between a sequence of 
Main Clause-Subordinate Clause or Subordinate Clause-Main Clause with no sig- 
nificant grammaticality effect; in other words, it is only discourse considerations 
that determine the ordering of clauses. However, there are also restrictions on 
the use of particular connectives, as in the contrast between the two versions of 
‘because’; one of them, jati, necessarily introduces a clause that has to follow the 
main clause, while a clause introduced by epidi can appear either before or after the 
main clause (see Kitis 2006). 


5. Suggestions for further research 


The relationships between the various historical stages as well as the geographical 
varieties of Greek need to be investigated. It is not clear how much dissimilarity 
would be sufficient before one argues for separate language status (as, for example, 
one might suggest in cases like Pontic or Cappadocian). For a large number 
of cases in Standard Modern Greek, it is still not always clear what determines the 
norm, owing to the post-diglossic situation and despite attempts to provide com- 
prehensive grammatical descriptions (as in Clairis and Babiniotis 2005). 
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Theoretical questions also need to be addressed, to the extent that it is desir- 
able, for example, to settle on a definitive number of phonemes or to define ques- 
tion formation in terms of movement processes, intonation patterns, etc. The extent 
to which intonation can be the direct and sole exponent of a basic grammatical op- 
eration in a European language is another intriguing question. 

Similarly, the Greek facts provide great challenges for standard assumptions 
concerning the interaction of pragmatics and grammar, e.g. when considering the 
available interpretations of the gerundive constructions, the differences between 
various types of subordinate clauses, or in the available uses and interpretations of 


the perfective non-past. 


Structural features 


Greek (basically Standard Modern Greek) 


animacy distinctions 
gender distinctions 


formal expression of 
definiteness 
number 
singular/plural 
morphological case 


evidential 
aspect 


tense 


mood and modality 


passive 


agent phrase in passive 
construction 

possessive construction 

adpositions 

alignment 


no animacy distinctions 

three genders (M, F, N), not always based on natural sex, 
available in the case of nouns, pronouns, adjectives, articles, 
and participles 

yes, in the form of a definite article 


in the case of nouns, pronouns, adjectives, articles, participles 
and verbs 

nominative, accusative, genitive, vocative marked on nouns, 
pronouns, adjectives, articles and participles 

no formal expression 

inflectional [+/— perfective], extremely grammaticalised (un- 
available only in the case of prototypical stative verbs). 
moreover, have-based perfect periphrasis (and also be-based 
periphrasis restricted in form and function) 

inflectional [+/-past] 

various future periphrases (involving invariable particle Oa + 
any finite verb form) 

(a) inflectional [+/-imperative] mood 

(b) systematic marking of speaker-oriented, epistemic and 
subordinating modality through the use of modal particles 
(c) identifiable necessity and possibility modals (ambiguous 
between epistemic and non-epistemic interpretations) 
inflectional [+/-active] system, available in all tenses, aspects 
and moods. Ambiguous or vague between passive, middle, re- 
ciprocal and reflexive interpretations. 

yes, introduced by the preposition apo (‘from’) 


of the ‘have’ type 
mainly prepositions 
accusative 
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Structural features Greek (basically Standard Modern Greek) 
non-canonical subjects genitive quasi-subjects (lexically restricted) 
adjectives agree with their heads (in attribution and predicatively) in 


gender and number; 
analytical and synthetic comparatives coexist 

word order pragmatically determined SVO or VSO as basic word order 
(not excluding all other possible orders), usually Adj N (not 
excluding N Adj), N Gen (not excluding Gen N), Prep N (rare 
N Prep), Dem N (not excluding N Dem), Num N (rare N Num) 
and N Rel, fixed order of elements within the ‘verb group’ 


See also the following chapters in this volume: 15 by van der Auwera, 16 by 
Tomić, 18 by Sansó, 22 by Hill, 23 by Breu, 35 by Myhill and 41 by Johanson. 
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1. Introduction 
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2.2. Stress 

2:3. Syllable structure, phonotactics, morphonology 
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3.2. Word structure 

3.3. Nominal categories 

3.4. Verbal categories 

4. Syntax 

4.1. Word order type and alignment 

4.2. Order inside the nominal phrase 


4.3. Order inside the verbal phrase 


1. Introduction 


Maltese is the only Semitic language in Europe with the status of state language. 
It is also exceptional because it employs a Latin-based alphabetic script. The be- 
ginnings of Maltese are obscured by the lack of historical documentation. The lan- 
guage is the last surviving offspring of Siculo-Arabic transplanted from Sicily to 
Malta in the 11th century. This variety lost contact with Arabic in the 13th century, 
when the Muslims were expelled from the Norman Sicily (Brincat 2001). The 
oldest document in Maltese — the Cantilena by Pawlu Caxaro (11485) — dates back 
to the 15th century. Only from the 18th century onwards did Maltese come to be 
used more frequently in written form such that in the 19th century, a genuine Mal- 
tese literature took shape. The orthographic norm of standard Maltese was estab- 
lished by the Academy of Maltese in 1924. In the early 20th century, the Maltese 
language question escalated. To the detriment of the previously dominating Italian, 
Maltese became co-official with English in 1934 (Hull 1993). Since 2004, Maltese 
counts among the official working languages of the European Community. 

90 % of the 450.000 speakers of Maltese are resident on Malta and Gozo. The 
majority of the inhabitants of Malta are native speakers of Maltese. There is bilin- 
gualism with co-official English (the resident population of English native speak- 
ers is minimal). Code-switching and code-alternation with aspects of diglossia are 
common on Malta. The English impact on Maltese grammar and lexicon has not 
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been determined yet. Nevertheless, Maltese is not an endangered language. A non- 
native variety of English has been developing. 

There are Maltese emigrant communities in Australia, Canada and the USA. 
Australian Maltese has acquired (and preserved) features which let it diverge from 
standard Maltese (Bovingdon 2001). The Maltese dialect map still has to be drawn 
up. The existence of four non-standard varieties can be taken for granted, namely 
the dialects of Gozo, the Zurrieq region, the village varieties and the urban dialects. 
The dialect features are mainly phonological, e.g. the preservation of post-velar 
consonants or the merger and shift of low and mid-high vowels (Puech 1994: 
17—23). In this chapter, I refer exclusively to standard Maltese. 

Owing to its age-long exposure to Italian and Sicilian, Maltese bears evident 
signs of Romancisation. Borrowings and calques from Italo-Romance are not con- 
fined to the lexicon (which consists of over 55 % elements of Romance origin) but 
also make themselves felt in morphology, syntax and semantics (Brincat 2001). 
For this reason, Maltese has been termed a mixed language (Aquilina 1958). Con- 
sider (1): 


(1) Ghax ghandkom tkunu tafu li t-IIIB 
because have:2Pl 2:be.Fut:Pl 2:know:Pl that Def-IIIb 
ma kinitx magħrufa għal xi 
Neg1 be:3Sg.Perf:Neg2 PP:know:F for some 
kwalitajiet tajbin jew xi rizultati 
quality:Pl good:Pl or some result:Pl 


eccellenti fl-ezamijiet. 

excellent in:Def-exam:Pl 

*Because you must know that the IIIB was not known for good qualities or 
excellent results in the exams.’! 


The four words in boldface are typical lexical Italianisms whereas the remainder of 
the sentence is entirely Semitic. The Italian pluraliser -i (in rizultati ‘results’) is a 
newcomer to the morphological system of Maltese, too. Like -i a number of other 
bound morphemes of Italian (or Sicilian) have become part of the Maltese system 
of encoding devices. This high degree of Romancisation (together with the less 
well integrated more recent influx of English elements and features) lets Maltese 
stand out among other Neo-Arabic varieties although the basically Semitic char- 
acter of the grammatical system of Maltese is still intact. 
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2. Phonetics and phonology 


2.1. Phoneme inventory 


There are five phonemic vowel qualities /1/, /e/, /a/, /o/, /u/ represented by the gra- 
phemes «i», «e», «a», «o», «u», respectively The phonetic realisation of mid- 
high vowels oscillates between closed (= [e]. [o]) and open (= [e]. [5]) articu- 
lations. Reduced vowel phonemes are not attested whereas moderate phonetic re- 
duction of vowels in unstressed syllables is optional. In addition, two diphthongs 
/ai/ = [er] ~ [er] and /au/ = [eu] ~ [90] form part of the phoneme chart. Their 
graphic representation is «ghi» for /ai/ and <ghu> for /au/. Quantity is distinctive 
with vowels. For all short qualities, there is a phonemically long partner i.e. /1:/, 
/e:/, /a:/, /o:/, /u:/. Vowel length is not systematically marked by orthographic 
means; the immediate neighbourhood of the otherwise "silent" consonants <gh> 
and «h» is often an indicator of vowel-length although long vowel quantities occur 
also in other context. 

The consonantal phonemes are (i) plosives, (a) voiceless: /p/, /t/, /k/, /?/ (= <p>, 
«t», «k», <q>), (b) voiced: /b/, /d/, /g/ (= «b», «d», <g>); (ii), affricates (a) voice- 
less: /ts/, /t{/, (= <z>, <C>), (b) voiced: /d5/ (= <g>), (iii) fricatives, (a) voiceless: /f/, 
Isl, I fI, Ih/ (= <f>, «s», «x», >), (b) voiced: /v/, /z/ (= «v», <z>); (iv) nasals: /ny, /n/ 
(= «m», <n>); (v) liquids, (a) lateral: /I/ (= <I>), (b) rhotic: /r/ (= <r>), (vi) approxi- 
mants: /w/, /j/ (= <w>, <j>). The phonemic status of the class of approximants is 
doubtful as the two representatives may also be considered consonantal allophones of 
the high vowels /u/ and /i/, respectively. Marginal phonemes are the voiced alveolar 
affricate /dz/ (= <z>) and the voiced (pre-)palatal fricative /3/ (= <si> + vowel), both 
of which are rarely attested and are restricted to recent Italian and English loanwords, 
respectively. It is preferable to consider [dz] an occasional allophone of /ts/ (pace Borg 
and Azzopardi-Alexander 1997: 301) and, analogically, [3] an occasional allophone 
of /[/ (Borg and Azzopardi-Alexander 1997: 303). For the rhotic, two non-distinctive 
articulations exist, namely alveolar tap [r] or retroflex approximant [4] (Borg and Az- 
zopardi-Alexander 1997: 302). The quantity correlation is also distinctive with con- 
sonants. Only the voiced labio-dental fricative /v/ escapes this pattern. With the ap- 
proximants, phonetic geminates occur but minimal pairs based on the contrast short 
vs. long are lacking (Borg and Azzopardi-Alexander 1997: 317). 

The Arabic feature of pharyngealisation is absent from Maltese. Likewise, 
interdental fricatives and some of the post-velars found in other Neo-Arabic lan- 
guages have disappeared from the inventory. In the word-lists of Maltese compiled 
in the 16th-17th century, some transcriptions seem to indicate that at that time at 
least the voiceless interdental fricative /0/ had not merged yet with the voiceless 
denti-alveolar plosive /t/. Vella (2006: 85) assumes that the same holds good for the 
voiced interdental fricative /Ö/, the velar fricatives /x/, /y/, the uvular plosive /q/ 
and the pharyngeal fricative /S/. These elements became obsolete during the 18th 
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century. The sound change from Arabic to Maltese comprises the following pro- 
cesses (Cardona 1997: 18-24): (i) emphatic obstruents > plain obstruents, (ii) in- 
terdental fricatives > denti-alveolar plosives, (iii) voiceless velar and pharyngeal 
fricatives > /h/ with variegated realisations ranging from post-palatal to pharyngeal 
(Borg and Azzopardi-Alexander 1997: 301), (iv) voiced velar and pharyngeal 
fricatives > () (usually represented in writing by the digraph «gh» which is pro- 
nounced [h] in word-final position or word-medially in combination with ortho- 
graphic <h>), (v) glottal fricative > Ø (usually represented in writing by <h>, 
which is pronounced [h] in word-final position or word-medially in combination 
with orthographic <gh>), (vi) uvular plosive > glottal plosive, (vii) glottal plosive 
> Ø. Remnants of class (iv) and (v) are said to occur in dialectal Maltese occasion- 
ally. 


2:24 Stress 


In polysyllabic words, there are several possibilities for the placement of primary 
stress. In the Semitic component of the lexicon, stress can only fall on the penulti- 
mate or on a heavy (= closed) ultimate, cf. bajjad = /'baj.jat/ ‘he painted’ vs. bajjad 
= /baj'ja:t/ ‘painter’ (Borg and Azzopardi-Alexander 1997: 319). The enclitic ne- 
gator -x usually attracts stress such that bajjad ‘he painted’ becomes ma bajjadx = 
/baj'jat=J/ ‘he did not paint’; accent shift onto the ultimate is caused also by suf- 
fixation of the adjectival and nominal pluraliser -in: Malti = /mal.ti/ *(a) Maltese’ 
vs. Maltin = /mal'tin/ ‘(several) Maltese’. Loans from Italian have introduced 
(pre-)antepenultimate stress site and the possibility of a light (= open) ultimate to 
be stressed: intrépidu ‘intrepid’, fedelta ‘loyalty’. Long vowels and stress tend to 
correlate. Secondary stress can occur only on syllables to the left of primary stress. 


2.3: Syllable structure, phonotactics, morphonology 


Cumulation of more than two consonants on either margin of a syllable is dis- 
favoured although, in their lexicon form, words may contain initial syllables 
or final syllables with a more complex syllable head or coda. The syllable types are 
(i) naked-open V hu ‘he’ (= /u/), (ii) covered-open CV ra ‘he saw’, (iii) covered- 
closed CVC rat ‘she saw’, (iv) covered (binary onset)-open CCV blù ‘blue’, (v) 
covered (binary onset)-closed CCVC ħmir ‘donkey’, (vi) covered (binary onset)- 
closed (binary coda) CCVCC ktibt ‘I/you wrote’, (vii) naked-closed VC id ‘hand’, 
(viii) naked-closed (binary coda) VCC art ‘land’, (ix) covered (ternary onset)- 
closed (binary coda) straht ‘I rested’. The cliticisation of the negator -x may create 
ternary codas of syntactic words such as CCVCC ma ktibtx ‘I did not write’. 

In case of higher complexity on the margins, epenthesis applies on the level of 
sentence phonetics, by which the consonant clusters are resyllabified. A prosthetic 
/i/ is attached to a word with binary or ternary onset if the word occupies the sen- 
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tence-initial slot or is preceded by a word ending in a consonant: il-bierah istraht 
‘yesterday, I rested’ vs. jiena straht ‘I rested’. The prosthetic /i/ is employed op- 
tionally also with word-initial sonorants in binary clusters, e.g., ltaqa’ ‘he met’ = 
[l'ta.?a] ~ [ıl'ta.?a] where the prosthetic vowel is not represented orthographi- 
cally (Borg and Azzopardi-Alexander 1997: 305). 

Final devoicing applies. In a word like kiteb ‘he wrote’ the final bilabial plosive 
is phonetically unvoiced: ['kr.tep]. At the same time, anticipatory assimilation 
is compulsory in consonant clusters. This means that in the imperfective plural of 
the same verb — jiktbu ‘they write’ — the bilabial plosive is no longer in word-final 
position and thus retains the feature [+voice]. In addition, those consonants to the 
left which form a medial cluster with /b/ assimilate to the voiced bilabial plosive 
in terms of phonation yielding the chain of segments ['j1g.dbu]. Since the relative 
position of consonants in a Maltese lexical morpheme is subject to systematic 
alterations such that in some word-forms, consonants may be immediate neigh- 
bours but in others not, these assimilation processes occur frequently in Maltese 
paradigms. 

The definite article is a proclitic i/- whose final consonant regularly assimilates 
to the initial alveolar consonant of its host: in-nar ‘the fire’, id-dar ‘the house’, 
ic-Cumnija ‘the chimney’, ix-xemx ‘the sun’, etc. vs. il-qamar ‘the moon’, il-Belt 
‘the City (= La Valletta)’, il-knisja ‘the church’, il-hobza ‘the slice of bread’. An 
exception to this rule is the voiced alveo-palatal affricate <g> = /ds/ as in il-genna 
‘the paradise’ (Cardona 1997: 25-26). 

Vowel harmony is restricted to the morphological expression of the 2nd person 
singular in the following functions: direct object, pronominal possessor, and pro- 
nominal complement of prepositions. The affix/clitic under scrutiny has the basic 
shape -(V)k. The vowel slot is filled only if the preceding stem ends in a consonant 
(Au-k ‘your brother’ with a vocalic stem). The default realisation of the vowel is /e/ 
as in dar-ek *your house'. If the last vowel of the lexical morpheme is /o/, the suffix 
comes in the shape of -ok, cf. omm-ok *your mother'. Phonologically determined 
allomorphy also applies in a variety of other cases. The suffix of the 1st person sin- 
gular (possessor, prepositional complement), for instance, is -i on consonantal 
stems and -ja on stems ending in a vowel or the palatal approximant, cf. lil-i ‘for 
me' vs. bi-ja *with me'. The same affix causes morphonological changes on the 
plural marker of nouns: id-ejn *hands' — ide-ej-ja *my hands'. This suggests that 
Maltese reflects aspects of the fusional type. 
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3. Morphology and grammatical categories 


3.1. Morphological type 


Typologically, Maltese is mixed as to the morphological strategies it employs. 
Mostly in the Semitic part of lexicon and grammar, introflexion (by [multiple] 
vowel replacement etc.) is prominent. However, Fabri (2009: 20) argues that what 
appears to be introflexion does not serve a morphological purpose. In his mind, the 
vowel alternations are phonologically motivated. He argues that “morpho-phono- 
logically, Maltese is indeed a mixed language, being based on both roots and 
stems.” This observation refers to the combination of two kinds of lexical mor- 
phemes, namely the typically trilitteral consonantal root associated with Semitic 
etyma and the fully vocalised stem representative of Indo-European etyma. The 
former type implies that in word-forms like KiTeB ‘to write’, KTiB ‘writing’, 
KiTBa ‘document’, KTieB ‘book’, KTejjeB ‘booklet’, KiTTieB ‘writer’ miKTeB 
*writing-desk', etc. the three consonants identified by the upper-case letters K-T-B 
form the discontinuous root equipped with the meaning WRITE. The insertion/ 
addition of a vowel melody is called for to render the word-form pronounceable. 
Further consonantal and vocalic material and the position of the root consonants 
serve the purpose of derivation and inflection. The massive adoption of foreign 
words however, has introduced and reinforced a different morphological principle 
according to which lexical morphemes are represented by internally unmodified 
segmental chains i.e. by stems as with (i)ppretenda ‘to pretend’ (< Italian preten- 
dere) whose paradigm consists entirely of word-forms with invariable stem (stem 
= boldface): (i) perfective 1Sg = 2Sg (i)p-pretend-ej-t, 3SgM (i)p-pretend-a, 3SgF 
(i)p-pretend-iet, 1Pl (i)p-pretend-ej-na, 2Pl (i)p-pretend-ej-tu, 3Pl (i)p-pre- 
tend-ew; (ii) imperfective 1Sg ni-p-pretend-i, 28g = 3SgF ti-p-pretend-i, 3SgM ji- 
p-pretend-i, 1Pl ni-p-pretend-u, 2Pl ti-p-pretend-u, 3Pl ji-p-pretend-u (Cardona 
1997: 65). Owing to the many Italian-derived verbs and nouns, stem-based and 
root-based morphology co-exist in present-day Maltese. 

Maltese employs prefixation, infixation and suffixation for a variety of gram- 
matical purposes: prefixation is a means to form passive participles, e.g., mi-ktub 
‘written’ («— kiteb ‘to write’), infixation occurs with passive-reflexive formations 
of verbs, for instance, as in f-t-ahar ‘to brag, to boast’ («— fahar ‘to praise’), 
suffixes indicate among many other things the plural of nouns, cf. le hj-iet ‘beards’ 
(< lehja ‘beard’). Internal modification is employed as well. Very regularly, the 
causative and/or intensive of trilitteral verbs is formed via gemination of the 
medial consonant: kiseb ‘to obtain’ — kisseb ‘to obtain after some effort; to pro- 
vide s.th. for s.o.’. So-called broken plurals of nouns and adjectives illustrate non- 
concatenative morphology: belt ‘city’ — bliet ‘cities’. Suppletion is only weakly 
represented in the morphological system. One of the very few examples is qal ‘to 
say' whose paradigm contains two etymologically distinct lexical morphemes, viz. 
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qal (for the 3rd persons of both numbers of the perfective) and (-)g hid- (for the re- 
mainder of the paradigm). 

Typologically, Maltese morphology is partly agglutinative and partly fusional. 
This dichotomy can be illustrated by person-number inflection on verbs. In the per- 
fective, the suffix is a cumulative exponent of person, number and aspect whereas 
in the imperfective, this information is distributed over two affixes, a prefix and a 
suffix: xrob-na ‘we drank’ vs. ni-xorb-u ‘we drink’. In addition, there are also peri- 
phrastic constructions. Suprasegmentals are not primary markers of inflectional 
categories. Relatively rarely, stress site distinguishes verbs and non-verbs, cf. 
stress on penultimate bdttal ‘to empty’ vs. stress on ultimate battdl ‘empty’ (Borg 
and Azzopardi-Alexander 1997: 319). There is also a plethora of inflectional 
classes for nouns, adjectives and verbs. As to nouns, two major classes of plurali- 
sation are distinguished, namely “sound” plurals which are formed via suffixation 
(with four sub-classes according to the affix employed, e.g., singular diskors 
‘speech’ — plural diskors-i, singular omm ‘mother’ — plural omm-ijiet, singular 
imgħa ‘week’ — plural gimg h-át, singular ħalliel ‘thief’? — plural fallel-in) and 
"broken" plurals whose encoding strategy is introflexion (with a controversial 
number of sub-classes according to the alternation-patterns of the vowels in singu- 
lar and plural, e.g., singular kelb ‘dog’ — plural klieb, singular gżira ‘island’ > 
gzejjer with infixation). There is also the possibility that suffixation and intro- 
flexion jointly form the plural of a noun. Suppletion applies in the case of singular 
mara *woman' plural nisa. 


3. Word structure 


Nouns, adjectives, the numeral one, demonstratives, verbs, pseudo-verbs and 
prepositions inflect whereas personal pronouns, definite article, the majority of the 
numerals, conjunctions and interjections do not inflect. Prepositions host the per- 
son marker of their pronominal complements. A case in point is fi ‘in’ > 1Sg fi-ja, 
2Sg fi-k, 3SgM fi-h, 3SgF fi-ha, 1Pl fi-na, 2Pl fi-kom, 3Pl fi-hom. 

Words may consist of a root or stem plus affixes and internal modification. Sev- 
eral morphemes beyond the lexical morpheme may combine such as missir-ijiet-na 
= {father}-{Pl}-{Por.1Pl} ‘our fathers’. The phonological word is rendered more 
complex via cliticisation. Proclitics may fuse to form complex proclitics. Two 
types of complex proclitics occur: (i) demonstrative + definite article as in dat-tifel 
~ dan it-tifel “this boy’, (ii) preposition and definite article as in mit-tifel ~ minn 
it-tifel 'from the boy'. While the cliticisation of the definite article is compulsory, 
the formation of complex proclitics is largely optional. As to enclitics, direct and 
indirect pronominal objects are typical examples. Finite (di-)transitive verbs host 
enclitic object markers which are attached to the rightmost element of the host: nik- 
teb ‘I write’ — ni-ktb=u ‘I write it’ / ni-ktib=lu ‘I write to him’ / ni-ktib=hi=lu 
‘I write it for him’, ktib-tu ‘you (Pl) wrote’ — ktib-tü=h ‘you (Pl) wrote it’ / ktib- 
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tu=lu ‘you (Pl) wrote to him’ / ktib-tu-ház lu ‘you (Pl) wrote it for him’. The order 
of direct and indirect object clitics is always DO > IO. Combinations of both object 
clitics on a finite verb are only admissible with direct objects of the 3rd persons. 
The enclitics have an effect on the stress site and the phonological shape of the 
lexical morpheme (raising of unstressed /e/ and secondary stressed /a/, syncope 
with *vocalic" clitics, etc.). Simultaneous cliticisation on both margins of a poten- 
tial host is ruled out in Maltese. Proclitics select nouny hosts, whereas enclitics 
combine with finite verbs. 


3.3. Nominal categories 


There are two genders, masculine and feminine, which are distinguished formally 
in the singular whereas the plural is not sensitive to gender distinctions. Notwith- 
standing the fact that there is a handful of exceptions, the grammatical gender 
of nouns is largely predictable on formal grounds (Stolz 2009). Singular nouns 
ending in a consonant or in /u/ tend to be masculine with overwhelmingly more 
than chance frequency, cf. dqiq ‘flour’ and genju ‘genius’. With an even higher 
degree of probability, singular nouns ending in /a/ are grammatically feminine, cf. 
nida ‘dew’ and tentufa ‘trifle’. The situation is less clear with nouns ending in /i/ 
which may be either feminine (e.g. doti ‘dowry’) or masculine (gidi ‘kid’). For a 
number of nouns, there exist pairs of masculine and feminine forms such as Amir 
‘donkey’ and /imir-a ‘she-donkey’ with a common plural Amar ‘donkeys’. 

For adjectives, there is usually a tripartite paradigm consisting of two gender- 
sensitive forms in the singular and a common plural form, cf. ahmar ‘red’ > M 
ahmar, F hamra, Pl homor. Attributive and predicative adjectives agree in gender 
and number with the noun they modify as in tifel zghir ‘a little boy’, tifl-a zghir-a 
‘a little girl’, tfal zg ħar ‘little children’. Some adjectives borrowed from Italian are 
morphologically invariable, e.g., M = F = PI intelligenti ‘intelligent’. A closed set 
of adjectives has distinct morphological comparative forms such as positive tajjeb 
‘good’ — comparative itjeb ‘better’. For the majority of Maltese adjectives, an 
analytic comparative construction with aktar ~ iktar ‘more’ is employed: aktar in- 
telligenti ‘more intelligent’. For both strategies, the superlative is formed by ad- 
ding the definite article proclitic to the comparative i.e. /-itjeb ‘the best’ and l-aktar 
intelligenti ‘the most intelligent’. Morphological comparatives distinguish neither 
gender nor number whereas in analytical constructions, the adjective is sensitive 
to gender and number distinctions: l-aktar mara maghruf-a ‘the best known/most 
famous woman'. In the analytical comparative construction, the head-noun is 
sandwiched between the comparative marker on the left and the lexical adjective 
on the right. 

The gender opposition is also relevant for the 3rd person singular subject of 
verbs as masculine and feminine trigger different affixes. In the perfective, the 
masculine form is morphologically unmarked - kiteb ‘he wrote’, whereas the femi- 
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nine is encoded by a dedicated suffix -(e)t, cf. kitb-et ‘she wrote’. In the imperfec- 
tive, two distinct prefixes are employed: ji-kteb ‘he writes’ vs. ti-kteb ‘she writes’. 
This distinction is also relevant in subject-verb agreement: ir-ragel ji-tlaq / telaq 
‘the man leaves / left’ vs. il-mara ti-tlaq / telq-et ‘the woman leaves / left’. Gender 
and number distinctions are overtly marked also on the active participle (mexa ‘to 
walk’ > M miexi, F miexj-a, Pl mix-in) and the passive participle (kiteb ‘to write’ 
— M miktub, F miktub-a, Pl miktub-in). 

Nouns distinguish singular and plural. There is a small number of pluralia tan- 
tum like flus ‘money’. For a small set of items, there is also a residual dual number 
(Fenech 1996), cf. Sg gimgha ‘week’, Pl gimgh-at, Du gimagh-tejn ‘two weeks’. 
Quite a few nouns have a collective and a singulative form and thus some nouns 
have a paradigm of five numbers including the so-called “exact” plural used with 
cardinal numerals up to ten (Mifsud 1996): Collective kliem ‘words’, Singulative 
(= unit noun) kelma ‘word’, Pl (= exact plural) kelmiet ‘words’, Du kelmtejn ‘two 
words’. Adjectives and verbs only distinguish singular and plural such that a noun 
in the dual always triggers plural morphology on the agreeing words. For the col- 
lective, agreement oscillates between singular and plural. 

Case is not a morphological category. Word order and prepositions are employed 
instead. For definite animate direct objects, the object marker lil ‘for’ is widely 
used, as in /-ghalliem ra lil Guze / lilek ‘the teacher saw Guzé / you’, otherwise a 
bare NP can function as direct object as well, cf. I-ghalliem ra d-dar ‘the teacher 
saw the house’. Pronominal objects are encoded by object clitics on the verb. 

Indefiniteness is not overtly marked. Definiteness is marked by proclitics on 
the initial word of the NP, which may be a noun, adjective, numeral or the com- 
parative marker aktar ~ iktar. The use of the definite article is compulsory in the 
presence of a demonstrative: dawn id-djar ‘these houses’. There is also formal 
definiteness agreement on adjectives which, however, is used for contrastive focus, 
cf. il-wardiet sbieh ‘the beautiful flowers’ vs. il-wardiet is-sbieh ‘the beautiful 
flowers’ as opposed to the ugly ones. 

Inalienably possessed nouns may be possessor-marked by suffixation, e.g., ras 
‘head’ — ras-i ‘my head’. Nouns equipped with a possessor suffix cannot combine 
with the definite article, whereas under alienable possession, overt definiteness 
marking is in order: il-karrozza tieg hi ‘my car’, literally ‘the car of-me’ where the 
possessor is the pronominal complement of the preposition ta’ ‘of’. The same 
preposition is used in so-called genitive constructions with lexical possessor and 
lexical possessee (il-ktieb tat-tifla ‘the book of the girl’). However, for inalienable 
possession, there is also the construct state, a construction in which the NPs are 
(recursively) juxtaposed in the order possessee > possessor: isem omm Albert "AL 
bert’s mother’s name’ (Stolz et al. 2008). For feminine nouns ending in -a, con- 
structions associated with inalienable possession trigger the so-called t-marbuta 
(“bound -?”), cf. mara “woman; wife’ —> mar-t-i ‘my wife’ and mar-t habib-i ‘my 
friend’s wife’. 
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3.4. Verbal categories 


Maltese has no proper copula in the present tense. In nominal sentences, the copula 
slot is either empty or filled by a personal pronoun, cf. Pawlu (hu) g halliem ‘Pawlu is 
a teacher’ with the optional personal pronoun hu ‘he’ inserted between the two NPs. 
The copula is special as it not only has a perfective paradigm (e.g. kont ‘I was/you 
were’, kienet ‘she was’, etc.) but also distinct future/irrealis forms (e.g. nkun ‘I will 
be’, tkun *you/she will be’, etc.). Full verbs normally distinguish two basic aspectual 
categories by inflectional means, viz. the imperfective whose pronominal subject 
markers are prefixes complemented by a number suffix in the plural (e.g. ni-tlab ‘I 
beg' vs. ni-talb-u *we beg") and the perfective whose pronominal subject markers are 
suffixes (e.g. tlab-t ‘I begged’ vs. tlab-na ‘we begged’). The imperfective has various 
readings, mostly present and future but also habitual, whereas the perfective is norm- 
ally associated with past time. The imperfective can also be employed to describe 
events in the past provided the time reference is clarified via the use of an auxiliary. 
Maltese has a very rich system of lexical and grammatical aspectual and aktionsart- 
like distinctions (Vanhove 1993). Particles are employed to encode the progressive 
(ged ~ qieg ħed/qieg hda/qieg hdin), the future (se(r) ~ sejjer/sejra/sejrin) and the con- 
ditional (kieku). The general auxiliary used for the formation of past imperfective, 
pluperfect and future periphrases is kien ‘to be’, e.g., kien jara ‘he was reading’, kien 
qara ‘he had read’. There is a class of so-called aspect auxiliaries which express cat- 
egories like continuative (baqa' ‘to remain’), inchoative-continuative (beda ‘to 
begin’), repetition (rega’ ‘to return’), etc., e.g., bgajt naqra ‘I kept on reading’. All 
these categories are encoded periphrastically by constructions made up of two asyn- 
detically combined finite verb forms which may yield rather longish verb chains. 
On Semitic verbs, the reflexive-passive and causative-intensive is encoded 
morphologically either by prefixes (bena ‘to build’ vs. n-bena ‘to be built’), infixes 
(nesa ‘to forget’ vs. ntesa ‘to be forgotten’) or consonant gemination (nesa ‘to for- 
get’ vs. nessa ‘to make s.o. forget s.th."). Alongside this inherited voice system, 
Maltese also employs a periphrastic construction for the dynamic passive calqued 
on an Italian pattern. The auxiliary is the motion verb gie ‘to come’ which com- 
bines with the passive participle as in ji-ge-w mi-bghut-in ‘they are being sent’. 
Maltese has no morphological infinitive. The two participles (active and pass- 
ive) do not replace the infinitive but function as gerunds and proper passive parti- 
ciples, respectively. In the absence of a dedicated copula in the present tense, the 
active participle (which is in use only for a closed set of mostly motion verbs) may 
function as the main verb of a predication, e.g. hija tiela’ fug il-bejt ‘she is ascend- 
ing to the roof’ with the active participle tiela’ ‘ascending’ of tela’ ‘to ascend’. 
Maltese displays a class of so-called pseudo-verbs i.e. elements which fulfil 
verb-like functions but do not conform to the morphological patterns of typical 
verbs. Cf. the pseudo-verb ghand- (< preposition ghand- ‘at one’s place’), which 
translates English to have in predicative possession: ghand-i zewg karrozzi ‘I have 
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two cars’. Its paradigm deviates from full verbs in two aspects. The “subject” is in- 
dicated by the pronominal suffixes, which elsewhere function as markers of the 
possessor or as pronominal complements of prepositions. There is tense-based 
suppletion as the past and future forms are morphologically independent of the 
present tense form, namely kel-li “I had’ and i-kol-li ‘I will have’. 


4. Syntax 


4.]. Word order type and alignment 


Maltese displays the properties of a SVO language with a flexible, pragmatically 
determined word order. This flexibility is reminiscent of Italian. To the left of the 
verb, there are several slots which may be occupied by (multiple) topics. Relative 
clauses follow their head nouns. Pronouns are only used as topic or focus pro- 
nouns, in all other respects, Maltese is a pro-drop language. In (2), the personal 
pronoun jiena ‘I’ marks contrastive focus in an exchange of opinions. 


(2) Jien ngħid li għandna nsibu triq oħra. 
I l:say that have:1Pl 1:find:Pl street other:F 
‘I for my part say that we have to find a different way.’ 


Example (3) further illustrates that the use of subject pronouns is largely optional: 


(3) wasl-u l-villa, feth-u u dahl-u. 
arrive-3Pl.Perf Def-villa open-3Pl.Perf and enter-3Pl.Perf 
“They arrived at the villa, they opened [the door] and entered.’ 


Maltese displays accusative alignment. Since morphological case marking does 
not apply, inanimate and indefinite direct object NPs are formally indistinguishable 
from subject NPs. Word order in combination with semantics helps to disambigu- 
ate the syntactic functions. If a definite direct object NP ranks high on the animacy 
hierarchy, the object marker lil ‘for’ is usually employed (Borg 1988: 114-148). 


4.2. Order inside the nominal phrase 


Adjectives follow their head nouns (e.g. ktieb ġdid ‘a new book’, ktieb antik ‘an old 
book’, kotba interessanti ‘interesting books’). Pre-nominal position is obligatory 
with morphological comparatives/superlatives as in (4): 


(4) Infatti qatt ma kellhom l-icken dubju 
in fact never Negl have.Perf:3Pl Def-small.Comp doubt 
la fiha u langas f Ineż. 
neither in:38g.F and nor in Inez 


‘In fact, they never had the slightest doubt about her and Inez.’ 
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Except wie Bed ‘one’ (mara waħda ‘one woman’ vs. mara ‘a woman"), all cardinal 
numerals precede the noun they quantify (zewg nisa ‘two women’) and this is also 
the order for other quantifiers (hafna nisa ‘many women’). The cardinal numerals 
2-10 trigger plural inflection on the quantified nouns whereas all other numerals 
combine with the singular (għoxrin mara ‘twenty women’). Demonstratives are 
located on the left edge of the NP. The first content word of the NP hosts the defi- 
nite article if the demonstrative is used (proper nouns are exempt from this rule), 
cf. dawn iz-zewgt idjar kbar hafna ‘these two very big houses’. 


4.3. Order inside the verbal phrase 
The verb chains of Maltese are ordered internally as follows: 
Auxiliary > TMA particle > Phasal verb > Modal verb > Lexical verb 


In this chain, non-verbal elements may be inserted mostly in the Wackernagel posi- 
tion to the right of the initial member of a verb chain as in kienu digà qaluli ‘they 
had told me already’ with diga ‘already’ separating the auxiliary kienu ‘they had’ 
from the lexical verb qaluli ‘they told me’. Inside the VP direct objects precede in- 
direct objects which in turn precede adjuncts such as adverbial phrases, cf. it-tifel, 
xtara rigal, lil huh; bil-flus, li kellu ‘the boy, bought a present, for his brother, 
with the money, he had' (Borg 1988: 115). This order is also subject to pragmati- 
cally motivated re-ordering. 
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Table I: Typological features of Maltese 


Structural features 


realisation in Maltese 


animacy 
gender 
definiteness 
number 


morphological case 
syntactic case 


possessive constructions 
aspect 
tense 


copula 
mood 


passive 
adpositions 


adverbs 
adjectives 


word order 


alignment 


differential object marking (cf. syntactic case) 

two — masculine vs feminine — in the singular of nouns, 

adjectives and agreement markers on verbs in the 3rd person 

singular 

proclitic definite article, no indefiniteness marker; definiteness 

marking blocked with certain possessive constructions 

singular, plural (on nouns, adjectives and verbs); singulative, 

collective; residual dual (on nouns only) 

absent 

1) In the VP: differential object marking: definite animate 
direct objects are marked by the analytical marker of 
indirect objects 

2) in N-N constructions: construct state = juxtaposition of 
nouns in a genitival relationship 

1) attributive: alienable vs inalienable possession 

2) predicative: no etymological habeo-verb 

3) external possession: abundantly attested 

imperfective vs perfective 

future only with the copula, present and future only in 

predicative possession 

no present tense form (various replacements) 

morphological indicative vs imperative; analytical 

conditional/irrealis 

morphological: reflexive/passive; analytical: dynamic 

passive, resultant state; agent phrase: yes 

prepositions inflected for person, gender and number 

only a small number of primary adverbs 

agree with their heads (in attribution and predicatively) in 

gender and number; definiteness marking on adjectives 

for contrastive focus in attribution; analytical and synthetic 

comparatives coexist 

pragmatically free SVO with N Adj, N Gen, Prep N, Dem N, 

Num N and N Rel 

accusative 


See also the following chapters in this volume: 18 by Sansó, 23 by Breu and 38 by 


Ramat. 
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Notes 


1 Comp = comparative, Def = definite, DO = direct object, Du = dual, F = feminine, 
Fut = future, IO = indirect object, M = masculine, Neg = negation, NP = noun phrase, 
Perf = perfective, Pl = plural, Por = possessor, PP = passive participle, Sg = singular, VP 
= verb phrase. 
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1. Introduction 


Romani is the only Indo-Aryan language that has been spoken exclusively in Eu- 
rope (and by European emigrants to the New World) since the Middle Ages. It was 
apparently brought into Anatolia under Byzantine rule sometime around the elev- 
enth century by a caste-like population specialising in itinerant services such as 
metalwork and entertainment. Their self-appellation, rom, appears to be related to 
that of the dom — a low-status and landless caste specialising in manufacturing 
small tools and artefacts, weaving, and entertainment. Other groups with cognate 
names, a similar socio-economic profile and an Indo-Aryan language living out- 
side of India include the dom of the Middle East and the lom of the Caucasus. The 
dom are often described not just as segregated but also as an enslaved population, 
and it has been suggested that the ancestors of the fom may have been camp-fol- 
lowers who accompanied the Seljuk invasions through western Asia and were 
either captured or recruited by the Byzantines. Historical sources testify to a wave 
of Romani migrations from the Balkans into western and northern parts of Burope 
during the fifteenth century. The names most commonly used during this period to 
refer to them by the majority sedentary population are ‘Gypsy’, *Yifti', ‘Gitan’ etc. 
(all derived from ‘Egyptian’, based on a misguided assumption about their origin), 
and “Isigan’, ‘Zigeuner’, “Tsigngani’, ‘Çingene’, etc., all possibly derived from an 
old Turkic appellation for low-caste slaves (ciyan). 
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Settlement in the various regions of Europe led to the dispersal of Romani 
populations and to their isolation from one another, and this, along with the in- 
fluence of different contact languages, is what gave rise to the emergence of con- 
siderable differences among the dialects of Romani. The earliest known attestation 
of Romani has only just recently been discovered in a manuscript dating from 
around 1515 composed by by Johannes ex Grafing, who may have collected his 
material in Vienna (Knauer 2010). A text published by Andre Borde in 1542 con- 
tains sample phrases in Romani thought to have been collected in England or 
France. The earliest known attestation from the Balkans is in the travel diary of Ev- 
liya Celebi from 1668 (Friedman and Dankoff 1991). Sources from the mid-eight- 
eenth century offer a sufficiently wide coverage to be able to establish that differ- 
ences among the dialects of Romani by that time closely resembled the patterns of 
differentiation found today. It was in this period that hypotheses concerning the 
Asian origin of Romani were first made, culminating in Johann Rüdiger's (1782) 
publication that conclusively proved that Romani was closely related to Hindu- 
stani. Today, Romani constitutes one of the largest minority languages of Europe, 
with upwards of 3.5 million speakers residing mainly in southeastern Europe (Ro- 
mania, Bulgaria, Serbia, Macedonia, Greece, Albania) as well as in central Europe 
(Slovakia, Hungary). There are also sizeable Romani-speaking communities in the 
United States and in South America as well as in urban centres of western Europe. 
In most of the fringe regions of Europe — Scandinavia, Britain, and the Iberian Pen- 
insula — Romani populations abandoned their language toward the beginning of the 
nineteenth century or even earlier, but maintain an in-group vocabulary of Romani 
origin, which comprises up to 600 lexical roots. All Romani speakers are bilingual 
and often multi-lingual from a very young age. The language has traditionally been 
limited to oral usage, primarily within the extended family and with neighbouring 
clans. Code-switching and mixing are common, and dialects of Romani have ab- 
sorbed considerable structural and lexical influences from their respective contact 
languages. 

The lexicon of most Romani dialects contains a shared inventory of only 
around 1,000-1,200 lexical roots. Of those, only around 800 are pre-European, 
consisting of a core vocabulary of around 600 Indo-Aryan roots along with layers 
of early loans from Iranian languages and from Armenian (as well as other Cau- 
casian languages). All Romani dialects also contain a significant layer of Greek 
loans, as well as a much smaller number of shared lexical items of Balkan Slavic 
and Balkan Romance origin. The remainder of the vocabulary is typically derived 
from subsequent contact languages, and thus differs among the individual dialects 
of the language. The shared lexicon provides some additional clues to the history 
of the population, suggesting a prolonged settlement in an area in which Greek, Ar- 
menian, and Iranian will have played a significant role as contact languages — in all 
likelihood eastern or central Anatolia — and a shared migration into the Balkans be- 
fore the dispersal into groups. 
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Traditionally an oral language of the household, Romani is increasingly being 
used mainly in electronic communication among community activists, who prac- 
tice a pluralistic approach to codification and the selection of dialect features. 
Some programmes to promote the language in education and media are under way, 
though there is no standard written form of the language nor a prestige variety that 
would lend itself for the creation of such standard. 


2. Phonetics and phonology 


2.1. Phoneme inventory 


Romani has five inherited phonemic vowel qualities /i/, /e/, /a/, /o/, /u/. Many Ro- 
mani dialects also have a central or centralised vowel, /2/ or /i/ or sometimes /w/, 
which may be limited to loanwords but is often found to infiltrate the inherited lexi- 
con as well. Some dialects take on additional phonemes, such as /y/ (in contact 
with Turkish) and /e/ (in contact with German and English). In contact with west- 
ern and central European languages, some Romani dialects acquire vowel length 
distinctions, which, however, are not always phonemic. There are few inherited 
words containing the diphthongs /aj/,/oj/, and /ej/. 

The consonantal phonemes are (1) plosives: (a) voiceless, non-aspirated: /p/, 
/t/, /k/, (b) voiceless, aspirated /p"/, /t'/, /k"/, (c) voiced: /b/, /d/, /g/; (2) affricates: 
(a) voiceless, non-aspirated: /ts/, /t{/, (b) voiceless, aspirated: /tf^/, (c) voiced: /d/; 
(3) fricatives: (a) voiceless: /f/, /s/, /[/, /x/, /h/, (b) voiced: /v/, /z/; (4) nasals: /m/, 
/n/; (5) liquids: (a) lateral: /l/, (b) rhotic: /r/, (6) approximant: /j/. A marginal pho- 
neme is /5/, often limited to loanwords. Many Romani dialects show palatalisation 
of consonants in positions preceding front vowels, and some have palatal pho- 
nemes such as /c/ and /y. Conservative Romani dialects in southeastern Burope 
tend to preserve two rhotics, a trilled /r/ and an additional sound that is realised 
either as a uvular /R/ or even a voiced velar fricative /y/, or in some dialects as a 
retroflex /t/. Some dialects show a collapse of /x/ and /h/ in /x/. 


2-2. Stress 


Conservative Romani stress patterns are grammatical. A series of older inflection 
suffixes carry stress, e.g. the nominative ending in rakl-ó ‘boy’, rakl-i ‘girl’, and 
the person ending in /i-j6m ‘I took’, dikh-jö-l ‘it is seen’. A secondary or histori- 
cally younger set of grammatical suffixes is external and does not carry stress, e.g. 
the remoteness tense marker -as as in dikh-l-jóm-as ‘I had seen’, or so-called Layer 
II case-markers that follow the oblique ending, as in rakl-és-ke "Tor the boy’. In 
some dialects, especially the western dialect of Romani, stress patterns are taken 
over or partially replicated from the contact languages. Stress shifts forward and 
becomes fixed to a particular syllable rather a grammatical ending. 
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2:3. Syllable structure, phonotactics, morpho-phonology 


The typical syllable structure is CV or CVC, with most words having up to three or 
at most four syllables, typically CVCVCV or CVCCV(CV), etc. Consonant 
clusters in the inherited lexicon are largely limited to the combinations Cr (in in- 
itial or medial position) and Cn, Cm (mainly in medial or final position), though 
through the large number of loanwords from different languages syllable patterns 
show enormous variation across the dialects. Several issues involving morphopho- 
nology are noteworthy. First, there is a tendency for segments representing person 
endings on the past-tense verb as well as medio-passive and sometimes also cau- 
sative derivational morphemes to show jotation, which in turn often leads to pala- 
talisation of the preceding segment: Thus gel-jom ‘I came’ > gejom (but gel-o ‘he 
came’, with an adjectival-participial ending), kerd-jov-el ‘it is being done’ > 
kerd’ovel, kerdzovel. Next, Romani as a whole shows alternation in grammatical 
paradigms between the phonemes /s/ and /h/, with individual dialects opting for 
different solutions. The most likely position for /h/ to appear is in morphological 
endings in intervocalic position, thus Russian Romani le-sa ‘with him’, dZin-es-a 
“you know’, German Romani le-ha ‘with him’, dZin-eh-a ‘you know’. Alternations 
are also found in the root form of the copula verb: Erli Romani from the southern 
Balkans sin- ‘is’, Finnish Romani hin ‘is’. Some dialects show /h/ in the inter- 
rogative paradigm: East Slovak Romani sar ‘how’, West Slovak Romani har 
‘how’. Third, Romani second-layer or external case-markers are enclitics that 
show voice assimilation to the preceding segment: tu-tar ‘from you’, les-ke ‘for 
him’, but man-dar ‘from me’, len-ge “for them’. Finally, Romani past-tense or per- 
fective markers show inflection class re-assignment based on a hierarchy of fea- 
tures represented by the final segment of the verbal root: -m > -k, -kh > -& -ch > -$, 
-s, with stems at the top of the hierarchy more likely to be re-assigned from the 
class with in -t- to that in -/-: *mukh-t-jom ‘I left’ > mukh-l-jom (see Matras 2002: 
138-139). 


3. Morphology and grammatical categories 


3.1. Morphological type 


Typologically, Romani morphology is partly fusional and partly agglutinative. The 
fusional element tends to be older and is best represented by the person conjugation 
on the verb, especially in the present tense, and by the retention of declension 
classes in the first layer of nominal inflection, thus: rakl-o, ‘boy (nominative)’, 
rakl-e ‘boys (nominative)’, rakl-es ‘boy (oblique)’, rakl-en ‘boys (oblique)’, rakl-i 
‘girl (nominative)’, rakl-jan ‘girls (oblique)’, and so on. (Note that nominal inflec- 
tion of this type also appears in personal pronouns, demonstratives, interrogatives, 
definite articles and adjectives). The agglutinative element is generally younger 
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and is best represented by second-layer case-markers, which are enclitic and not 
sensitive to declension class: rakles-tar ‘from the boy’, raklja-tar ‘from the girl’. 
Some tense and modality markers as well as verb derivational markers indicating 
transitivity and intransitivity are also agglutinative: Russian-Baltic Romani 
sikhlja-kir- ‘to teach, to show’, xolja-kir- ‘to upset, to anger’. Some older deri- 
vational markers — for example the transitive/causative augments -av- and -ar- re- 
tain their agglutinative character (dar- ‘to fear’, dar-av- ‘to frighten’; dand ‘tooth’, 
dand-ar- ‘to bite’), while some younger markers are in the process of losing their 
agglutinating-derivational character and are becoming fusional-inflectional: ker-el 
‘he/she does’, kerd-jov-el ‘it is done’ > kerdj-ol ‘it is done’, and by analogy also 
ach-ol ‘he/she stays’ and other intransitive or situational verbs. In addition to the 
pre-European morphological component, Romani dialects inherit an inventory of 
Greek-derived inflectional markers, such as the tense-aspect markers -in-, -az- and 
so on. In many dialects these markers take on the function of agglutinative deri- 
vational elements, identifying the root as a loan: thus Turkish Romani anlad-iz-av 
‘I understand’ (from Turkish anladı ‘understood’, with Greek-derived -iz-), Ursari 
Romani (Romanian Black Sea coast) entr-iz-av ‘I enter’ (Romanian fntr- ‘enter’, 
with Greek-derived -iz-). Suppletion is rare in Romani and appears primarily in the 
pronominal system, where we find a split between renewed nominative forms in ov 
‘he’, oj ‘she’ etc., and more conservative oblique forms in les ‘him’, la ‘her’ and so 
on (the same split also affecting the set of demonstratives and in some dialects also 
the definite articles), and in the verb ‘to go’ (present dZa-, past ge-). Further supple- 
tion often comes about through reliance on a loan word for individual words such 
as ordinals (especially ‘first ) and comparatives (especially ‘worse’). 

On the whole the language can be said to be drifting toward stronger reliance 
on analytical structures. This is in part a contact phenomenon. The Romani dialects 
of western Europe have largely abandoned flexional derivations in the verb and 
rely instead on analytical constructions with auxiliaries. In southeastern Europe 
(Albania, Macedonia, and Greece), Romani dialects have developed analytical 
perfect tenses, while in central and eastern Europe medio-passive constructions 
tend to rely on the active verb with a reflexive pronoun. Inflectional case markers 
also show strong competition from prepositions, though the extent varies among 
the dialects. An outstanding characteristic feature of Romani morphology is the 
division between pre-European vocabulary and European loanwords in both nom- 
inal and verbal morphology. In nouns, the difference is expressed primarily in the 
choice of declension class, which determines the expression of the nominative and 
partly also the oblique form. In verbs, European loans require a loan-verb adap- 
tation marker, which functions essentially as a derivational marker that mediates 
between the verb stem and its tense-aspect and person inflection. 
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3. Word structure 


The majority of words inflect: nouns, adjectives, lower numerals, demonstratives, 
verbs, interrogatives, personal pronouns, definite and indefinite articles, local 
relations adverbs, and some indefinites. Words that do no inflect are conjunctions, 
interjections, and prepositions (though the latter are often derived from inflected 
local relations adverbs). The word typically consists of a word root, followed by a 
derivational marker and the by inflectional markers. Most morphological material 
is thus suffixed. Prefixed elements are found in the noun phrase, and are usually 
analytical; they include attributes of various kinds — definite and indefinite articles, 
adjectives, prepositions, and genitive-possessive nouns. On the verb, the only 
regular prefixed structure are the aspectual markers borrowed from Slavic lan- 
guages in the Romani dialects of Slovakia, Poland, Russia and the Baltic regions. 
Similar structures are emerging in the Romani dialects of Bulgaria, Macedonia as 
well as Greece and Austria, modelled on or directly borrowing aktionsart prefixes 
of the respective contact languages. Most grammatical morphemes have their fixed 
positions and there is a clear distinction in this respect between bound and unbound 
morphemes. A possible exception and a case for cliticisation is the set of more 
conservative nominative pronominal forms in (l)o etc. In some dialects, they occur 
exclusively in non-verbal predications, where they tend to accompany demon- 
strative, interrogatives, or adverbs (e.g. Balkan Romani kaj-lo? ‘where is he’), or 
in copula predications, while in other varieties they may also accompany lexical 
verbs and have rather flexible distribution patterns within the sentence (cf. German 
Romani phenas-o miro phral ap mande-lo ‘my brother [he] said [he] to me’). Only 
one Romani dialect, that of Epiros in northwestern Greece (Matras 2004), is known 
to have incorporated oblique personal pronouns as bound morphemes into the verb: 
dikhl-jom-os ‘I saw him’, dikhl-jom-i ‘I saw her’. Compounding is extremely rare. 


3:3. Nominal categories 


There are two genders, masculine and feminine, which are always distinguished 
formally in the singular. The plural is also sensitive to gender distinctions, though 
not in personal pronouns or in adjectival-attributive inflection, and in many dia- 
lects not in the definite article or in oblique forms of the noun, either. Nominal 
declension paradigms are otherwise quite stable and are generally clearly distin- 
guished for gender and number. 

Declensions are represented at the so-called Layer 1 level of nominal case. This 
is for most classes the level at which fusional case markers are direct descendents 
of Old and Middle Indo-Aryan case markers: The masculine singular nominative is 
expressed either by a vocalic ending -o or by the consonantal ending of the word 
root, the masculine singular oblique is expressed by -es- or by -as-, the feminine 
singular nominative is expressed by -i or by the consonantal word root, and the 
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feminine singular oblique is expressed by -a-. Plural formation has -e in mascu- 
lines ending in a vowel and -a in most other cases. In addition to the various de- 
clension classes for pre-European nouns, a set of declension classes exists that 
specialise in European loans (including most Greek loans). These vary somewhat 
among the dialects, both in the retention of differences and the specific shape of the 
morphs. The historical classes are masculine nominatives in -os, -us, -o and -is, and 
feminine nominatives in -a. Oblique forms are generally subject to contamination 
between the European nominative and the pre-European oblique forms, with a 
strong tendency toward analogies to the pre-European forms, while plural endings 
vary considerably, with dialects often borrowing plural endings from contempor- 
ary contact languages. 

Layer 2 case markers consist of a fixed set of agglutinating markers that follow 
the fusional oblique stem of the noun. Cases that are distinguished at this level are 
the Dative (which encodes the beneficiary and recipient), and Locative (which en- 
codes location, possessor, target, and is also used as a default prepositional case), 
the Ablative (source or substance), Sociative-Comitative (also used as Instrumen- 
tal), and the Genitive. As in other Indo-Aryan languages, the Romani Genitive 
shows adjectival agreement with the head: rakl-es-ker-o dad ‘the boy's father’, 
rakl-es-ker-i daj ‘the boy’s mother’. 

Personal pronouns show their own individual oblique forms: me ‘T , man- "me: 
ov ‘he’, les- ‘him’. The regular Layer 2 markers then attach to the oblique pronomi- 
nal stems: man-sa ‘with me’. Possessive forms of the first and second person differ 
from those of the nominal paradigm, showing an old possessive formation in -ro- 
attached to the oblique stem: miro ‘my’, tiro ‘your’, amaro ‘our’, etc. 

Animacy is of relevance to the marking of direct objects. Animate direct objects 
take the unmodified oblique case, which is often referred to as an ‘accusative’. In- 
animate direct objects take the nominative. The oblique also functions as the case of 
the possessor in most dialects, and sometimes also of the external possessor, while 
in some dialects it may also accompany a preposition, indicating an indirect object. 

Definiteness is marked by preposed definite articles which agree with the noun 
in gender, number, and case: o graj nasl-o ‘the horse ran away’. Indefiniteness may 
be marked or unmarked, depending on the topical status of the noun. Adjectives 
and demonstratives are generally preposed and agree with the noun in gender, 
number, and case: mir-o bar-o phral ‘my big brother’, aka-ja phur-i dZuvl-i ‘this 
old woman'. 

Attributive agreement (as used in definite articles, demonstratives, and adjec- 
tives) is reduced to marking the gender of singular nouns while plurality is marked 
without distinguishing gender, and in the domain of case to marking the Layer 1 
contrast between nominative and oblique. Only some dialects extend attribute 
agreement to the full set of Layer 2 case markers. 

Romani shows an exceptionally complex, four-term system of demonstratives. 
The set encodes the immediate presence/absence of the referent (often understood 
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as proximity/remoteness, but usually defined in terms of the location of the referent 
in the physical setting of the conversation or in the abstract realm of the discourse), 
as well as the degree of specificity or (dis-)ambiguity of the referent. Thus we find 
adava ‘this one (immediately visible and unambiguous)’, akava ‘this one (immedi- 
ately visible, distinguished from other potential candidates)’, odova ‘that one 
(aforementioned and unambiguous)’, and okova ‘that one (aforementioned, distin- 
guished from other potential candidates)'. The forms themselves are subject to nu- 
merous analogies, re-shaping and re-duplication processes. 


3.4. Verbal categories 


Romani verbs are derived from verbal roots (xa- ‘to eat’), from de-nominal stems 
(dand-ar- ‘to bite’, from dand ‘tooth’) or from de-adjectival stems (bar-jov- ‘to 
grow’, from bar-o ‘big’). The verb stems consists of the lexical root, which may be 
followed by a derivation affix. The most common derivational affixes are the in- 
transitive form -jov- (derived from the auxiliary jov-, from *bhuv- ‘to become") 
which is often contracted to -jo- or even just -o-, and the transitive forms -ar-, -av- 
(also -er-, -ev-), -ker-, -arker-, -avker-. A younger, alternative intransitive marker 
is -áv-, derived from the auxiliary av- ‘to become’, from av- ‘to come’. Transitive 
affixes attach to intransitive, de-nominal and de-adjectival roots to form transitive 
verbs: sikhlj-ar- ‘to teach’ (sikhlj- ‘to learn’), dar-av- ‘to frighten’ (dar- ‘to fear’), 
bar-av- ‘to enlarge’ (bar-o ‘large’). In some dialects they may also attach to transi- 
tive verb roots to form causative verbs: dar-av-av- ‘to cause somebody to frighten’. 

Borrowed verb stems are integrated by attaching a derivational vowel which 
mediates between the stem and tense-aspect inflection. The original inventory of 
loan verb adaptation markers appears to have consisted of a combination of a Greek- 
derived affix representing the inflectional class of the Greek verb — this model was 
set, apparently, during contact with Greek in the Byzantine or ‘Early Romani’ 
period — and an inherited verb-derivational marker that indicated transitive or in- 
transitive valency. Traces of this old system still appear in some dialects: vorb-is- 
ar-el ‘he speaks’ (vorb- from Romanian ‘to speak’, -is- representing the Greek aor- 
ist, -ar- transitive derivation), beklet-is-ker-d-jom ‘I waited’ (Turkish bekled-, 
Greek aorist -is-, inherited transitive -ker-, past-tense -d-). Most dialects have sim- 
plified the inherited system and retain just one or two loan verb adaptation markers, 
often representing either derivational affixes (-ev-, -er-) or Greek-derived inflec- 
tional markers (-iz-, -in-) or combinations of both (-is-ar-, -is-dv-). In a number of 
dialects there are tendencies to incorporate some loan verbs along with their original 
tense-aspect and person inflection. This is common especially with modal auxili- 
aries. With lexical verbs the system is consistent especially in the Balkan and Black 
Sea dialects of Romani that have been under considerable influence of Turkish (and 
other Turkic languages, such as Tatar). Here, Turkish verbs often carry their entire 
Turkish inflection, resulting in an etymological split in inflection classes. 
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As in other Indo-Iranian languages, Romani verbs show present and past stems. 
On the whole, the forms are similar to the present forms and only a few verbs show 
a distinction, e.g. sov- ‘to sleep’, past su-t-; per- ‘to fall’, past pe-I-. The character- 
istic feature of the past (also: perfective) stem is usually just the added augment or 
perfective marker, as in ker- ‘to do’, past stem ker-d-. The perfective marker de- 
rives from the historical marker *-t- which has taken on the forms -t-, -d-, and -I-, 
depending on environment (i.e. the final segment of the verb root). The present 
stem shows a split between vocalic roots and consonantal roots, which affects the 
shape of the person or subject concord marker that immediately follows it: xa-s 
*you eat', but phen-es *you say'. In some dialects, new inflection classes are 
formed through contraction of the intransitive derivation (dikh-el ‘he says’, 
dikhj-ol ‘it is seen’) or the loan verb adaptation marker (vorb-il “he speaks"). In the 
past stem, inflection classes are identified by the distribution of the individual per- 
fective markers named above. The group of mono-consonantal verb roots consti- 
tutes an independent inflection class. Additional markers/classes are verbs of mo- 
tion and change of state, including mediopassive, which often take the perfective 
marker -i/-, and those that end in -a and express senses/emotions, such as dara- ‘to 
fear’ and asa- ‘to laugh’, which have a variety of perfective markers, including 
-ndil- (thus asa-andil-jom ‘I laughed’). The present and past stems each have sep- 
arate set of subject concord markers. Those used with the present verb stem are di- 
rect descendants of Old and Middle Indo-Aryan person markers; Romani is one of 
the most archaic New Indo-Aryan languages in preserving this set. The set of per- 
son suffixes used with the past or perfective stem derive from enclitic oblique pro- 
nouns which originally identified the agent in an ergative construction and became 
re-analysed as subject concord markers in a new conjugation, a development that is 
typical of many of the Indo-Iranian frontier languages. 

The verb has two external tense markers, which follow person endings: The 
suffix -a attaches only to verbs in the present stem and indicates either the indica- 
tive present-future, or specifically the future or conditional. The suffix -as (also 
-ah, -ahi, -s) indicates a ‘remote’ perspective; combined with the present stem it 
forms the imperfect or habitual, and combined with the perfective or past stem it 
forms the pluperfect or counterfactual. Most Romani dialects lack an infinitive, 
though in some dialects, especially in central Europe, there is a tendency to gener- 
alise just one finite form in modal constructions and so to drop agreement between 
identical subjects in the main and complement clause. Many Romani dialects pre- 
serve a present-participle (gerundial) form of the verb; otherwise, verbs in clauses 
are generally finite. The finite form of the verb without any extension serves as a 
subjunctive. 

Apart from the verb ‘to go’, the copula is the only verb in Romani that is genu- 
inely suppletive. Future and subjunctive forms of the copula derive from the verbs 
‘to become’, ‘to come’ or sometimes ‘to stay’. The present set of the copula makes 
use of the same set of person concord markers as the perfective stem in lexical 


266 Yaron Matras 


verbs, indicating in all likelihood that the present copula derives from forms that 
had once been used for the past tense, while now the past tense of the copula is 
formed exclusively by adding the remoteness marker -as, without forming any sep- 
arate perfective stem. 


4. Syntax 


4.1. Word order type and alignment 


Romani generally displays the properties of an SVO language with a flexible, prag- 
matically determined word order. This flexibility is reminiscent of Greek. In decla- 
rative clauses, there are several slots to the left of the verb which may be occupied 
by (multiple) topics. The subject appears to the left of the verb when it is topical 
and when it helps establish the perspective on the proposition. The subject appears 
to the right of the verb when there is a tight, consequential connection between the 
sentence and the preceding context. Relative clauses follow their head nouns and 
contain resumptive pronouns wherever there is reference to the head in a non-sub- 
ject role. Pro-dropping occurs especially in the chaining of subjects, and in this 
respect Romani is a pro-drop language to a limited extent. Clauses are generally 
finite and subordinations are introduced by conjunctions, which are typically de- 
rived from interrogatives or markers expressing local relations, or borrowed. Like 
other Balkan languages, Romani shows a split between the factual and non-factual 
complementisers. The difference is indicated by the choice of conjunction that in- 
troduces the complement clause, as well as in the choice of verb mood (i.e. sub- 
junctive in non-factual clauses, indicative in factual clauses). The subject tends to 
precede the verb in factual clauses, but follows the verb in non-factual clauses. 

Romani displays accusative alignment. However, its verbal system shows 
some traces of ergativity in earlier stages of the language. Thus the past-tense 
conjugation is derived from a set of oblique enclitic pronouns which once indicated 
the agent. In the more conservative dialects, primarily in southeastern Europe, 
there is still a tendency for intransitive verbs of motion and change of state (includ- 
ing mediopassives) to show participial (adjectival) agreement in the third person, 
rather than person inflection (deriving from agentive morphology), thus: ov/oj 
kerd-jas ‘he/she did’, but ov gel-o ‘he went’, oj gel-i ‘she went’. 


4.2. Order inside the nominal phrase 


Attributes generally precede their head nouns. Definite and indefinite articles and 
demonstratives usually occupy the first slot in the noun phrases, followed by nu- 
merals, then by genitive-possessive nouns, then by adjectives. For emphasis, ad- 
jectives and genitive-possessive nouns may be placed after the noun, and in some 
dialects this is true of demonstratives, too. 
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Table I: Typological features of Romani 


structural features realisation in Romani 

animacy differential object marking 

gender two — masculine vs feminine — in nouns, singular attributes, 
and non-finite verbs in the 3rd person singular 

definiteness pro-clitic definite article and indefiniteness marker 

number singular, plural 

morphological case present 

syntactic case marked by adpositions 

possessive constructions 1) attributive 


2) predicative: no etymological habeo-verb 
3) external possession 


aspect non-perfective, perfective 

tense future, remoteness 

copula suppletive in subjunctive and future 

mood morphological indicative, subjunctive and imperative; 

passive morphological: intransitive derivation (mediopassive), 
reflexive/passive; analytical. 

adpositions prepositions 

adverbs small number of primary adverbs 

adjectives agree with their heads (in attribution and predicatively) in 


gender, case and number; analytical and synthetic 
comparatives coexist 


word order pragmatically free SVO with Adj N, Gen N, Prep N, Dem N, 
Num N and N Rel 

alignment accusative 

5. Dialects 


Geographical dispersion and the influence of diverse contact languages have led to 
the emergence of considerable difference among the dialects of Romani. Most of 
these differences are thought to have emerged during the period of settlement in the 
various regions of Europe, between the migrations of the fifteenth century and the 
period of dense attestation in the eighteenth century, by which time the overall pic- 
ture of dialect differentiation was very similar to the present-day pattern. Support 
for this hypothesis can be found in the geographical distribution patterns of isog- 
losses. A dense cluster of isoglosses, referred to as the Great Divide (Matras 2005), 
separates Romani dialects on both sides of what was, during the relevant period, 
the border zone (and war zone) between the Austrian and Ottoman empires. There 
are both innovations and conservative features on both sides of the Divide. Other 
geographical distribution patterns separate a "Black Sea' zone of innovations in 
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and around Romania and northern Bulgaria, a western zone with its centre in and 
around Germany, as well as a centre-periphery dynamic. On the basis of the devel- 
opments mainly in morphological paradigms, it is possible to apply a classification 
grid to Romani dialects which recognises the following groups: Northwestern, 
Northeastern, Central, Vlax (Transylvanian-Vojvodina), and (southern) Balkan 
(cf. ElSik and Matras 2006, Boretzky and Igla 2004). Typical isoglosses that divide 
the Romani-speaking landscape pertain to the status of prothetic segments 
(j-ov, v-ov vs. ov ‘he’, a-sa vs. sa- ‘to laugh’), palatalisation and affrication (tikno 
vs. cikno ‘small’, gi vs. dži ‘soul’, geljas vs. gejas 'gone"), the distribution of a set 
of lexical variables, and reduction and analogical formations within morphological 
paradigms. Once speakers adjust to a monolingual mode of discourse, it is gen- 
erally possible for Romani speakers east of the Great Divide (from Greece to the 
Baltics) to understand one another, while the (much smaller) population of speak- 
ers to the west of the Great Divide speak dialects that are more fragmented. 
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1. Introduction 


Signed languages are the natural, visual-gestural languages of Deaf communities. 
They are not universal — they differ from region to region. Even where a common 
spoken language is used, the signed languages may differ (e.g. while Dutch is 
spoken in Flanders and the Netherlands, there are two independent signed lan- 
guages in these territories, Flemish Sign Language and Sign Language of the Ne- 
therlands). This illustrates how signed languages have evolved independently from 
spoken languages, though there is also evidence of language contact (e.g. the role 
of mouthing in signed languages). Despite a shared history of suppression since the 
1880s, European signed languages have survived and evolved. Signed languages 
are minority languages in the territories in which they exist. There are also regional 
signed languages, for example, Catalonian Sign Language and Spanish Sign Lan- 
guage co-exist in Spain. In the 17th century, French approaches to deaf education 
were adopted in many countries, with the result that elements of old French Sign 
Language influenced local signed languages. 

Modern sign linguistics was pioneered by Tervoort's (1953) doctoral disser- 
tation regarding the signing of deaf children in the Netherlands and Stokoe's 
(1960) description of the linguistic structure of American Sign Language (ASL). In 
the 1970s, several European linguists began to independently study their local 
signed languages. It is often thought that this arose as a result of ASL research but 
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personal communication with some of these European pioneers has revealed that 
this is not the case (e.g. Brita Bergman (Sweden), Virginia Volterra (Italy), Jim 
Kyle and Bencie Woll (UK), Trude Schermer (Netherlands), Christian Cuxac 
(France), etc.). Instead, there seems to be a link between the beginning of sign lin- 
guistics and local responses to deaf educational outcomes. In the 1970s and 1980s 
it became clear(er) that a (strict) oral approach to deaf education did not always 
yield the desired outcome and a renewed interest in the use of signs/signed lan- 
guages arose. At least in some European countries there seems to be a link between 
this evolution and the onset of signed language research. 

An analysis of European publications from the 1980s and early 1990s reveals 
two different traditions. Following Karlsson (1984), we can label these the "oral 
language compatibility view" and the "sign language differential view" (also see 
Woll 2003 for a discussion of modern and post-modern approaches to sign lin- 
guistics). The oral language compatibility view presupposes that most of what 
comprises signed language structure is in line with what is typically described for 
spoken languages (i.e. oral languages), which means that while signed languages 
are expressed in a different modality, they have a great deal of structural common- 
ality with spoken languages. The differential view suggests that signed languages 
are so unique in structure that their description should not be modelled on spoken 
language analogies. 

Accordingly, there are two research traditions in Europe, with a minority taking 
the differential view. The majority have concentrated on the similarities between 
spoken and signed languages, emphasising their underlying common identity. 
Characteristics that make signed languages unique were often ignored, minimised, 
or interpreted as being comparable to certain spoken language mechanisms. The 
tools of spoken language research (theories, categories, terminology, methodol- 
ogies, etc.) were considered appropriate for the analysis and description of signed 
languages. There are many reasons why this approach dominated, the most import- 
ant of these relating to the contemporaneous questions about the status of signed 
languages. Researchers aimed to show that signed languages were natural lan- 
guages, stressing points of commonality with spoken languages. Since the turn of 
the millennium, this clear-cut distinction between the two approaches has become 
blurred, with greater recognition of the unique characteristics that arise from, for 
example, modality differences. 

The increased recognition of signed languages has impacted to some degree on 
the status of signed languages. For example, since 1988, the European Parliament 
has twice passed resolutions on signed languages and called on member states to 
formally recognise their indigenous signed languages (1988, 1998) and the Euro- 
pean Commission funded a report examining the status of signed languages in the 
European Union (Kyle and Allsop 1997). The Council of Europe's Parliamentary 
Assembly has passed a resolution on signed languages, calling for their protection 
in member states (2003) and has published a report on the status of signed lan- 
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guages in the Council of Europe territories (Timmermans 2005). The role of signed 
languages in education is enshrined in UNESCO’s Salamanca Statement (para- 
graph 2.1) while the UN’s Standard Rules on the Equalisation of Opportunities for 
Persons with Disabilities make a clear statement about access to information and 
communication as well as referring to the role of signed languages in education 
(Rule 5). While many European states have now formally (legally, constitutionally, 
or in a de facto manner) recognised their indigenous signed languages, many others 
have not, and for many of those with legal recognition in place there is still some 
way to go before signed language users enjoy the same protection as afforded to 
speakers of minority or regional languages. It is striking that despite the wealth of 
information about the linguistic and sociolinguistic status of signed languages, the 
Council of Europe’s Charter for Regional or Minority Languages (CRML) has still 
not been applied to the indigenous signed languages of Europe. 

Some European signed languages have been studied in relative detail, e.g. Brit- 
ish Sign Language (see the work of Margaret Deuchar, Jim Kyle, Bencie Woll, 
Mary Brennan, and Rachel Sutton-Spence), French Sign Language (mainly by 
Christian Cuxac and his collaborators), Swedish Sign Language (see the work of 
Inger Ahlgren, Brita Bergman, Lars Wallin, and Anna-Lena Nilsson), Danish Sign 
Language (Engberg-Pedersen), and Italian Sign Language (e.g. Virginia Volterra 
and Elena Pizzuto). Many others are under-described (e.g. many eastern European 
signed languages) or may have been described locally, with the relevant studies not 
being published in international publication organs, though. Given this situation, it 
is currently not possible to present an exact typology of signed languages, even at 
the European level. In the following sections, therefore, we provide some cross- 
linguistic comparisons pointing out similarities and differences that have been 
documented for European signed languages. 


2. Accounting for similarities across signed languages 


The fact that signed languages are expressed in space, making use of manual and 
non-manual articulators, gives rise to a high degree of visual imagery, iconicity, 
and simultaneity of articulation. These modality-specific properties allow for a cer- 
tain degree of similarity across signed languages in terms of their linguistic struc- 
ture. The fact that signed languages appear to be typologically more similar to each 
other than spoken languages has been attributed to a number of different reasons, 
both linguistic and sociolinguistic. Amongst the factors that have been suggested 
are the following: the visual-gestural nature of signed languages, the absence of a 
written form, the unique contact features of signed and spoken languages, and the 
atypical acquisition patterns experienced by most deaf children (Johnston 1989: 
211). Moreover, Woll (2003: 25) writes: 
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“If sign languages are really as similar as has been claimed, we need to enquire why that 
may be so. Five mutually compatible and probably overlapping reasons suggest them- 
selves: 

* the relative youth of signed languages (including creolisation) 

© iconicity 

e a link between sign languages and gesture 

* linear syntax intrinsically creates greater differences than spatial syntax 

e differences are there but researchers haven't noticed them. 

Here, we will focus on five characteristics that illustrate similarity across European 
signed languages: (1) the use of space for referent tracking, (2) “classifier construc- 
tions", (3) non-manual sentence marking, (4) simultaneity and (5) the use of so- 


called *verb sandwiches". 


2.1. The use of space for referent tracking 


There is a two-way distinction with regard to how signers refer to referents that are 
present versus those that are not. Where a referent is physically present, a signer 
will typically point to the location where the referent is situated. In contrast, where 
the referent is not present, the signer can “establish a locus”, i.e. associate a certain 
area in the signing space with the non-present referent. By pointing towards this 
locus, the signer effectively creates and tracks referents across the discourse. The 
choice of loci is frequently motivated. For instance, a signer may refer to an absent 
colleague by pointing towards his/her office desk, whereby the locus of the empty 
desk functions as co-referential with the absent colleague. Where there is a per- 
ceived status differential between referents, a signer typically associates the hig- 
hest-ranking referent at a locus that is higher in signing space than the other refer- 
ents. Indexing toward the relevant loci provides the means for referring back to the 
individual referents. For example, the World Federation of the Deaf may be located 
higher in signing space than the European Union of the Deaf, which in turn would 
be located higher in space than a national Deaf association, reflecting the global 
status of each organisation by metaphoric extension in signing space. 

Loci are also used in verb agreement. Some verbs allow for alteration of the di- 
rection of their movement in order to reflect changes in their argument structure. 
GIVE, for example, can be signed with a movement starting from the agent's locus 
towards the patient's locus. This means that in order to express the meaning “I give 
X" the verb sign GIVE may be produced with a movement away from the signer's 
body, whereas for *X gives me" the articulation of GIVE often involves a move- 
ment towards the signer's body. This seems to be the case for all documented 
(European) signed languages. 
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D “Classifier constructions” 


Signed languages have the potentiality to encode information about space, path of 
movement, formal characteristics of the referent, and, sometimes, information re- 
garding animacy features within a single construction where the handshape par- 
ameter has been compared to classifiers in spoken languages (Allan 1977). How- 
ever, the parallel between classifiers and classifier constructions in signed 
languages and one of the four types in Allan’s (1977) typology of spoken classifier 
languages, often quoted in (earlier) signed language publications, has (later) 
shown to be problematic. This, amongst other reasons, has led a number of sign lin- 
guists to argue against an analysis of the handshape in these so-called “classifier 
constructions” as classifiers (see Schembri 2003 for an elaborate discussion of this 
issue). 

Both Flemish Sign Language and Irish Sign Language encode a person mov- 
ing, e.g. a person walking in the street as seen by someone looking out of the 
window, in the following way: 


Figure 1. Classifier representing a person 


The person is represented by an index finger pointing upwards. In contrast, a mov- 
ing vehicle takes a different handshape. In the next example from Irish Sign Lan- 
guage, the vehicle is a motorbike, represented on the signer’s left hand, and it is 
moving towards a wall, represented by the signer’s right hand, but this handshape 
could equally represent (at least in Irish Sign Language) a car, bus, truck, bicycle, 
or boat. 
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Figure 2. Classifier representing a vehicle in Irish Sign Language 


As far as we know, all European signed languages do this, but as we shall see in sec- 
tion 3.2 below, there are differences in terms of handshape. We should also note here 
that some non-European signed languages do not make use of classifiers to the same 
extent. 


2.3. Non-manual sentence marking 


Non-manual features (NMFs) have been described as "specific contractions of the 
facial muscles, combined with head movements or positions and body movements 
or positions" that serve a linguistic function (Liddell 1980: 14). The category of 
NMFs includes puffed cheeks, furrowed brows, eyebrow-raise, pursed or stretched 
lips, and narrowed eye-gaze. Activity that occurs on parts of the body other than the 
hands is considered to be non-manual in nature. Prior to Liddell’s work in the late 
1970s, the extent of the role of NMFs was underestimated in the literature on signed 
languages. Today, the significance of the role of NMFs is beginning to be more fully 
appreciated and NMFs are recognised as serving important linguistic functions. 
Across European signed languages, there is a significant degree of overlap 
noted for NMFs that mark sentence types such as topic-comment structure, wh- 
questions, yes-no questions, negation, conditionals, etc. Similarities extend 
beyond functionality: the form of non-manual marking is also surprisingly similar 
across signed languages. We find that yes-no questions, for example, are typically 
marked by a cluster of features, of which raised eyebrows are the most important 
one. Furrowed brows, on the other hand, are the most salient NMF associated with 
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wh-questions across documented European signed languages. Interestingly, in 
recent work on Austrian Sign Language and Croatian Sign Lanugage it is claimed 
that the position of the head is the most salient feature of the non-manual marker 
for these two languages: polar questions are marked with "chin down", whereas 
"chin up" indicates content questions (Schalber 2006: 135-136). 


2.4. Simultaneity 


Signed language users have at their disposal a range of articulators that they can 
draw upon when expressing linguistic messages, including the hands, torso, eyes, 
and mouth. These articulators can work in tandem to produce one lexical item, 
though in other circumstances, they operate to convey different types of in- 
formation simultaneously. All (European) signed languages show different forms 
of simultaneity, which can be roughly divided into three groups: manual simulta- 
neity, manual-oral simultaneity, and multi-channel simultaneity involving non- 
manual articulators other than the mouth. 

Manual simultaneity occurs when each hand conveys different information. 
This may take the form of "full simultaneity", i.e. the simultaneous production 
of different complete signs. Alternatively, one hand may hold the end-state of a sign 
in situ while the other hand continues to produce one or more signs. This way, a ref- 
erent or piece of information can be kept visible/active while something is predi- 
cated of it or brought into a certain (spatial) relationship with it by the signs pro- 
duced by the other hand. This can be seen in the following example from Irish Sign 
Language, where the signer is talking about a holiday in Norway. Here, the signer's 
left hand represents a foot in the snow, while the right hand is captured mid-com- 
ment, saying that there is "really snow (on the ground where the foot is situated)". 


Figure 3. Manual simultaneity in Irish Sign Language 


With full manual simultaneity, one of the two signs that are simultaneously pro- 
duced often shows a relatively simple form. This can be a numeral (handshape) or a 
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pointing sign, for example when a signer points at a person present in the room 
with one hand and simultaneously produces the signs MY BROTHER with the 
other hand, meaning “He/that is my brother". Examples of manual simultaneity 
also include simultaneous constructions involving classifiers, such as when the two 
actors in a motion event are represented by a distinct classifier on either hand. 

One form of manual-oral simultaneity involves the mouthing of lexical items 
from a spoken language. Often mouthings are derived from the spoken equivalent 
of the manual sign they accompany, e.g. the manual sign BOY in British Sign Lan- 
guage in combination with the mouthing boy, but there are also examples where 
there is no correspondence between the mouthing and the meaning of the sign it 
accompanies. For example, in British Sign Language the mouthing boy can be 
combined with the sign LITTLE resulting in the meaning “little boy" (Sutton- 
Spence 2007: 157). Another form of manual-oral simultaneity involves the pro- 
duction of “mouth gestures" — idiosyncratic gestures produced by the mouth that 
are unrelated to the words of a spoken language, e.g. the production of a ‘fffff’ 
mouth pattern to refer to water running. 

As was shown in the previous section, multi-channel simultaneity involving 
non-manual articulators other than the mouth occurs with non-manual sentence type 
marking. This type of simultaneity is also often related to the simultaneous 
expression of different points of view and the simultaneous expression of different 
elements of a multi-faceted event. In the following example from French Sign 
Language, two agents performing different actions in the same setting are being rep- 
resented. The first agent is a horse getting ready to jump over the fence. This agent is 
referred to by a set of non-manual articulators: eye gaze + facial expression + mouth 
gestures + body postures. The fence is represented by the non-dominant (left) hand. 
The dominant (right) hand refers to the second agent: a bird looking at the horse 
(Sallandre, 2007: 114—115; figure reprinted with permission from John Benjamins). 


Figure 4. Multi-channel simultaneity in French Sign Language 
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2:5: “Verb sandwiches" 


While the four characteristics of (European) signed languages discussed so far can 
be said to be modality-specific (i.e. associated with the visual-gestural means of 
articulating signed languages), other similarities go beyond the modality issue. 
So-called “verb sandwiches" are a good example. Fischer and Jannis (1990: 281) 
schematize this construction as: “SUBJECT VERB (OBJECT) (ADJUNCTS) 
VERB + LOTS-OF-INFLECTION ...”. In the prototypical example of a verb 
sandwich there are two instances of the same verb sharing the same subject (or 
agent) but with the second instance of the verb “carrying additional information’, 
e.g. a modification involving reduplication of the movement parameter of the sign 
indicating iterative aspect, as in the following example from Croatian Sign Lan- 
guage (Milković, Bradari¢-Jonéi¢ and Wilbur 2006: 184—185, figure reprinted with 
permission of John Benjamins): 


(1) ŽENA UZETI TABLETE UZETI++ 
woman take-down pill take++ 
“The/a woman takes down a (several) pills 


” 


UZETI* 
Figure 5. A verb sandwich in Croatian Sign Language 


However, in a related construction, sometimes called “verb echo”, the two in- 
stances of the lexical verb sign are identical. 

Verb sandwiches may also consist of a combination of a lexical verb and a re- 
lated "productive predicate" (see below). For instance, we might see the lexi- 
cal verb sign EAT in combination with the productive sign "eat-sandwich", 
which would occur at the end of the utterance (GIRL EAT SANDWICH “eat-sand- 
wich"): 
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EAT as signes a Flemish signer “eat-sandwich” as signed by the samer signer 


Figure 6. The lexical verb and related productive sign in a verb sandwich in Flemish Sign 
Language 


Lexical (or "established") signs are “those items which occur so frequently in the 
language that we can list them in dictionaries", while productive signs are those 
that are “ ... made up of several individual meaningful components which can be 
re-assembled in different ways to produce other signs" (Brennan 1990: 181). Bren- 
nan goes on to note that the distinction between established and productive signs is 
not always clear cut, as the lexicon is constantly in a change of flux, partly made up 
of established items, and partly productive in nature. 

“Verb sandwiches", or more specifically, the tendency for an agent to come first 
in a sentence, followed by an established lexical verb, followed by a second argu- 
ment, which in turn is followed by a second (modified) instance of the same verb or 
a related productive predicate sign seems to be shared by many European signed 
languages. Other similarities across European signed languages that are unrelated 
to the modality include the absence of the copula and a striking preference for 
clause-peripheral positioning of the question sign in content questions: question 
signs typically appear clause-initially, clause-finally, or in both positions (“doubl- 
ing" or “echoing” of question words). 


3. Some typological differences across European signed languages 


The early days of signed language research focused on a small number of signed 
languages (typically American Sign Language, Australian Sign Language, and 
western European signed languages). The turn of the millennium, however, has seen 
a surge in research on previously un(der)-studied signed languages, including many 
non-western signed languages (e.g. Indo-Pakistani Sign Language, Adamorobe 
Sign Language, Jordanian Sign Language, Hong Kong Sign Language). Within Eu- 
rope too, the process of documenting the grammars of a growing number of signed 
languages allows for greater in-depth comparison. Some of the more recent cross- 
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linguistic comparisons seem to confirm the high degree of similarity across signed 
languages, whereas other studies seem to point at more variation than was earlier re- 
cognised. It is clear that there is a need for much more comparative work — and for 
yet more signed languages to be documented before more conclusive answers can 
be given. Here we look at some aspects of variability that have already been iden- 
tified. We refer the reader to Perniss, Pfau and Steinbach (2007a) for more in- 
formation and examples on differences across signed languages. 


3.1. Auxiliary-like elements 


A number of auxiliary-like elements have been described for several European 
signed languages, including Sign Language of the Netherlands, Flemish Sign Lan- 
guage, Irish Sign Language, German Sign Language, Catalan Sign Language, and 
Greek Sign Language. At least part of these auxiliary-like elements express the 
role of arguments and they do so in two possible ways: (i) by their position in the 
sentence, (ii) by an alteration to the phonological property of the auxiliary (i.e. 
a change in direction of movement) to mark for agreement (cf. the discussion of 
verb agreement in section 2.1). 

Perniss, Pfau and Steinbach (2007b: 16) present an example from German Sign 
Language, showing the use of the auxiliary sign glossed as PAM (person agree- 
ment marker): 


(2) YESTERDAY INDEX, TEACHER INDEX,, ANGRY ;PAM,, 
“Were you angry with the teacher yesterday?" 


As can be seen from the transcription, PAM is modified so that the starting point of 
its movement refers to the (locus for the) second person (“locus 2") and the end 
point refers to the locus for the teacher (“locus 3b") (see the discussion of the use of 
loci in section 2.1.) This is an example of (ii), an alteration to the phonological prop- 
erty of the auxiliary, which results in PAM carrying the spatial agreement which the 
body-anchored predicate, ANGRY, cannot carry. However, in German Sign Lan- 
guage PAM can also combine with agreement verbs, resulting in a double agree- 
ment case. Such double marking is attested in a number of other signed languages 
but has been claimed to be ungrammatical in others (e.g. in Greek Sign Language). 

According to Steinbach and Pfau (2007: 327), signed language "agreement 
auxiliaries" are grammaticalised forms of lexical elements and there are three 
types of lexical sources for them: pronouns, nouns, and verbs. Some signed lan- 
guages show only one of these forms (e.g. PAM in German Sign Language where 
pronouns are the lexical source and the agreement auxiliary ACT-ON in Sign Lan- 
guage of the Netherlands where the source is the verb GO-TO), whereas others 
show all three types (e.g. Catalan Sign Language). 

As can be seen from the example, in German Sign Language PAM appears at 
the end of the sentence, whereas in a number of other European signed languages 
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auxiliary-like elements precede the verb (e.g. GIVE-AUX in Greek Sign Language 
and Catalan Sign Language) or the patient/object (e.g. Flemish Sign Language and 
Irish Sign Language). In the latter instance, the auxiliary-like element is not always 
modified for spatial agreement. For example, in Irish Sign Language, the sign TO 
is used by some signers and it does not inflect spatially to carry agreement. That is, 
TO does not move in signing space, it is body-anchored. In Flemish Sign Language 
there are several auxiliary-like elements, some of which behave like the Irish Sign 
Language TO (e.g. ON), whereas at least one other (GIVE) allows for spatial 
agreement with one or two arguments of the verb. We note that examples like TO in 
Irish Sign Language and ON in Flemish Sign Language, which are related to prep- 
ositions, are not included in Steinbach and Pfau's analysis of (grammaticalisation 
of) agreement auxiliaries. 


3.2: Classifiers 


As we noted in section 2.2, there are several categories of classifiers that have been 
reported for European signed languages. These include (but are not limited to) (1) 
handling classifiers (where the handshape represents the handling of a referent) 
and (ii) whole-entity classifiers (where the handshape refers to a referent as a 
whole). There are very few differences in terms of form and function for handling 
classifiers across documented European signed languages. However, some vari- 
ation has been reported for whole-entity classifiers. 

For example, the two classifier handshapes which represent vehicles and per- 
sons described in section 2.2 above are not common across all Buropean signed 
languages. While all vehicles are represented using the flat handshape with the 
palm facing sidewards in Irish Sign Language (recall Figure 2), four-wheeled ve- 
hicles in many other European signed languages are referred to by using a flat 
handshape with the palm facing downwards, as illustrated in Figure 7: 


Figure 7. Classifier representing four-wheeled vehicles in different European signed 
languages 
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In German and Flemish Sign Language, and possibly others, both forms are used 
but for different purposes. The flat handshape with the palm facing downwards is 
used to represent vehicles with four wheels (e.g. cars, buses, trucks), whereas the 
flat handshape with the palm facing sidewards is used to represent vehicles with 
two wheels (e.g. bicycle, motorbike). 

In Irish Sign Language, the index finger representing a person is also used 
when discussing the interaction of two persons, e.g. one person following the 
other: 


Figure 8. Classifiers representing two interacting persons in Irish Sign Language 
(and some other signed languages) 


However, in other signed languages, including Flemish and Danish Sign Lan- 
guage, in this and similar situations, i.e. when one individual is seen in contrast to/ 
competition with/excluded from one or more others, the A-hand with extended 
thumb is used to refer to a person rather than the index finger. 


3.3. Constituent order 


The analysis of word order in terms of Subject- Verb-Object is not without contro- 
versy in signed language research. Some researchers claim that applicability of an 
analysis in terms of S, V and O is limited and/or that a topic-comment analysis 
is most fitting (Deuchar 1983 for British Sign Language, Jantunen 2008 for Finn- 
ish Sign Language). Nevertheless, for many signed languages basic word order 
has been identified in terms of Subject- Verb-Object ordering, demonstrating that 
SVO and SOV are prominent orders. For example, Croatian Sign Language (Mil- 
kovié, Bradari¢-Jonéi¢ and Wilbur 2006) and Swedish Sign Language (Bergman 
1994) are said to have an SVO basic word order. Signed languages that are de- 
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scribed as having an SOV order include Sign Language of the Netherlands, Ger- 
man Sign Language and Italian Sign Language (Pfau and Bos 2008). In an earlier 
study on Italian Sign Language, however, Volterra et al. (1984) found a prevalence 
of the SVO order, especially in sentences with reversible arguments. Some recent 
publications list Italian Sign Languages as SVO, referring to the Volterra et al. 
study. 

For a number of (other) signed languages, the verb type influences word order. 
For Irish Sign Language, for example, a semantic actor takes first place, followed 
by a plain verb, followed by a patient (i.e. an SVO ordering) while the situation is 
more complex when agreement verbs are used (see section 2.1). Here, a topic is es- 
tablished, followed by an agreement verb whose onset is typically associated with 
the actor/subject role and the offset point of the verb is associated with the patient/ 
object role. Research on Flemish Sign Language points out that productive verb 
signs as well as established lexical verbs "incorporating" the object (either because 
the object is referred to on the non-dominant hand or because the location or orien- 
tation of the verb sign is modified so as to refer to the location of the object) are 
produced sentence-finally, whereas the verb in an SVO-construction is most often 
a plain lexical verb. Claims concerning constituent ordering which varies depend- 
ing on the kind of verb used by the signer have been made for a number of Euro- 
pean (and other) signed languages. This seems to be an area for cross-linguistic 
similarity as well as variation awaiting further cross-linguistic and typological 
study. 


3.4. Non-Manual Features (NMFs) 


The functionality of NMFs is far-ranging in signed languages. Here, we outline 
some of the specific differences that have been noted with respect to the marking of 
negation across several Buropean signed languages. 

Most documented signed languages are reported to have non-manual marking 
for negation that co-exists with manual markers of negation. For some, manual 
markers of negation are not obligatory and may be dropped, allowing for purely 
non-manually marked instances of negation (e.g. Irish Sign Language, Flemish 
Sign Language, British Sign Language, and Swedish Sign Language, although, 
according to Bergman (1995: 100), in Swedish Sign Language “a manual negation 
is obligatory in clauses with auxiliaries, and in sentences with clausal comple- 
ments"). Other signed languages allow for non-manual markers of negation to be 
dropped, with the manual markers alone marking negation (e.g. Turkish Sign Lan- 
guage and Italian Sign Language). In yet other signed languages, including Ger- 
man Sign Language, both manual and non-manual markers must co-occur, with a 
negative headshake extending across the predicate (Perniss, Pfau and Steinbach 
2007b; Pfau and Quer 2007) (Cf. Perniss, Pfau and Steinbach 2007b; Zeshan 2006 
for discussion regarding a range of signed languages.) 
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We also note that while a negative headshake has been reported as a marker of 
negation for all European signed languages, there are also some reports of a back- 
ward head tilt being used in a small number of signed languages such as Greek 
Sign Language and Turkish Sign Language (Zeshan 2006: 42—43). It is suggested 
that this may be a grammaticalisation of a culture-specific gesture now embedded 
in these signed languages (Perniss, Pfau and Steinbach 2007b). 

Another difference noted relates to negative concord. Some signed languages 
allow for two manual elements marking negation in one utterance without this 
impacting on the negative interpretation of the sentence (e.g. Catalan Sign Lan- 
guage). This is ungrammatical in other signed languages (e.g. German Sign Lan- 
guage; Pfau and Quer 2007). 


4. Concluding remarks 


Summing up, we can say that in the past, “cross-linguistic work" often entailed a 
comparison of signed languages with spoken languages. Since the turn of this cen- 
tury, an increase in cross-signed language comparison has led to a stage where 
a small number of signed languages are being compared and the first typological 
projects involving a larger number of signed languages are emerging (e.g. Zeshan 
2006; Zeshan and Perniss 2008). But there are many challenges to conducting 
large-scale comparative work in this domain, not least, as noted earlier, due to 
the lack of documentation for many signed languages. Although today an increas- 
ing number of signed languages are receiving attention, these remain under-studied 
for the purposes of typological description. Another challenge is the fact that for 
many phenomena no consensus on labelling has emerged, leading to a multiplicity 
of terminologies. This lack of agreement regarding terminology, coupled with the 
lack of a written form for signed languages or a universally adopted, conventional- 
ised system for notation, complicates the process of interpreting the outcomes 
of seemingly similar studies. Are, for example, the purported differences between 
signed languages real, or are they the artefact of different interpretations of the raw 
data? 

Another issue in contemporary sign linguistics concerns the analysis of signed 
language structure as incorporating both linguistic and "gestural" elements. 
A growing number of researchers propose to analyse signed languages as hetero- 
geneous systems in which meanings are conveyed by using a combination of el- 
ements, rather than as homogeneous systems where all major elements of signing 
behaviour are considered to be equal parts of a morphosyntactic system (Schembri 
2001). One idea emerging from this strand of research is that when the communi- 
cation of signers and speakers is compared, we should consider speech plus co- 
speech gesture rather than speech alone, or a comparison of signed languages to the 
written form of spoken languages (cf. Vermeerbergen and Demey 2007). The fact 
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that sign linguists are addressing such issues speaks to the fact that the linguistic 
status of signed languages is now clearly established. Probably also related to this 
is the fact that linguists are increasingly taking account of the complex socioling- 
uistic contexts that signed languages exist in. One of the key factors is the acquisi- 
tion path. With some 90—95 96 of deaf children born to non-deaf, non-signing par- 
ents, the acquisition of signed languages follows an atypical route for the majority 
of deaf children in Europe. Across Europe, there are differences in terms of the age 
at which deaf children first come into contact with signed languages. These are 
typically linked to educational systems and to the support mechanisms (or lack 
thereof) to support the learning of signed languages by parents (often linked to the 
official status of signed language in those territories). For example, since the 
1980s, in Scandinavian countries, early signed language intervention has meant 
that the acquisition path of deaf children from hearing families was more nor- 
mative vis-à-vis deaf children from deaf families (i.e. a “normal” path to signed 
language acquisition for native deaf signers) than in other parts of Europe. How- 
ever, in many other European countries, deaf children only come into contact with 
the local signed language when they are older. This entails the development of ges- 
tural elements for the purpose of communication with immediate family and care- 
givers in the interim period. Such so-called “home signing” thus occurs prior to the 
acquisition of a formal signed language. The possible impact of home sign systems 
on the structure of signed languages themselves remains to be fully understood. 

As Graham Turner notes, “as the study of signed language matures, we become 
increasingly confident about exploring those elements where the phenomena we're 
addressing may differ most strikingly from the patterns with which ‘mainstream’ 
linguistics is most familiar” (personal communication, 2007). This, it seems to us, 
neatly sums up where we are on the sign linguistics path to maturity. 
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II. Areal typology and language contact 


II.1. Areal typology 
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1. Introduction 


Until recently, ‘Standard Average European’ (or ‘SAE’) was not a standard term, 
more particularly, there was no awareness, let alone a consensus, about the hypoth- 
esis that Western Europe could be the home of a Sprachbund. It is fair to say that the 
“World of Linguistics’ has changed because of the so-called ‘EUROTYP’ project. 
Section 3 describes the basic hypothesis as it was formulated within that project, 
after briefly recalling the origin of the term 'Standard Average European' and 
sketching a bit of history (section 2). Section 4 describes three lines of research, each 
of which was explored in the period after the completion of the EUROTYP project. 


2: From Whorf to EUROTYP 


The phrase ‘Standard Average European’ was coined by Benjamin Lee Whorf in 
1939 in the following paragraph, quoted from a 1956 edition. 


Since, with respect to the traits compared, there is little difference between English, 
French, German, or other European languages with the POSSIBLE (but doubtful) ex- 
ception of Balto-Slavic and non-Indo-European, I have lumped these languages into one 
group called SAE, or "Standard Average European". (Whorf 1956: 138) 


The word *lumped' is important: Whorf was really only interested in native American 
languages, and lumping languages of Europe together was part of the argument that 
native American languages are very different. Still, there is a suggestion that some of 
the languages of Europe are similar in a non-trivial way, essentially, more similar than 
would follow on genetic grounds. This idea, however implicit, was around before 
and although various later linguists entertained the Standard Average European idea, 
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usually without the term and without reference to Whorf (e.g. Hock 1988, and see van 
der Auwera 1998b or Stolz 2006 for an overview), the idea and the term became 
prominent only with the project launched by the European Science Foundation 
(1990-1995) called *EUROTYP'. This project had its direct output in a subseries 
within the Empirical approaches to language typology series (Mouton de Gruyter), 
but there is a large body of papers and books that bear a more indirect relationship. 
Two of the most recent as well as prominent books in this category are Heine and 
Kuteva (2006) and the so-called WALS or World Atlas of Language Structures (Has- 
pelmath et al. 2005). Heine and Kuteva (2006) is a general discussion of the conver- 
gence of the grammars of the European languages, of the forces that led to this con- 
vergence, and of a few case studies. The WALS is not directly concerned with the areal 
typology of Europe, but with that of the world, thus also including Europe, and it owes 
its existence at least in part to the interest in areal typology that the EUROTYP project 
generated. The EUROTYP project also spurred the interest in areal work around 
the Mediterranean and Baltic Seas (see Walchli, this volume and Sanso, this volume), 
the results of which are often compared to those for Standard Average European. 


3. EUROTYP 


In this section I illustrate the SAE convergence with a relatively simple linguistic 
phenomenon, I then mention some more phenomena and I discuss some of the 
properties of these convergence phenomena. The simple phenomenon is that of the 
adverbial expression of the phases of states, more particularly, the phases for which 
English uses not yet, already, still, and no longer. These adverbials are polysemous, 
but the only uses of concern here are their phasal ones, illustrated in (1). 


(1) Mary is already in London. 
Mary is still in London. 
Mary is not in London yet. 


d. Mary is no longer in London. 


EE 


In van der Auwera (19982) I characterized the SAE system of phasal abverbials 
in terms of 12 features. I will illustrate just two of these. The first feature is very 
simple: in Standard Average European each of the meanings illustrated in (1) can 
be expressed by an adverbial expression. Map 1 shows the westernmost languages 
of Europe for which it was investigated whether or not the language has the four 
adverbials. All of these languages do indeed have four adverbials, except for Ice- 
landic and Albanian (they lack ‘already’). The conclusion I drew was that with 
respect to one of the properties of SAE phasal adverbials Icelandic and Albanian 
are less SAE than all the other languages shown on the map. The isogloss of Map 1 
reflects this. Note that the isogloss is open on the eastern side, meaning that the 
area actually continues. The open-endedness is symbolized by the dotted line. 


Standard Average European 293 


A second feature concerns the expression of the ‘no longer’ meaning. English 
chooses a strategy with a comparative and so does French (ne plus ‘not more’) or 
Dutch (niet meer *no more"). But Spanish uses a different strategy: it expresses "no 
longer’ with the adverb for ‘already’ (ya), which in combination with negation 
yields ya no ‘no longer’, literally ‘already not’. In some languages both strategies 
are available: thus Yiddish has both mer nit, literally ‘more not’, and shoyn nit 
*already not’ and they can even combine (shoyn mer nit). Map 2 shows three op- 
tions: (i) only a kind of comparative strategy, (ii) only an ‘already’ strategy, and 
(iii) both the comparative and the 'already' strategy. 

On the basis of Maps 1 and 2 anyone's hypothesis on what could be the SAE pha- 
sal adverbial system should be this: (1) SAE has adverbials corresponding to the four 
adverbials illustrated for English, (ii) SAE construes the ‘no longer’ meaning with a 
comparative and there is no alternative strategy. This hypothesis implies that at least 
with respect to two features, both Icelandic and Spanish are less SAE than French, for 
instance, for Icelandic lacks an ‘already’ adverb and Spanish construes ‘no longer’ as 
“already not’. Map 3 represents this: shading identifies languages with two out of two 
SAE features, the line identifies with languages with just one of two features. These 
notational devices of Map 3 are not isoglosses; the lines in this map show numbers of 
features, and I called them ‘isopleths’ (with pleth for plethora) and the resulting maps 
‘isopleth maps’ — Haspelmath (2001: 154) calls them ‘cluster maps’. 

In van der Auwera (1998a) I added another 10 features and for each isoglosses 
were drawn yielding a hypothesis of what would be the SAE strategy. One version 
of the resulting isopleth map is Map 4. Note that the map contains fewer languages 
than Maps | to 3: the reason is that it was not possible to determine for each lan- 
guage whether it did or did not exhibit the 10 additional features. 

Of course, phasal adverbials form just a minuscule atom of the fabric of any 
language, and the relevant properties of phasal adverbials are more lexical than 
grammatical. But for some morphosyntactic features, described in Haspelmath 
(1998, 2001; see also Stolz 2006 and Heine and Kuteva 2006: 23-27) a similar pic- 
ture emerges. (2) lists the features and Map 5 shows the isopleth map. (3) illustrates 
the less obvious e. to i. features. 


(2) Major SAE features (Haspelmath (1998, 2001) 
definite and indefinite articles 

relative clauses with relative pronoun 
*have'-perfects 

participial passives 

dative external possessors 

negative pronouns without verbal negation 
relative-based equative constructions 
non-pro-drop 

intensifier-reflexive differentiation 


ro mo eo ge 
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Map 1. Does the language have four phasal adverbials? 


{ 
AN 
53 
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Map 2. Does the language construe ‘no longer’ with a comparative (mem), 
with ‘already’ (eee) or only with a comparative (shading)? 
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Map 3. Does the language has only 1 SAE feature (en) or two (shading)? 


Map 4. SAE phasal adverbials: 211 features (shading), 29 features (meum ) or 27 
features (mmm ) 
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Map 5. SAE Map (Haspelmath 2001: 1505): 9 features (shading), 8 features 
(mmm ), 7 features (unn), 6 features (``). 


(3) e. A dative external possessor (dem Kind) in German 

Die Mutter wäscht dem Kind die Haare. 
the mother washes the-DAT child the hairs- 
"The mother is washing the child's hair.’ 

f. Anegative pronoun without the verbal negator not in English: 
I saw nobody. 

g. A relative-based equative in Catalan (Haspelmath and Buchholz 1998: 
291) 
La meva germana és tan bonica com tú 
the my sister is so pretty how you. 
‘My sister is as pretty as you.’ 

h. Non-pro-drop (independent subject pronoun necessary) in French 
Je vous voie. 
I you see 


‘I see you.’ 
i. Intensifier-reflexive differentation in Dutch 
Hij zelf wou  zich niet wassen. 


he himself would himself not wash 
‘He himself would not wash himself.’ 
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As is expected, the exact membership of the various isogloss and isopleth areas 
differs a bit for each phenomenon. This is very clear for the intermediate and low 
degrees of membership. If one compares isopleth maps 4 and 5 one can see for 
example that the North Germanic languages are more similar to the core SAE lan- 
guages than Iberian Romance on Map 4 and that it is the opposite on Map 5, or that 
Irish is included in the similarity clustering on Map 4 but not on Map 5. The core 
areas are more alike, but again not quite identical. Thus the shaded core of Map 4 
has Italian, but that of Map 5 does not have it. However, each core has at least 
French and German, and other continental Romance (Italian, Friulian, ...) as well 
as Germanic (Dutch, Frisian, ...) are often in the core as well, or at least very close, 
and typically closer than e.g. Spanish or English. For this reason I gave the core of 
SAE a label of its own, viz. ‘Charlemagne Sprachbund’ (van der Auwera 1998b: 
824—825), thereby suggesting that at least one factor that is responsible for the con- 
vergence is the contact during and after the ‘Barbarian Invasions’ and the cultural 
unity achieved by Charlemagne and consolidated later, in which French and Ger- 
man played a major role (see also Haspelmath 1998: 285, 2001: 1506-1507). 
However, there is no claim that there is only one process through which SAE 
and its Charlemagne core came into existence, especially not for the languages that 
are more peripheral. To illustrate the point with phasal adverbials: the fact that 
Basque, Faroese, Maltese and Romanian now all have an ‘already’ adverb is due to 
different borrowing processes: Basque borrowed from Spanish, Faroese from Da- 
nish, Maltese from Italian, and Rumanian from French (van der Auwera 1998a: 
68). And the fact that Yiddish can express ‘no longer’ as “already not’ (shoyn nit) 
means that we are dealing with a calque from Slavic (van der Auwera 1998a: 59) 
and the Upper Sorbian comparative strategy with wjace "more is probably a cal- 
que from German (Faßke 1980: 778). Or, as a final example, the Albanian ‘still’ 
word akoma comes from Greek, the Finnish one, vield, from Baltic, and the Rom- 
any one, inke, from Rumanian (van der Auwera 1998a: 68). These illustrations 
clearly show that there is no one ‘prime mover’, not one language — nor even a 
small set of languages — that was a model for all the others. Also, there is no claim 
that the relevant changes all happened in the early Middle Ages. There is even no 
claim that whenever a convergence can be assumed in the more central SAE lan- 
guages, this set has to include French and German. Thus although with the adverb- 
ial expression of phases French patterns very much like German, these two lan- 
guages are very different when it comes to adverbials of scalar addition. For the 
‘even’ type uses German has three strategies, viz. sogar, (nicht) einmal (lit.) *(not) 
once’, and auch nur (lit.) ‘also only’ and European languages are indeed typically 
complex, though often in different ways. French, however, is untypically simple 
here, in that it only uses méme (Gast and van der Auwera 2011). To reflect the over- 
all complexity of the SAE language situation, it has been suggested (Stolz 2006: 
293) that one should not use the terms ‘Sprachbund’ or ‘linguistic area’, but “‘Con- 
tact Superposition zone’ instead (term due to Koptjevskaja-Tamm and Wälchli 
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2001, proposed for the Baltic area). The point is well taken, though it remains to be 
seen whether in that case more of the alleged Sprachbünde or linguistic areas (even 
the Balkan one) will not end up as Contact Superposition zones as well. Of course, 
a lot depends on one's definition of the terms ‘Sprachbund’ and ‘linguistic area’. If 
these terms are only supposed to refer to areas where languages are geographically 
close as well as similar on account of their geographical closeness, then SAE can 
remain a Sprachbund or linguistic area. 

Another point is that some of the convergence may be accidental, in which case 
one must assume that when two languages converged the motivation will be lan- 
guage-internal. Phasal adverbials can again illustrate this point. Van der Auwera 
(1998a) has argued that there are semantic reasons for speakers organizing the 
semantic space of phases in terms of either three concepts (‘not yet’, ‘still’, ‘no 
longer’) or in terms of four (‘not yet’, ‘already’, ‘still’, ‘no longer’). From this 
point of view one understands that when a language misses one of the four adverb- 
ials it is invariably (at least in Europe) the one for ‘already’ and that the switch 
from a three-term system to a four-term system or vice versa, invariably happens 
through the acquisition or loss of the ‘already’ adverbial. In Europe most lan- 
guages clearly opt for the four-term system and some of them will have developed 
it on language-internal grounds. For other languages contact influence will have 
triggered the four-term system, as when Maltese borrowed digà from Italian. And 
equally importantly, the change could be prompted internally as well as externally. 
In the past linguists have admitted the latter kind of situation only grudgingly, but 
thanks to Heine and Kuteva's introduction of the notion of ‘contact-induced gram- 
maticalization’ (2005) there is much more readiness to accept internal and external 
factors as working in sync. In any case, to be able to hypothesize the language in- 
ternal origin or the external one or the combination, ultimately only historical work 
will provide evidence or at least plausibility grounds. 

A point related to the one about the relevance of language-internal factors 
is whether or not a feature that is frequent elsewhere may be considered an SAE 
feature. In Haspelmath’s account (2001: 1493), this is not allowed: “[...] what 
needs to be shown in order to demonstrate that a structural feature is a Euro- 
peanism [= an SAE feature] is [...] (iv) that this feature is not found in the majority 
of the world's languages." 

But one can take issue here. Fundamentally, there could very well be features 
that characterize the core of SAE and, possibly to a lower degree, also the pe- 
riphery, and that also occur in, say, 51 % of the world’s languages. Take the ex- 
pression of ‘no longer’ again. Given the spread of the comparative as the only strat- 
egy for expressing this meaning, shown on Map 2, given also that both the Iberian 
peninsula and Eastern Europe are different, that Latin was different too and that the 
change in non-Iberian Romance has therefore taken place in historical times, the 
SAE status of the feature is rather plausible, and it does not seem threatened by 
whether or not 51 % of the world's languages would be similar. A less hypothetical 
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example concerns definite articles. In his WALS work on articles Dryer (2005a, 
2005b) shows that 337 of 586 languages have definite articles and 204 of 473 
languages have indefinite articles. Since his work also shows an areal European 
concentration of articles, both definite and indefinite, it is hard to see why their oc- 
currence in other areas or families should be relevant for a decision on their SAE- 
status. 

There are two further points, methodological in nature. First, of course, cer- 
tainly the argumentative task for the SAE status of a feature is easier if the relevant 
feature is not a majority feature or better still if it is an exotic feature. Secondly, 
as Haspelmath (2001: 1493) admits, very often we simply do not know how widely 
a feature occurs in the world's languages. On this point, however, this millennium 
has seen some progress. 


4. After EUROTYP 


In the post-EUROTYP era one can discern at least two and possibly three major 
trends in the further study of SAE convergence. First, one addresses more seriously 
how SAE might be different from the rest of the world, not so much by further 
studying Europe on its own, but by studying both Europe and the rest of the world. 
I call this the *macro-orientation'. My illustration will involve modality. The other 
trend can be called the *micro-orientation'. Here it is again the European languages 
that are themselves in focus, but not really the standard languages that were the 
prime focus of the EUROTYP work, but the dialects or at least substandard ver- 
sions. I will illustrate the micro-orientation with the issue of negation. The third 
trend takes us out of Europe again, but with European languages, for these are by 
no means confined to Europe any more. If one can study the influence of Spanish 
on Basque and say that some of the influence testifies to Basque becoming more 
SAE, then there is no reason why one shouldn't consider this scenario also in Latin 
America, for example. I call this the ‘extra-territorial orientation’. 


4.1. The macro-orientation 


A major parameter in the study of modality is the distinction between epistemic 
and non-epistemic meanings (uses). (4) and (5) illustrate this distinction with the 
English auxiliaries must and may. I paraphrase non-epistemic readings in (a) and 
epistemic ones in (b). 


(4) a. John must stay home tonight. 
Non-epistemic: ‘I order John to stay home tonight’ 
b. John must be staying home tonight. 
Epistemic: “It is highly probable that John will be staying home tonight’ 
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(5) a. John may stay home tonight. 
Non-epistemic: 'I allow John to stay home tonight? 
b. John may be staying home tonight. 
Epistemic: ‘It is uncertain that John will be staying home tonight? 


Since must and may can both have these uses, the auxiliaries may be considered 
polyfunctional (vague or even ambiguous). But the situation in English may not be 
very typical for the world at large. To answer this kind of question and more than a 
hundred similar ones we now have the World Atlas of Language Structures (Has- 
pelmath et al. 2005). In one of its chapters (but see also van der Auwera and Am- 
mann 2005) the issue of the modal polyfunctionality was studied and found to be 
typical for the languages of Europe and possibly no other area. Map 6 is a version 
of the relevant WALS map. 

On Map 6 the set of polyfunctional languages includes French and German, as 
one would expect if one thinks in SAE terms. The German case is particularly in- 
teresting. That German was itself a source language for coterritorial or neighboring 
languages with respect to modality was known already. As early as in 1968 Porák 
drew attention to the fact that German müssen is widely borrowed in West and East 
Slavic languages and also that these languages exhibit a geographical cline as to 
the personal vs. impersonal expression of modality, with the westernmost lan- 
guages having more personal strategies. Meanwhile, the success story of müssen 
has attracted a detailed study by Hansen (1990), and the point about the geographi- 
cal dimension of the personal vs. impersonal expression types has been followed 
up by van der Auwera, Schalley and Nuyts (2005), a pilot study of the expression 
of epistemic possibility in a Slavic corpus of Harry Potter translations. 


d 
` f / 
Q 1. Overlap tor both posskety and neces ety [36] Ee : p 
— 2.Owerinp tor ether possibility or necessty OR No overian [171] 


pl eme 


Been, wf 
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a — 


Map 6. Modal polyfunctionality for possibility and necessity 
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The general conclusion is that macro-variationist work such as reflected on 
Map 6 is a most useful strategy of finding out about Europe. The more specific con- 
clusion pertaining to modality is that some aspects of modality such as we know it 
in German is probably a feature of SAE, and that German may have spread some of 
them to its neighbor languages. 

The case of modality documents a situation in which it is the macro-work itself 
that has led us to hypothesize the SAE status of a certain feature. Of course, macro- 
work can also be used to check earlier hypotheses, for example the nine hypotheses 
listed in (2). For most of these, WALS has confirmed these hypotheses, very clearly 
so, e.g. for the claims on ‘have’ perfects, relative clauses using relative pronouns, 
and the co-occurrence of negative pronouns with negative verbs, less clearly so for 
the non-identity of intensifiers and reflexives (with e.g. Kónig and Gast [2005] pre- 
ferring a genetic rather than an areal account). However, as demonstrated already, 
if a feature can be SAE only if it is not found in the majority of the languages, then 
the WALS evidence on articles will force one to remove articles from the list of 
SAE indicators (in (2)). But then I argued that this condition is probably too severe. 


4.2. The micro-orientation 


The EUROTYP work did not pay attention to dialectal variation. Such variation 
was not excluded in principle, but in practice dialects were rarely discussed. Con- 
sistently drawing in dialect data would have made the work vastly more challen- 
ging, in part also because for many phenomena there simply were no dialect data. 
Meanwhile, in the post-EUROTYP era Europe has seen a variety of dialect gram- 
mar projects, sometimes explicitly geared to complement or even correct the EU- 
ROTYP work. This is documented in detail in Murelli and Kortmann (this vol- 
ume), but I will briefly illustrate the SAE relevance of dialects here too. 

The illustration concerns negation. The issue of negation, it is worth noting at 
the start, may well been crucial in rekindling the interest in Whorf's SAE, as ne- 
gation in the languages of Europe and particularly also its areal typology occupied 
the EUROTYP protagonists Giuliano Bernini and Paolo Ramat (1992/1996), and 
they furthermore found at least some negation facts to be in support of the SAE hy- 
pothesis: “[...] it appears that on the typological level the various geographical dis- 
tributions [...] confirm the existence of a central Romance-Germanic ‘core area’, 
already well-known in historical linguistics." 

But they also already show that it is necessary to look at non-standard varieties. 
When they study the fate of the co-called ‘Jespersen’s cycle’ in the languages of 
Europe, they point out that the main clause postverbal negative marker as the sole 
exponent of negation is typical for German, standard as well as non-standard, but 
for French, we only find it in non-standard varieties (Bernini and Ramat 1996: 20). 
Standard French has a double, discontinuous negation, at least as the default op- 
tion, and in Jespersen's cycle this strategy is a transition stage between an earlier 
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preverbal ne and a later postverbal pas. The latter is found in substandard French: 
the ne has become optional. 


(6 a. Je ne viens pas. [Standard French] 
b. Je (ne) viens pas.  [Non-standard French] 
c. Ich komme nicht. [German] 
I NEG come NEG 
‘I don't come.’ 


The non-homogeneity of French is actually found in other Romance languages too, 
viz. in Italian, Catalan and Occitan. If at least some negation facts are indeed 
supportive of positing SAE, one should study these Romance data in detail, also 
historically. Of course one should also check the homogeneity of Germanic and 
then one soon finds out that present-day Germanic does indeed show variation, 
again on the substandard level. Though it is correct that nearly all of Germanic has 
developed the main clause postverbal negation out of an earlier discontinuous ne- 
gation. there is one dialect area which keeps the discontinuous negation, even to 
this day, viz. the Dutch dialects of the Belgian provinces of West and East Flanders 
and of the bordering French area of French Flanders. (7) pairs the facts of German 
and French with those of Dutch. 


(7) a. Ik kom niet. [Standard Dutch] 
b. Ik (en) kom niet. [Substandard East, West and French Flemish] 
I NEG come NEG 
‘I don't come.’ 


It is furthermore of interest that the one small corner of the Germanic area in which 
the original discontinuous negation survives, at least as an option, is contiguous to 
French, a language which also has the (NEG) V NEG strategy, and perhaps the 
contiguous dialects too. So contact interference is not to be ruled out off-hand. 

What this very brief illustration shows about the question of the status of the 
Jespersen cycle as a characteristic of SAE is that the answer is different for the 
standard and the non-standard varieties: (i) the strategy of using a postverbal ne- 
gation — and no preverbal negation — is the only option in standard German, stan- 
dard Dutch, and it is one of two options in some Flemish dialects of Dutch, and in 
non-standard French, (ii) these Flemish dialects and non-standard French also 
allow the discontinuous negation, (iii) in standard French the discontinuous ne- 
gation is the default strategy. Detailed accounts along these lines will not neces- 
sarily lead us to discard the Jespersen facts as diagnostic for SAE, though it is cor- 
rect that the isogloss of the postverbal negation only will not treat standard French 
and standard German in the same way, which is untypical for SAE. 

In the above case, one can argue that the dialects show an SAE feature to be 
somewhat different and more complex than at the level of the standard languages. 
There are two other possibilities. First, the dialects and the standard languages data 
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are in full accord. This may be the case for phasal adverbs, for instance, or put more 
carefully, I am not aware of any significant difference with respect to ‘already’, 
‘still’, etc. between a standard language and its dialects. Second, the dialects and 
the standard languages differ very strongly. This is something we do find and I can 
again use Dutch negation facts. It is a supposed to be a feature of SAE and one that 
is even supported by the macro-perspective that the negative pronoun does not 
occur with the negative verb. Standard Dutch illustrates this. 


(8) a. Ik heb niemand gezien. 
b. *Ik heb niemand niet gezien. 
I have nobody not seen 


‘I have seen nobody.’ 


However, in Belgian Dutch, (8b) is fine in the central area, and in the western area, 
the negative pronoun may co-occur with the predecessor of not, the particle en, dis- 
cussed already, and there is also an overlap area in which niemand may co-occur 
with both niet and en (van der Auwera and Neuckermans 2004). 


(8) c. Ik heb niemand niet gezien. 
d. Ik en heb niemand niet gezien. 
I not have nobody not seen 


‘I have seen nobody.’ 


For this case — as well as that of the relative pronouns — Haspelmath (2001: 1507) 
argues that Latin could have played a significant role in shaping the SAE standard 
languages. 


4.3. The extraterritorial perspective 


Basque is a language that is on the SAE periphery. On the phasal adverbials Map 4 
it scores below 7 features and on the Haspelmath Map 5 it scores below 6 features. 
Yet that by no means implies that Basque has not acquired any SAE features. Heine 
and Kuteva (2006: 246—252) follow Haase (1992: 158) and list the following fea- 
tures, relative to the Basque spoken in southwestern France (note the micro-per- 
spective again). 


(9) Some SAE features acquired by Basque: 
do as a causative marker, comitative as instrumental, rise of passive, rise of 
third person pronoun, status of modal auxiliaries, complex postpositions, rise 
of indefinite article, TAM system, interrogative subordination 


Let us follow Heine and Kuteva (2006: 257—263), themselves guided by Campbell 
(1985), to El Salvador and its Aztecan language Pipil. The Pipil language, like 
many other indigenous languages of Latin America, has been influenced by Span- 
ish, and it has therefore acquired the following SAE-features: 
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(10) Some SAE-features acquired by Pipil: 
definite and indefinite articles, prepositions, past participles as adjectives, 
‘be’ progressive, ‘go’ future, rise of coordination, relative complementizer, 
purposive complementizer 


If one is ready to say that Basque is a peripheral SAE language, then there is at least 
no linguistic reason to disqualify Pipil. And if Pipil is marginally SAE, then there 
are many more peripheral SAE languages. Of course, the locus of contact inference 
is not Europe and the (post)colonial social dynamics are different. Basically, the 
study of the SAE features of Pipil no longer concerns the areal typology of Europe, 
but it still concerns the areal typology of European languages. This field existed 
before the upsurge of interest in SAE and thrives independently (see e.g. the bur- 
geoning subdiscipline of ‘World Englishes’), but it is good that Heine and Kuteva 
(2006: 244—283) have made us aware of the link. 


5. Conclusion 


The main advantage of the fact that the term ‘Standard Average European’ was 
picked up in the nineties was that it sparked off a wealth of areal typological work 
on the languages of Europe, both to research the possible Standard Average Euro- 
pean Sprachbund itself, radiating from continental Romance and Germanic, and 
areas to the North and the South. In the history of the discipline it sparked off areal 
typological word on the world at large (WALS), on the dialects of the European 
languages and it made us realize SAE effects outside of Europe. None of the 
strands of research have been explored to their full potential, but we now under- 
stand better to what extent SAE languages are indeed standard or average for 
human language and to what extent they are exotic, and how European languages 
influenced other languages, both European and non-European. 


See also the following chapters in this volume: 16 by Tomić, 17 by Wälchli, 18 by 
Sanso, 28 by Murelli and Kortmann and 38 by Ramat. 
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1. The Balkan Sprachbund phenomenon 


Though often genetically only remotely related (and in some cases totally unrelated), 
the Balkan languages share sets of typological similarities. These similarities were 
first signalled by Jernej Kopitar (1829), who pointed out that the languages spoken 
south of the Danube have analogous forms expressed through “different language 
material”. Following Trubetzkoy (1928), who referred to relationships between 
languages that show similarities in sentence structure and word-formation but have 
no systematic sound correspondences as Sprachbund, the typological analogies be- 
tween the Balkan languages have been treated as Balkan Sprachbund features.! 
Since the amount, the extent, and the limit of these analogies have not and cannot 
be assessed independently, linguists have centered around typologically similar fea- 
tures existing in a minimum of three languages of the area, at least two of which 
belong to different genetic families. Most prominent of these are one phonological 
feature — the presence of schwa — and ten grammatical features: (1) prepositional 
cases, (2) postpositive definite articles, (3) dative/genitive merger, (4) location/ 
direction merger, (5) vocative case markers, (6) pronominal clitic doubling of ob- 
jects, (7) loss of the infinitive and its substitution by subjunctive complements, 
(8) analytic expression of futurity with a ‘will’ auxiliary, (9) analytic perfect with a 
*have'-auxiliary, and (10) the modal category of evidentiality (cf. Tomi¢ 2006). 
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1.1. Origin 


The Balkan Sprachbund has been hypothesized to derive from an ancient substrate — 
Thracian, Dacian, and/or Illyrian (cf. Kopitar 1829, Miklosich 1861; Weigand 
1928), from a more recent sub- or superstrate, namely Latin (cf. Solta 1980), or 
Ancient Greek - the language of an outstanding civilization (cf. Sandfeld 1930). 
The possible ancient "substrates" had, however, disappeared long before the Bal- 
kan Sprachbund relationships began to evolve, and it is hard to believe that they 
could be activated long after their disappearance. Latin or Ancient Greek can 
hardly have served as sources for the Sprachbund either, since in Latin the Balkan 
Sprachbund features are scarcely present and in Greek they are post-classical in- 
novations. The Balkan Sprachbund is not a simple transfer from a substrate or from 
one of the languages themselves; it is a typological phenomenon, which developed 
through convergence of language codes in a multilingual environment. According 
to Lindstedt (2000), we are here dealing with a shared drift whereby among the 
random changes in each language those spread more easily that contribute most to 
direct inter-translatability between the relevant languages. It should, however, be 
borne in mind that more than a mere need for effective communication is involved. 
As pointed out by Joseph (1983: 249), "the need for communicative efficiency in a 
multi-lingual setting is not readily transferable to a feature such as the postposed 
definite article, for there does not seem to be any obvious communicative advan- 
tage in any particular placement of the definite article". 


1.2. Membership 


There are quite a number of languages spoken in the modern Balkan states. The 
Balkan Sprachbund features have been identified in those languages which have 
been spoken in the Ottoman empire since the early Middle Ages - the South Slavic 
languages Macedonian, Bulgarian, and Serbian, the Eastern Romance languages 
Romanian, Aromanian, and Megleno-Romanian, Albanian, Modern Greek, the 
Balkan Romani dialects, Balkan Turkish, and Judeo-Spanish as spoken on the 
Balkans. Not all Balkan languages have an equal share in the Balkan Sprachbund; 
moreover, different analysts have paid attention to diverse languages and have 
ranked them diversely. Macedonian, Bulgarian, Romanian, Aromanian, Megleno- 
Romanian, Albanian, and Modern Greek are by most analysts treated as the “core” 
Balkan Sprachbund languages (cf. e.g. Weigand 1928, Sandfeld 1930, Schaller 
1975). Standard Serbian and Judeo-Spanish are looked upon as peripherally Balkan; 
Balkan Romani has been discussed only recently? while Turkish, the language 
which enjoyed a dominant social status when the Sprachbund features developed, 
is regarded as an instigator for the development of the Sprachbund more than as a 
participant in it. The balkanization of the “core” languages, however, differs from 
dialect to dialect. The core languages for the Sprachbund as a whole are not neces- 
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sarily “core” languages for each feature (cf. Tomić 2008). If languages are treated 
as diasystems of regional and social dialects and the functioning of individual 
features is taken into consideration, the area around the southern parts of the lakes 
of Ohrid and Prespa, where Greek, Albanian, Macedonian, Aromanian, and Arli 
Balkan Romani intersect, shows up as the epicenter of Balkanisms. Many of the 
Balkan Sprachbund features are also present in non-Balkan languages.? What is 
actually remarkable on the Balkans is the combination of linguistic features whose 
dynamic equilibrium contributes to the uniqueness of the union (cf. Civjan 1979). 


1.3. Convergence and divergence of features 


Balkanists have traditionally been concerned with compiling lists of similarities 
and drawing parallels between lexical and grammatical forms in individual lan- 
guages where the Balkan Sprachbund features are represented, thus perpetuating 
a picture of uniformity. The Balkan Sprachbund features, however, relate to the 
overall features of the systems of the languages in which they occur and this leads 
to differences in their functioning. With this in mind, what will be described in the 
present chapter is the convergence of a number of widely known Balkan Sprach- 
bund grammatical features in the “core” Balkan language codes (including dia- 
lectal varieties), along with divergence resulting from their interaction with other 
features in the structure of the nominal phrases and sentences in which they occur. 
In section 2 the features of the Balkan nominal case systems will be discussed, with 
special reference to definite articles. In section 3 the distinct usage of pronominal 
clitics will be dealt with. Section 4 will be devoted to subjunctive constructions, 
with particular mention of their use in the future tense forms. Section 5 will exam- 
ine the modal category of evidentiality. The chapter will be rounded off by some 
concluding remarks in section 6. 


2. Cases and articles 


The synthetic case markers of the nominal case systems of the core Balkan lan- 
guages are to a great extent replaced by analytic ones, though differently in different 
languages. In Macedonian, Bulgarian, and Megleno-Romanian there are only ves- 
tiges of the nominal declensions. Romanian and Aromanian have distinct dative 
forms for all nouns. Albanian nouns have three distinct oblique case forms — dative 
(Dat), accusative (Acc), and ablative (Abl), though the Acc forms are often equiv- 
alent to the nominative (Nom) ones, while the Abl forms are often equivalent to the 
Dat ones. In Modern Greek, masculine nouns with o-stems have distinct genitive 
(Gen) and Acc forms; masculine nouns with other stems have only one oblique 
form, though the Gen and Acc cases are overtly marked on articles; feminine and 
neuter nouns have a distinct Gen form, which also occurs in Dat positions, while the 
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Acc forms are equivalent to the Nom ones. The tendency towards analyticity of the 
nominal system, by itself, is not a Balkan specificity — it is encountered in many 
other Indo-European languages. Characteristic of the Balkan Sprachbund are, how- 
ever, a number of phenomena that accompany this tendency, such as postpositive ar- 
ticles (section 2.1), syncretism of the Gen and Dat cases, and analogous selection of 
prepositions for the expression of case relations (section 2.2). 


2:1: Postpositive articles 


The core Balkan languages have definite articles that inflect for gender and number 
and occur pre-nominally in Modern Greek, whereas in Balkan Slavic, Balkan Ro- 
mance, and Albanian we actually find morphemes included in nouns or pre-nom- 
inal noun modifiers, which are referred to as postpositive articles. Wherever they 
occur, the postpositive articles inflect for gender and number. In Macedonian (not 
in the eastern Macedonian dialects, though), there are also distinct sets of postposi- 
tive articles that exhibit spatial differentiation, denoting closeness to the speaker, 
closeness to the addressee, or distance from both the speaker and the addressee. 

Whereas the Balkan Slavic definite articles occur at the right edge of the nouns 
to which they refer, their Balkan Romance and Albanian counterparts appear to the 
right of the root of those nouns and often amalgamate with gender, number, and 
case markers. In (1) we have examples of the occurrence of definite articles in 
Macedonian (Mac), Romanian (Rom), and Albanian (Alb) feminine nouns with the 
meaning ‘girl’. 


(1) a, devojka devojkava devojkata devojkana Mac 
girl girl.Def.Prox1 girl.Def.Prox2 girl.Def.Dist 
a, devojki devojkive devojkite devojkine 


girls girls.Def.Prox1 girls.Def.Prox2 girls.Def.Dist 


b, fatd fata fete fetei Rom 
girl girlDef girlDat girl.Dat.Def 


b, fete fetele fete fetelor 
girls girls.Def girls.Dat als Da Det 


c, vajzé vajza vajze vajzés Alb 
girl — girlDef girl.Dat girl.Dat.Def 


c, vajza vajzat vajzave 
girls girls.Def girls.Dat/Dat.Def 


When the noun is preceded by pre-nominal modifiers, the postpositive article oc- 
curs at the right edge of the leftmost modifier. This is illustrated in (2) on examples 
of Bulgarian (Bulg) and Megleno-Romanian (M-Rom) noun phrases: 
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(2) a. malkoto belo prase Bulg 
small.N.Sg+the.N.Sg white.N.Sg pig.N 
“the small white pig’ 


b. micu alb porc M-Rom 
small.M.Sg+the.M.Sg white.M.Sg pig.M 
“the small white pig’ 


Due to differences in the use of pronominal modifiers, the frequency of occurrence 
of articled modifiers varies. In Macedonian and Bulgarian, where the modifiers are 
as a rule pre-nominal, and in Megleno-Romanian and Aromanian, where they are 
predominantly pre-nominal, the occurrence of articled modifiers is common. The 
Romanian and Albanian modifiers, which more often than not occur post-nom- 
inally, are articled only when they precede the noun in specific environments. 


2.2. Use of prepositions for the expression of case relations 


In the core Balkan languages, abstract case or thematic relations other than the 
agent are often marked by prepositions. Not only in Macedonian, Bulgarian, and 
Megleno-Romanian, where synthetic nominal case markers are rarely, if at all pres- 
ent, but also in Aromanian, where there are distinct Dat nominal forms, dative and 
genitive (possessive) relationships are marked by prepositions whose basic mean- 
ing is locative. Consider the Bulgarian and Aromanian examples in (3) with the rel- 
evant prepositions appearing to the immediate left of direct objects (3a,, b,) and 
nouns conveying possessive relationships (3a,, b,). 


(3) a, Dadox knigata na studenta. Bulg 
give.1Sg.Aor book+the.F.Sg to student+the.M.Sg 
‘I gave the book to the student.’ 


a, sestrata na profesora 
sister+the.F.Sg to professor+the.M.Sg 
‘the professors's sister’ 


b, Petri lji deade lilice a featilj(e)i. Arom 
Petri 3Sg.Dat.Cl give.3Sg.Aor flower to girl+the.F.Sg.Dat 
‘Petri gave a flower to the girl.’ 


b, sor-sa a profesorlui 
sisterthis.M.Sg.Cl to professor+the.M.Sg.Dat 
‘the professor’s sister’ 
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In the Northern Modern Greek (ModGrk) dialects, the indirect object is marked by 
preposition * Acc case markers, while in standard Modern Greek it is alternatively 
marked by Gen case markers or preposition + Acc case markers: 


(4) a. Eghrapsa sti Maria. ModGrk 
write.1Sg.Aor to+the.F.Sg.Acc Maria 


b. Eghrapsa tis Marias. 
write.ISg. Aor the.ESg.Gen Maria.Gen 
‘I wrote to Maria.’ 


In Albanian and Standard Romanian, there are no prepositions to the left of indirect 
objects. Note, however, that direct objects with prepositions appear in colloquial Ro- 
manian. In Standard Romanian, a preposition with basic locative meaning is used as 
a marker of the Acc case of proper names and human nouns without a definite article: 


(5) L-am vdzut pe Ion/profesor Rom 
3Sg.M.Acc.Cl-have.1Sg seen bast bat Acc.Marker Ion/professor 
‘I saw/have seen Ion/the professor.’ 


3. Pronominal clitics 


The core Balkan languages have two types of pronominal clitics: (a) clitics that 
operate in the domain of the noun phrase, to which we shall refer as adnominal, 
and (b) clitics that operate in the domain of the clause, to which we shall refer as 
clausal. Whereas the former choose as their syntactic domain the head or the first 
constituent of the noun phrase and encliticize to it, the latter co-occur with mood, 
modality, and/or negation clitics in clusters that typically occur to the left of the 
verb and, except for Bulgarian, are proclitic, though in non-negated imperative 
clauses and clauses with non-finite verbs they occur post-verbally and are enclitic 
throughout the area. In (6a) we have a Modern Greek clause with an adnominal 
clitic, in (6b) Macedonian and Bulgarian examples of clausal clitics operating in 
indicative clauses, whereas in (6d) the clausal clitics operating in non-negated 
clauses with non-finite verbs are illustrated by an example with a Romanian im- 
perative clause. The arrows («— and —) mark directionality of cliticization. 


(6 a. O Jorghos idhe tin KOri «— su. ModGrk 
the.M.Sg Jorghos see.3Sg.Aor the.F.Sg.Acc daughter 2Sg.Gen.Cl 
*Jorgo saw your daughter.' 


b, PETko > mu > go > DAL 
Petko 3Sg.M.Dat.Cl 3Sg.N.Dat.Cl given.M.Sg./-Part 
poDArokot. Mac 


present+the.M.Sg 
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b, PETko = mu «— e DAL Bulg 
Petko 3Sg.M.Dat.Cl be.3Sg.Cl given.M.Sg./-Part 
poDAráka. 


present+the.M.Sg 
*Petko has given the present to him.’ 


c. MáNÁncd < I! Rom 
eat.2Sg.Imper-3Sg.Acc.Cl 
‘Eat it!’ 


3.1. Forms and usage of the adnominal clitics 


The forms and usage of the adnominal clitics differ. In Bulgarian and Macedonian 
the forms of the adnominal clitics are borrowed from the Dat clausal pronominal 
clitics, while in Modern Greek the forms of the genitive adnominal clitics have re- 
placed the former dative clausal clitics. Thus, in Balkan Slavic and Modern Greek 
the Dat clausal clitics are formally equivalent to the adnominal clitics — a phenom- 
enon referred to as dative/genitive clitic merger. In Romanian there are two types of 
adnominal clitics: (a) adnominal clitics whose forms are equivalent to the forms of 
the dative clausal clitics and (b) adnominal clitics whose forms are phonologically 
deficient versions of possessive modifiers. The Aromanian and Megleno-Romanian 
adnominal clitics are equivalent to the Romanian clitics of the (b) type. In Albanian, 
adnominal clitics do not occur, though in a number of Tosk Albanian dialects the 
function of the Balkan Slavic and Balkan Romance adnominal clitics is performed 
by clitics that mark the agreement of adjectives with the nouns to which they refer. 

All adnominal clitics express possessive relationships but their references 
differ. The Modern Greek Gen adnominal clitics are robust; they can refer to a wide 
variety of possessors in diverse contexts and can encliticize to the possessor, its first 
modifier or both to the possessor and its first modifier. The Bulgarian Dat adnomi- 
nal clitics are also robust and differ from their Modern Greek counterparts only in 
lacking the ability to encliticize to the noun when it is preceded by a modifier. The 
Macedonian Dat adnominal clitics can only refer to non-articled possessors denot- 
ing family relationship, to which they have to encliticize. The Romanian Dat ad- 
nominal clitics are used only in literary or archaic contexts and are most often fea- 
tured by third person singular Dat clitics which encliticize to a modifier whenever 
there is one. The usage of the Balkan Romance phonologically deficient possessive 
modifiers is restricted to singular possessors denoting family relationship. In (7) we 
have examples of adnominal clitics denoting possessive relationships in Modern 
Greek (ModGrk), Macedonian, and Aromanian: 


(7) a, o pateras su ModGrk 
the.M.Sg father 2Sg.Gen.Cl 
*your father’ 
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a, to heri mu 
the.N.Sg hand 1Sg.Gen.Cl 
‘my hand’ 
a, ta meghala tu dulapja tu 
the.N.Pl big.N.Pl 3Sg.N.Gen.Cl cupboards 3Sg.N.Gen.Cl 
“its (a room’s) big cupboards’ 


b, sestra mi Mac 
sister 1Sg.Dat.Cl 
‘my sister’ 

b, tatko ti 


father 2Sg.Dat.Cl 
‘your father’ 


b, tetki ni 
aunts 1Pl.Dat.Cl 
*our aunts' 


c, sor-me(a) Arom 
sister-my.F.Sg.Cl 
‘my sister’ 


C, tetd-sa 
aunt-his/her.F.Sg.Cl 
‘his/her aunt’ 


3.2. Clitic doubling 


The pronominal clitics in the Balkan languages often double direct and indirect 
objects. In Macedonian, Megleno-Romanian, and the Aromanian dialects spoken 
in Macedonia clitic doubling is to a large extent grammaticalized; basically, all 
definite direct objects and all specific indirect objects in these languages are clitic 
doubled. In the Aromanian dialects spoken in Greece clitic doubling behaves as it 
does in Modern Greek. In (8) we have a Macedonian example: 


(8) Jana mu go dade pismoto Mac 
Jana 3Sg.N.Dat.Cl 3Sg.N.Acc.Cl gave.3Sg letter-the.N.Sg 
na deteto/ na edno dete. 


to child+the.N.Sg to a.N.Sg child 
‘Jana gave the letter to the child/a child (that I know).’ 


In Bulgarian, Albanian, Modern Greek, and Romanian, clitic doubling involves 
discourse factors. In Bulgarian both direct and indirect objects are clitic-doubled if 
topicalized. In some cases, though, in situ non-focused objects can also be clitic- 
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doubled, resulting in a specificity and definiteness effect. In Albanian, indirect 
object clitic doubling is fully grammaticalized, while direct object clitic doubling 
is contingent on topicalization, as well as on specificity. In Modern Greek, clitic 
doubling of both direct and indirect objects is contingent on topicalization and spe- 
cificity, though in colloquial speech a doubling clitic tends to accompany every 
specific indirect object. In Romanian, clitic doubling of indirect objects is typically 
conditioned on specificity, whereas direct object clitic doubling is contingent on 
topicalization, on specificity, and on whether reference is made to a human being. 
Consider the Albanian examples in (9): 


(9) a. Agimi *(i) dérgoi lule Alb 
Agim+the.M.Sg 3Sg.Dat.Cl send.3Sg.Aor flower 
Drités. 


Drita+the.F.Sg.Dat 
“Agim sent flowers to Drita.’ 


b. Jani *(e) pret Merin 
Jani+the.M.Sg 3Sg.Acc.Cl expect.3Sg Mary+the.Acc 
të ankohet. 


Subj.Marker complain.3Sg.M-P.Pers 
‘Jani expects Mary to complain." 


4. Subjunctive constructions 


The most perspicuous and most widely discussed feature of the Balkan Sprach- 
bund languages is the loss of the infinitive and its replacement by structurally com- 
parable subjunctive complements with tensed verbs preceded by subjunctive 
markers.^ 


4.1. Typical uses of subjunctive constructions 


Typically, subjunctive constructions are used as complements of lexical modal 
verbs (10a,,b,) and as complements of intentional verbs (10a,,b,). The examples in 
(10) are taken from Aromanian, where, as elsewhere in Balkan Romance as well as 
in Albanian, the subjunctive markers may be preceded by subjunctive complemen- 
tizers and the verbs have subjunctive morphology, and from Modern Greek, where, 
as in Balkan Slavic, the subjunctive markers are not preceded by complementizers 
and the verbs have no subjunctive morphology: 


(10) a, Feata lipseaschce (td) s-neagd. Arom 
girl+the.F.Sg should.3Sg that.Subj Subj.Marker-go.3Sg.Subj.Pres 
"Ihe girl should go.’ 
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a, 


4.2. 


Nu vrea (td) s-minduiascd. 
not want.3Sg.Imperf that.Subj Subj.Marker-think.3Sg.Subj.Pres 
*(S)he didn't want to think.’ 


Prepi na fighis. ModGrk 
must.2Sg Subj.Marker leave.2Sg 
“You must leave.’ 


Thelo na pao eki. 
want.1Sg Subj.Marker go.1Sg there 
‘I want to go there.’ 


Bare subjunctive constructions 


Subjunctive constructions can be bare, i.e. can occur by themselves and not as 
complements of main clauses, in which case they most frequently express wishes 
or commands. But they can also express resolution, astonishment, wonderment, 
and even missed opportunity or unrealized probability. Consider the Macedonian 
and Albanian examples in (11): 


(11) a, 


a, 


a; 


Da odis! Mac 
Subj.Marker go.2Sg 
“You should go" 


Da sum taka sut!? 
Subj.Marker be.1Sg so  foolish.M.Sg 
“To be such a fool!?’ 


Da go zemese! 
Subj.Marker 3Sg.N.Acc.Cl take.2/3Sg.Subj.Past 
* You/(s)he should have taken it!’ 


Té shkosh! Alb 
Subj.Marker go.2Sg.Subj 
“You should go!’ 


Té provojmé 

Subj.Marker ` try.1Pl 

‘Let us try!’ 

Té martohet kaq herét! 


Subj.Marker marry.3Sg.M-P.Pres so soon 
“To get married so soon!’ 
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4.3. Future tenses 


Subjunctive constructions also appear in the Balkan future tenses. In Standard Ro- 
manian the future tense is constructed by infinitives preceded by inflecting ‘will’ 
auxiliary clitics; in formal and colloquial Romanian, however, there are future 
tenses with subjunctive constructions preceded by inflecting or non-inflecting ‘will’ 
auxiliaries, as well as with ‘have’ auxiliaries. In Aromanian (12a,,a,) we have sub- 
junctive constructions preceded by non-inflecting ‘will’ auxiliaries in the positive 
future tenses and impersonal ‘have’ auxiliaries in their negated counterparts. In 
Megleno-Romanian the positive future tenses are actually bare subjunctive con- 
structions, while in the negated future tenses the subjunctive constructions are pre- 
ceded by impersonal ‘have’ auxiliaries. The Standard Albanian future tenses are 
constructed by subjunctive constructions preceded by non-inflecting ‘will’ auxili- 
aries, whereas in the Geg Albanian dialects (12b,,b,) we have subjunctive construc- 
tions preceded by inflecting ‘have’ auxiliaries. Balkan Slavic and Modern Greek 
used to have future tenses with ‘will’ auxiliaries, followed by subjunctive construc- 
tions introduced by subjunctive markers, but these markers were subsequently lost 
and the future tenses in these languages are now constructed by non-inflecting ‘will’ 
auxiliaries followed by tensed lexical verbs (see 12c, from Bulgarian). Subjunctive 
constructions with subjunctive markers to the right of ‘have’ auxiliaries are, how- 
ever, used in the Balkan Slavic negated future tenses (see 12c,). In (12) we give 
examples of Aromanian, Geg Albanian (G-Alb), and Bulgarian future tenses: 


(12) a, Va s-yin máne. Arom 
will.Mod.Cl Subj.Marker-come.1Sg tomorrow 
‘I will come tomorrow.’ 


a, Nore (td) s-yind. 
not+have.Impers that.Subj Subj.Marker-come.3Sg/Pl.Subj.Pres 
*(S)he/they won’t/are not going to come.’ 


b, Kemi me ärdh. G-Alb 
have.1Pl to.Inf.Marker come.Non-Fin 
“We will come.’ 


b, Nuk ka me ärdh nesér. 
not have.3Sg to.Inf.Marker come.Non-Fin tomorrow 
*(S)he will not come tomorrow.’ 


c, Ste ti četa utre. Bulg 
will.Mod.Cl 2Sg.Dat.Cl read.1Sg tomorrow 
‘I will read to you tomorrow.’ 


c, Njama da dojdat. 
not+have.3Sg Subj.Marker come.3Pl.Perf.Pres 
“They won't come.’ 
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5. Evidentials 


In Albanian, Macedonian, Bulgarian, Megleno-Romanian, and an Aromanian dia- 
lect there is a modal category, which different linguists have defined and named 
differently. The Balkan Slavic exponents of this category are 'be'-auxiliary clitics 
followed by /-participles, which originally featured the perfect, while in Albanian 
and Megleno-Romanian there are distinct modal markers based on the perfect. In 
Macedonian grammars this category is described as a category expressing reported- 
ness (cf. Koneski 1967), in Bulgarian grammars the exponents of this category are 
referred to as perfect tense forms describing “unwitnessed events" (cf. BAN 
1982—83; BojadZiev et al. 1999), whereas Albanian grammars name a series of para- 
digmatic sets that have developed from the perfect but have distinct usage habitore 
(from habi ‘surprise’; cf. Newmark et al. 1982). However, the attributes reported 
and unwitnessed do not cover all the uses that the forms of this modal category have 
in Balkan Slavic, just as ‘surprise’ is not the only meaning that the Albanian habi- 
tore paradigms impart. Following Friedman (2004: 103), I adopt the term eviden- 
tiality and define the given category as a modal category that expresses “the subjec- 
tive relationship of the participant in the speech event to the narrated event." 

The Albanian evidential forms typically express admiration, surprise, or irony 
and have in Albanian grammars been labeled “admiratives”, but can also be used to 
assert the occurrence of an action or express disagreement or uncertainty: 


(13) a. TI (me te vérteté) flitke kinezge! Alb 
you in Pl.Agr.Cl truth speak.2Sg.Evid Chinese 
“You really speak Chinese!’ 


b. Ai (vérteté) e paska pérdorur 
he truth 3Sg.Acc.Cl have.3Sg.Pres.Evid used.Part 
thikél! 
knife+the.M.Sg.Acc 
*He indeed used the knife!’ 


c. Thonë se lakam tesha çdo ditë. 
say.3Pl that wash.1Sg.Pres.Evid clothes.Sg+the.F.Sg every day 
"They say that I wash/am washing clothes every day (but I don't). 


The Macedonian, Bulgarian, and Megleno-Romanian evidentials typically impart 
the speaker's non-awareness of the event to which the verb refers but can also ex- 
press admiration, surprise, or irony: 
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(14) a, Veli deka si me videl. Mac 
say.3Sg that be.2Sg 1Sg.Acc.Cl seen.M.Sg.Part 
*(S)he says that you have seen me.’ 


a, Ama ti si bil junak! 
but you are.2Sg be.M.Sg.Part hero 
1. “You are such a hero" (admiration) 
2. “You a hero!’ (irony/surprise) 


b, Zisi ca t-li M-Rom 
say.3Sg that 2Sg.Dat.Cl+3P1.F.Acc.Cl 
vutau dat 


have.3Sg.Inv.Perf given.Past.Part 
*(S)he says that (s)he has given them to you.’ 


b, Tu  fostai äncrilatä! 
you be.2Sg.Evid clever.F.Sg 
1. “You are clever!’ (admiration) 
2. “You clever!’ (irony) 


The Megleno-Romanian and Macedonian evidential forms, though not their Bul- 
garian counterparts, are also commonly used in connected narratives. 


(15) a. Sum ti stanala rano i sum Mac 
be.1Sg 2Sg.Eth.Dat.Cl get up.F.Sg.Part early and be.1Sg 
ti se dala na rabota. 


2Sg.Eth.Dat.Cl Impers.Acc.Cl give.F.Sg.Part on work 
‘Lo and behold, I got up early and got to task.’ 


b. Ash vutau und mumă trei  feati... M-Rom 
Impers.Dat.Cl have.3Sg.Inv.Pf aF mother three girls 
‘(Once upon a time) a mother had three daughters ...’ 
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6. Concluding remarks 


The Balkan Sprachbund features do not show up uniformly in all the languages in 
which they occur. In Table 1, the regular occurrence of a feature in the languages 
discussed in this chapter are represented by a plus symbol, the possible occurrence 
of a feature by a bracketed plus symbol, and the non-occurrence of a feature by a 
minus symbol. 


Table 1: Occurrence of grammatical balkanisms 


Balkan Slavic Balkan Romance Albanian Modern 


Mac Bulg Rom M-Rom Arom Greek 

Cases and articles 

postpositive articles + + + + + + = 

prepositional case relations + + (+) + (+) (+) (+) 
pronominal clitics 

dative/accusative merger + + (+) - = = + 

phonol. deficient possessives = — (+) + + = _ 

clitic doubling + + + + + + $ 
subjunctive constructions 

with subjunctive markers + + + + + + + 

V with subjunctive morphology - - + + + + - 

preceded by complementizers = - (+) (+) (+) (+) z 

bare subjunctives + + + + + + + 

‘will’ modal subjunctives - (+) (+) - + + - 

‘have’ subjunctives t + (+) + = (+) = 
Evidentials + + - + (+) + E 


The divergences in the representation of a feature reflect not only differences in 
language structure but also differences in areal contact. Thus, evidentials, as a rule, 
do not occur in the Romance languages but they are robust in Megleno-Romanian — 
a language in close areal contact with Macedonian. The analogies for the condi- 
tions for clitic doubling of indirect and direct objects of Modern Greek, Albanian, 
Bulgarian, Macedonian, Romanian, Megleno-Romanian, and the Aromanian dia- 
lects spoken in Macedonia (M-Aromanian), represented in Table 2, show that areal 
proximity determines the conditions for the occurrence of a feature. 
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Table 2: Conditions for clitic doubling of Balkan direct and indirect objects 


Bul Mac  M-Rom  Arom Rom Alb  ModGrk 
direct ` definite yesToP yes yes yes yesNon-ArtHum  yesSpecTop yegTop 
objects | indefinite yesExXTop ` yesEx yesEx yesE* ` yesHumSpecTop yesSpecTop  yesTop 

bare no no no no no no no 
indirect def yesToP yes yes yes yesTop yes yes(Top) 
objects indef [+spec] yesE*Top yes yes yes yesTop yes yes(Top) 

indef [-spec] no no no no no no no 

bare — no yesOrt yesOpt yesOpt no no no 


Ex = exceptionally when the given object contrasts with some other object in the domain of 


discourse 

ExTop = exceptionally in topicalized objects in sentences involving embedded subjunctive 
clauses or adjuncts 

HumSpecTop = in human, specific, topicalized objects 

Non-ArtHum = in non-articled, human, specific, topicalized objects 

Opt = optionally 

SpecTop = in specific, topicalized objects 

Top = in topicalized objects 

(Top) = in topicalized objects for some speakers; in all cases for others 


The fact that the usage of the Megleno-Romanian and Aromanian Balkan Sprach- 
bund features is closer to the usage of corresponding features in adjacent Macedo- 
nian — a language belonging to a Slavic linguistic family — and differ from 
Romanian — a Balkan Romance language with which they are genetically closely 
related — shows that intensive language contact may override phylogenesis. 


Notes 


] Note that Weinreich (1953, 1958) argues against the use of the term Sprachbund, its fault 
being that “it implies a unit, as if a language either was or was not a member of a given 
Sprachbund" (cf. Weinreich 1953: 378), and opts for the term language union. More re- 
cently, Reiter (1994) attempts to do away with the terms balkanism and language union, 
while Hinrichs (1999), in the title of the book, as well as in its introductory chapter, 
avoids the term Balkan altogether, opting for Südosteuropa ‘South Eastern Europe’. 

2 Matras and Bakker (1997) point out that Balkan Romany has many Balkan Sprachbund 
features, while Lindstedt (2000) indicates that it is less “Balkan” than Balkan Slavic and 
Balkan Romance, but as much "Balkan" as Modern Greek. 

3 In some cases, there is a wider (Mediterranean or Central European) area in which the 
feature is witnessed. Ammann and van der Auwera (2004) demonstrate that the modal 
construction consisting of a complementizer and an independent main clause, i.e. the sub- 
junctive construction which appears independently and not as a complement of a verb, is 
used to express volition not only in the languages of the Balkans but also elsewhere in 
Southern Europe. 
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4 Most of the examples are taken from Tomić (2006). The following abbreviations are used 
in the glosses: 1/2/3 = 1Y/2rd/3rd person; Acc = accusative; Admir = admirative; Agr = 
agreement; Aor = aorist; Aux = auxiliary; Cl = clitic; Dat = dative; Def = definite; Dist = 
distant; Evid = evidential; F = feminine; Gen = genitive; Inf = infinitive; Imperf = imper- 
fect (tense), imperfective (aspect); Impers = impersonal; Inf = infinitive; Inv = inverted; 
M = masculine; Mark = marker; Mod = modal; M-P = medio-passive; N = neuter; Non- 
Fin = non-finite; Part = participle; Past = past (tense); Perf = perfect (tense); perfective 
(aspect); Pl = plural; Pres = present (tense); Prox! = proximate to speaker; Prox2 = proxi- 
mate to addressee; Refl = reflexive; Sg = singular; Subj = subjunctive 

5 The infinitive has been completely lost in Macedonian and Modern Greek and almost 
completely lost in Bulgarian and Aromanian. In Megleno-Romanian, it shows up in a 
limited number of constructions, while in Romanian many of the uses of the subjunctive 
constructions are shared by infinitives. As for Albanian, it is not quite clear whether the 
Indo-European infinitive was lost and subsequently reinstituted atypically or whether the 
Albanian quasi-infinitival constructions are not related to infinitives at all (cf. Joseph 


1983). 
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17 The Circum-Baltic languages 


Bernhard Wilchli 
1 The Circum-Baltic area as a contact superposition zone and as a buffer zone 
2 Contact superposition 
3. Buffer zone features between SAE and Eurasia 
4 Spreading “Uralic” and “Indo-European” properties 
3: Selected particular properties of CB languages within Europe 
5.1. Polytonicity 
5:2. Partitive and distributed head properties 
5.3. Logophoric pronouns 
5.4. Evidentiality 
5.5. No or only weakly grammaticalized future tense 
6. Contact phenomena leading to less structural similarity and substrate effects 
7. Conclusions 
1. The Circum-Baltic area as a contact superposition zone 


and as a buffer zone! 


The term Circum-Baltic languages refers to the languages spoken around the Bal- 
tic Sea, which mainly belong to two genealogical families, Indo-European (Baltic, 
Germanic, Slavic plus Romani) and Uralic (Finnic), but also comprise a Turkic 
language, Karaim, spoken in Lithuania. The Circum-Baltic (CB) languages do not 
form a structurally strictly delimitable area but rather a subarea of the Eurasian lin- 
guistic population. The motivation to treat Circum-Baltic languages together stems 
from the many typological parallels between not genealogically definable subsets 
of languages and dialects testifying to the turbulent history of the area, which has 
never been united politically and has always been in tension between competing 
influences from the East and from the West. The CB area is a contact superposition 
zone rather than a Sprachbund and is part of a “buffer zone” between Standard 
Average European (SAE) and Central Eurasia, which stretches from Sami in the 
north over the CB region to eastern Central Europe (Czech, Slovak, Hungarian) 
and the Balkans to the Caucasus. 

In comparison with other well-known linguistic areas where languages from 
more than one genealogical stock meet, such as South Asia or Mesoamerica, the 
number of languages involved in the CB area is relatively small — about 20-30 
depending on how they are counted (see Map 1): Baltic: Lithuanian (including 
Zemaitian), Latvian (including Latgalian), and extinct Old Prussian; Finnic: Livo- 
nian, Estonian (North & South), Votian, (Ingrian), Finnish, Karelian (three major 
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Skolt Sami 


Inari 


Lule Sami Sami 
Pite Sumi 
Ume Sami 
Edsele 
, ve Karelian 
Southern Finnish 
Sami 
Ludian 
Dalecartian Olonetsian 
Norwegian Stadin Veps 
m 
Swedish Ingrian 
tvian . 
i Russian 
Zemaitian Latvalian 
Lithuanian 
Ald Prussian 
cashubian Karaim 


Belorussian 


German 
"P Polish 
High German 


Map 1. The Circum-Baltic languages (excluding territorially widely spread languages 
like Romani or Yiddish) 


varieties), Veps; Germanic: Swedish, Danish, German, Low German, Yiddish; 
Slavic: Russian, Belorussian, Polish, Kashubian; Indo-Aryan: Romani; and Tur- 
kic: Karaim. However, what makes the CB area specific and highly relevant for the 
theory of language contact is the large number of well-documented dialects and 
contact varieties (many of them endangered or extinct). This is why this article fo- 
cusses on contact varieties and minor languages of the area and uses the few refer- 
ences allotted mainly to illustrate selectively the diversity of dialects and minor 
contact varieties in the area. This article concentrates on structural effects of lan- 
guage contacts. For a study of code mixing between Karelian and Russian in Kare- 
lia see Sarhimaa (2000). A more comprehensive survey of the CB languages in 
general is found in the papers in Dahl and Koptjevskaja Tamm (2001), with a sur- 
vey of the literature in the synthesis article by Koptjevskaja Tamm and Wilchli 
(2001, henceforth KTW), including many references to earlier literature. 

The CB area arose from a series of ancient Indo-European and Uralic spreads 
and is characterized by long and repeated language contacts between various Indo- 
European and Uralic language varieties over four millennia. The spreads entailed 
a low genealogical diversity on the macro-level. However, given the prolonged 
presence especially of Finnic and Baltic in the area, a local residual zone in the 
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eastern part of the area (the Baltikum) evolved which preserved traces from many 
subsequent periods of contact between Finnic and Baltic language varieties. This 
area is wedged in between more recent Germanic and Slavic spread zones and ab- 
sorbed small groups of minor spreads (such as Karaim and Romani). 


2. Contact superposition 


In order to assess the effect that various subsequent periods of contacts had on lin- 
guistic structure, it is instructive to consider domains with a multiplicity of inter- 
fering structural relationships. 

Lexical phenomena or grammatical phenomena with many lexical idiosyncrasies 
are often good indicators of areality because they can less easily be accounted 
for by universal grammaticalization paths. Such a favourable domain are pluralia 
tantum (collective nouns with plural markers), which have a high frequency es- 
pecially in Baltic, but also in other conservative Indo-European languages, and are 
an innovation in Uralic, with high frequencies in some Finnic languages (KTW 
629—637). Figure 1 is a neighbor net (Huson and Bryant 2006) of the occurrence of 
pluralia tantum in selected European languages in a sample of thirty concepts 
based on Baltic languages (KTW 734, note 13). It illustrates the similarity of plu- 
ralia tantum in Europe to Baltic languages. Similarity and distance are represented 
in neighbor nets as the shortest distance in terms of connecting lines between any 
pair of elements. The neighbor net shows that Livonian and Latvian Romani (both 
spoken in Latvia) are most similar to the Baltic languages, but there are no particu- 
larly close relationships between Livonian and Latvian Romani. Latvian Romani is 
rather more similar to Slavic languages. 

Characteristic for CB languages are verb particles and/or verb prefixes for 
determining activity or manner expressed in the verb stem (consider Livonian jard 
*away' with perfective function in example 13 below). Such particles are not com- 
mon in Uralic in general, but occur in Finnic, Sami, and Ugric (Hungarian, Mansi, 


Romanian 


German 
Modern Greek Romani 


yydtan 
* , OkiGreok 


Latvian Livonian 


Lithuanian acn 


= Spanish 
Latvian Romani ———— Serbo-Croatian = 
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Czech A Poli Y 

Belarusian ` ' m 
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Figure 1. Similarity of pluralia tantum in selected European languages to pluralia tantum 
in Baltic languages 
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Khanty). Path in motion verbs is mostly expressed by verb stems in Uralic lan- 
guages, such as Mordvin sovams ‘enter’, l’isems ‘exit’, but not in Finnic, where we 
have (Finnish) mennä (lähteä) sisään, ulos ‘go (depart) in, out’ instead. Some 
Finnic verb particles derive from stems which are Germanic or Baltic loanwords. 
Verb particles later derived from such stems have then been extended in use due to 
Baltic and Germanic influence especially in Southern Finnic. This again inspired 
Latvian and the northernmost Lithuanian dialect to develop verb particles and 
to use them more and more often together with, or instead of, prefixes. Latvian, 
Livonian, and Estonian verb particles often have very similar (but not identical) 
ranges of use and derive from semantically similar sources (Latvian vaļā ‘open, 
untied, loose’, locative of vala ‘freedom, leisure, permission, will’ corresponds to 
Livonian vdldin, originally instructive case of valda ‘power, free will, permission’, 
a Germanic loanword in common Finnic) testifying to their parallel development. 
Finally, some Finnic varieties with most intensive contact with Latvian (Livonian 
and the Leivu enclave, a Southern Estonian dialect in Livonia) have borrowed Lat- 
vian prefixes independently of each other (Wälchli 2001). 

Another domain reflecting multiple non-overlapping contacts are yes/no-ques- 
tions (KTW 712-714). 


3. Buffer zone features between SAE and Eurasia 


The following features exemplify the buffer zone character of the CB-area between 
SAE and Central Eurasia. 

(a) Word order (see also KTW 704—712): The Central Eurasian prototype (“Al- 
taic”, South Asia) has rigid subject-object-verb (SOV), rigid subject-local phrase- 
verb (SLV), and genitive-noun (GN) order. Characteristic for West Europe are, 
however, SVO/SVL (Celtic even VSO) and dominant NG order. The order proper- 
ties of western languages seem to be an innovation; for both Uralic and Indo-Euro- 
pean SOV is reconstructed. Between the West European and Central Eurasian 
poles, many CB-languages are characterized by different kinds of flexible consti- 
tuent order in the clause, a property shared with Hungarian and some languages of 
the Caucausus (Georgian, Ossetic). 

Itis well known that order properties strongly correlate, but they do not corre- 
late strictly, especially in buffer zones. Characteristic for a large part of the CB area 
is the combination of dominant SVO with GN order (Finnic, Latvian, Standard Li- 
thuanian, Danish, Swedish). Interestingly, rigid GN order in Continental Scandi- 
navian is an innovation that seems to have started in Danish and spread to Swedish 
around 1250-1350 and is not characteristic of contact varieties with Finnic sub- or 
adstrate (KTW 709). Consistent with the “earlier” shift of SOV > SVO relative to 
GN > NG is a certain preference for object-verb order in non-finite constructions 
(infinitives, converbs) in the whole area. (Non-finite constructions tend to share 
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some properties with noun phrases.) A strong grammaticalization of a particular 
kind of this tendency is the strict OV order in subordinate clauses in German. 

A very clear case of contact-induced change in constituent order is Karaim 
which, unlike other Turkic languages with rigid SOV/SLV, GN, and AdjN order, 
has SVO, NG and NAdj order, as in (1). N Adj order is probably due to Polish in- 
fluence. Slavic languages have dominant NG order and Polish has dominant N Adj 
order. 


(1) Karaim (Kowalski 1929: 112) (from a stage play) (SVL, N Adj) 


Salom alexem, k’ork’lu-C’a-m m'enim. bu-mo dz'ed-lar 
peace with.you beautiful-F-PosslscG my DEM-Q parent.PL 
jox juv’-d’a. 

NEG.EX house-Loc 


‘Hello, my dear, are the parents not at home?’ 


However, the explanation of constituent order in Karaim is not as simple as that. 
Karaims belong to the Karaite sect of Judaism and their written religious prestige 
language is Hebrew. Crimean Karaites were brought to Trakai by the Lithuanian 
Grand Duke Vytautas in 1397/8. They were an elite and had little contact with Li- 
thuanian spoken by peasants. Unlike other people in the area, Karaims have little 
oral tradition but a large body of literature, mainly consisting of translations from 
Hebrew, where Karaim strictly preserves Hebrew constituent order. There is thus a 
convergence of the influence of Polish and East Slavic varieties in spoken language 
and Hebrew in written language (Kowalski 1929: xxxix). Further obvious contact 
features in (1) are the question particle as second position clitic (rather than 
attached to the verb, as characteristic of Turkic), the feminine derivational suffix 
of Slavic origin, and the Polish loanword for ‘parents’ instead of a co-compound 
"father-mother". 

Other varieties in the CB area are strongly influenced by German word order. 
Standard Estonian word order is similar to German and exhibits certain features 
of V2 and verb-final order in subordinate clauses. Example (2) is from Nehrungs- 
kurisch. 


(2) Nehrungskurisch (Pietsch 1982: 98) 


Is rud-in-es ferie-és dar-ij t-iems 

in autumn-ADJ,,-LOC.PL holiday,,.Loc:PL make-pst3 that.DAT.PL 

vaik-ams t-as varn-es raut-um-s dauge 
child, ,,-DAT.PL that-NOM crow-NOM:PL snatch-NOML-NOM:sG much, 


dzaug-um-s 

rejoice, „-NOML-NOM!SG 

‘In autumn holidays the catching of crows gave much pleasure to children.’ 
Nehrungskurisch originates from Tamian Latvian dialects but was spoken by 


fishermen on the spit between Klaipeda and Prussia. The syntax at least of the idio- 
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lect of Pietsch (the texts were written down in the German exile long after World 
War II) is mostly German and many elements (also some morphological ones) are 
Lithuanian. In (2) the word order is German and there are definite articles as in 
German (used in slightly fewer contexts). The -ms forms in the dative plural are 
a relic in Latvian, restricted to Tamian dialects of Curonia, but well in line with the 
Lithuanian form of the dative plural. The subscripts in the glosses mark elements 
of Lithuanian and German origin. 

(b) Head and dependent marking: Both case and person marking are highly re- 
stricted in SAE languages when compared to Central Eurasia. The CB languages 
reflect this field of tension. However, some Finnic languages have a large number 
of cases (14 in Estonian and up to 24 in Veps, where the number is a matter of 
debate). An example for the reduction of cases is the use of the preposition iz with 
the genitive in Nehrungskurisch (1) instead of the Latvian locative. Person mark- 
ing on verbs is completely lost in Modern Swedish and Danish and in some Tamian 
dialects of Latvian, but many other CB varieties reflect restricted person marking. 
In Võru Southern Estonian there is a syncretism of 1sc and 1PL and of 2sc and 2PL, 
while 3sc and 3pL remain distinct, and in Livonian there is a complete syncretism 
of 1sc and 3sc. A general feature of Baltic is the non-distinctiveness of all 3rd per- 
son forms, paralleled in many Finnish dialects (but obviously not due to language 
contact). A relevant example is Stadin Slangi Finnish in (3), a mixed language with 
(Colloquial) Finnish grammar and a core vocabulary from Swedish, Swedish dia- 
lects, Russian, and other sources. 


(3) Stadin Slangi Finnish (Garam 2003: 26) 
Mesta-t mis kundi-t tsuppail-i ol-i 
place, „PL where.ıness man/guy-PL walk-Psr3(sc) be-PsT3(sG) 
tuttu-i. 
known-PTCP. PART. PL 
“The places where the men walked were known.’ 


Wiemer (2004: 504) adduces evidence for Lithuanian influence on the weakening 
of the 3sc/PL distinction in some Belorussian varieties. Characteristic for Uralic 
languages are possessive suffixes, which are lost in Southern Finnic (Estonian, 
Livonian, Votian), except in epic songs in Votian and some varieties of Estonian, 
where they are retained almost exclusively in formulaic vocativic invocations. 
Possessive suffixes are also rare in Veps. 

(c) Definiteness: Definite articles occur in most European languages except in 
the east and northeast. In Indo-European languages definite articles are an inno- 
vation which spread from South East Europe and there is a frequency cline Greek > 
Balkans > Italian > other Romance > Germanic/Celtic. Some few Slavic and Baltic 
language varieties have acquired definite articles (e.g. Bulgarian, some varieties of 
Sorbian). Early writings in Latvian written by German priests exhibit an article- 
like use of demonstratives and Nehrungskurisch has definite articles — at least in 


The Circum-Baltic languages 331 


the idolect of Pietsch (example 2). There are, however, several exceptions to the 
general definiteness cline across Europe. Northern Swedish dialects exhibit a much 
more extensive use of definite articles than other Swedish dialects and Standard 
Swedish, extending definite article usage even to indefinite contexts, but more ea- 
sily in single nouns than in complex noun phrases, as in (4). 


(4) Northern Swedish, Edsele (Bergvall 1991: 113) 
Ja ha gull-é 2 ddél ë stena 
I have:prs gold-DEF:N and dear INDF:PL stone:DEF/INDF 
‘I have gold and juwels.’ 


Uralic does not fully fit into the European definiteness cline. It is true that Hungar- 
ian, one of the two clear cases of article languages, is in contact with other article 
languages, but this does not apply to Mordvin, which has a complex definite de- 
clension having evolved from postposed demonstratives. A certain tendency to de- 
velop articles can be observed in Veps: the non-inflecting postposed particles se 
(sa) and ne (PL) of demonstrative origin (reminiscent of some Mordvin forms) 
have a very similar range of use as the postposed particle to in Russian dialects. 
However, the Veps particles are far from being simple definiteness markers and 
often indicate contrast (see example 5) in a similar way as the possessive suffixes 
of the 3rd and 2nd person in Mari, Komi, and Udmurt, which can also be attached 
to a large range of forms including personal pronouns: 


(5) (Southern) Veps (Sidorovo) (Zajceva and Mullonen 1969: 209) 


priha-d=ne edo ajo-i-ba, mä=se  jülghe 
young.man-PL-DEF:PL in.front drive-pstT-3pL, I=DEF:sG behind 
ajo-i-n 


drive-PsT-lsc 
“The boys went ahead (by vehicle), I followed behind.’ 


Another function of the Veps particles is to form indefinite pronouns, just like Rus- 
sian fo (Veps kuna-se, Russian kuda-to ‘somewhere [to]"). 

The development of definite articles in modern spoken Finnish is a matter of de- 
bate (Juvonen 2000). An ancient parallel is the development of definite forms of ad- 
jectives in Slavic, Baltic, and Germanic (with different formal origins in Germanic 
and in Slavic/Baltic and with different ranges of use in the different languages). 

(d) Pro-drop and expletive subjects: A characteristic and typologically rare fea- 
ture of SAE, more precisely of French and most Germanic languages is the obliga- 
tory explicit subject (“non pro-drop") even when semantically empty (so-called 
expletive subjects, as in English it rains, French il pleut, Swedish det regnar, but 
also Kashubian to padw). Expletive meteorological subjects are gradually expan- 
ding in some varieties of Latvian, interestingly enough however, without turning 
Latvian into a non pro-drop language. In colloquial Latvian, the pronoun viņš 
‘he’ is frequent but not oligatory with meteorological predicates (vins list ‘it rains’, 
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ungrammatical in Standard Latvian, where the pronoun is absent līst ‘[it] rains’). 
The expletive meteorological subject also occurs in some Tamian dialects that no 
longer have person marking but are not non pro-drop. Example (6) from Dundaga 
shows that the degree of emptiness of meteorological subjects can be gradual. The 
context is that people implored the vej-mat “Wind-Mother” and the dievini “gods 
(diminutive)' to stop the rain harmful to the hay harvest. Coming home, a woman 
is shouting what is quoted in (6), upset that viņs “it (?) /he (?) /she (?)" (there is no 
gender agreement outside of the noun phrase in this dialect) did not listen and just 
continued raining. 


(6) Tamian Latvian (Dundaga) (Draviniece 1995) 


Plav-e es iz-dzied-es un  iz-souc-es tik un tik, 
meadow-Loc I out-sing-PST:REFL and out-sing-PST:REFL only and only, 
bet vo vis klous! Vis tik | list! 


but o he listen:prs he only rain:prs 
*On the field Isung and implored as much as I could, but does it listen! It only 
rains!’ 


Further domains showing the tension between SAE and Central Eurasia in the 
CB area include predicative possession (KTW 675—679; Stern, this volume), 
comparative constructions (KPT 682-685, Wiemer 2004: 505—506), converbs, and 
relative clauses. 


4. Spreading “Uralic” and “Indo-European” properties 


Some diffusing properties in CB languages can be attributed to either Indo-Euro- 
pean or Uralic origin. Stress retraction to the initial syllable in Latvian is attributed 
to Finnic influence (but see KTW 638-640), Finnic has extended the use of nom- 
inal plural including pluralia tantum (see section 2 above), and Tamian Latvian 
dialects gradually lose gender due to Finnic influence (KTW 694—698). Another 
relevant example is the reflexive/middle voice discussed below. 

Indo-European languages are known for their detransitivizing tendencies in the 
verb. The traditional device for deriving intransitives from transitive, the middle 
voice, is lost in Baltic, Slavic, and the modern Germanic languages, after having 
left many traces in the active forms. It has been replaced by various middle strat- 
egies grammaticalized from reflexive constructions, with different systems in dif- 
ferent languages (see also Holvoet, this volume). Various kinds of interference 
can be observed especially in the contact situations between Lithuanian and Slavic. 
On the one hand, some Lithuanian dialects in contact with Belorussian extend the 
reflexive to the function of a passive on the basis of the Slavic model; on the other 
hand, some Belorussian and Polish varieties apply their reflexive markers in bene- 
factive function on the basis of the Lithuanian model (Wiemer 2004: 501—504). 
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Uralic languages, however, are rather transitivizing. Yet Finnic also had middle 
forms which happen to be lost in most modern Finnic languages except for two in- 
dependent "Southern" varieties, Veps and Southern Estonian. Veps exhibits certain 
parallels with Russian in the use of the middle, which are reinforced by detransi- 
tivized loanwords such as sogaas-ilíezoi ‘agree-PST3PL:REFL = they agreed’. In (7) 
there is a reflexive form with reciprocal function. Note also the Russian loanword 
oxota 'desire' (with a substitution of x by f), testifying to the many traces of contact 
in the realm of modality. 


(7) Veps (Zajceva and Mullonen 1969: 225) 
melhist -i-mois, mehe-ne affat män-da, a hä-nne 
please-Psr-lPL:REFL man-ADESs desire go-INF and:ADVERS he-ADESS 
affat nai-da. 
desire marry(woman)-INF 
“We liked each other, (I) wanted to marry and he wanted to marry.’ 


The Veps middle also extends to some functions of the Russian middle in passive 
function, such as kucu-se 'call-PRs3sG.REFL = is called’. 


5. Selected particular properties of CB languages within Europe 


5.1. Polytonicity 


Many CB languages have tonal oppositions. They can roughly be classified into 
three types: (a) tones on long syllable nuclei (especially diphthongs) in Baltic lan- 
guages (see also Holvoet, this volume; genealogically related to Serbian-Croatian 
and Slovene and Greek polytonicity), (b) overlength as a concomitant of reduction 
of non-initial syllables due to initial stress 1n Estonian, Livonian, and Tamian Lat- 
vian, and (c) phonologization of secondary stress associated with the original 
number of syllables of a word in most dialects of Norwegian, Swedish and Danish, 
but not in the Swedish dialects in contact with Finnish and Estonian. All three 
types are basically simple tone systems (i.e. with only one opposition). However, 
three tones exist in some dialects of Latvian and in Livonian, where types (a) and 
(b) meet and overlap. 

In Tamian Latvian dialects there is an overlap of (a) and (b). In the dialect of Dun- 
daga there are two tones on long syllable nuclei, broken (similar to the Danish stød, 
äbel) and drawn (bert). An additional tonal opposition is due to compensatory leng- 
thening of lost and reduced syllables and is realized in different ways: with short syl- 
lable nuclei as lengthening of unvoiced following consonants (pups), short nuclei 
with following voiced consonants as lengthening of the vowel realized as rising- 
falling or falling tone (rü5), with long syllable nucleus and broken tone as an over- 
long vowel or diphthong with broken-falling intonation (mié:Z), and with drawn 
tone it is neutralized (but some other dialects have overlong drawn-falling tone). 
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(8) Tamian Latvian, Curonia, Dundaga (Rudzite 2005: 212) 


tad siens ir koüz-es, labeb sa-3i:vt-s, rüs, miĝ:ž i 
then hay is heap-Loc:PL, corn together-live-PTCP, rye:PL, oats:PL is/also 
sa-bér-t apcirkn-es, un  uá:g-s un abel, pup-s... 


together-pour-PTCP chest-Loc:PL and berry-PL and apple:PL, bean-PL 
*then the hay is on stacks, the corn harvested, the rye and oats poured in 
chests, and berries and apples, beans ...’ 


Three tones exist also in some Central Latvian dialects, broken ^, drawn ^, and fal- 
ling ^. This is considered to be the original Latvian system, having developed from 
a two tone system (such as Lithuanian) under conditions of stress retraction to the 
first syllable. 

In High Lithuanian (Aukstaitian) every long syllable core has a tone but only 
those in stressed syllables are realized. In Low Lithuanian (Zemaitian), however, 
with partial stress retraction several syllables in a word can be stressed and carry 
tones. All vowels with superscript diacritics in (9) carry stress and tone. 


(9) Northwestern Zemaitian Lithuanian, Rietavas (Bacevičiūtė et al. 2004: 269) 


... mata: aram-tios maza bo:u™, dù aktàr-á:, 
see:PRS2SG arable-GEN little be:PsT3 two  hectare-PL 
nù  l'üb-à:m bó-Ib-ü: pa-si-sé:-s-mu. 


PART HABIT-PRSÍPL potato-GEN:PL PREV-REFL-SOW-FUT- 1PL 
*well there was little arable land, two hectares, well we used to plant potatoes." 


There are many further similarities and differences of tone systems: broken inton- 
ation (stød) occurs in all three types of tone systems in Danish, Livonian, most Lat- 
vian dialects and in Zemaitian, but has different origins. The functional load of 
tonal oppositions is generally low when compared to tonal languages of Central 
Africa, South East Asia, and Mesoamerica. 


5.2. Partitive and distributed head properties 


The semantically and syntactically determined alternation in the case marking of 
subjects, objects, and non-verbal predicates is one of the more widely discussed 
feature complexes in Finnic, Baltic, and Slavic (KTW 646—671). Major issues are 
the use of a partitive case or a case with partitive function (genitive) as subject or 
object (total vs. partial object), and the nominative object in Finnic, Russian dia- 
lects and Baltic (see Stern, this volume), triggered by different but similar factors 
in different languages, which lead to complex alternations in object marking. Par- 
titive genitive objects and nominative objects are attested in other Indo-European 
languages, too, but they are most characteristic in Baltic and Northern Slavic. In 
Finnic the extended use of the partitive (originally an ablative case) is a clear in- 
novation with marginal parallels in Mordvin. Not separable from the partial object 
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and subjects is the use of partitive or genitive case in numeral constructions, which 
are notorious for their complex case marking in Slavic, Baltic and Finnic (KTW 
698—704). This goes together with phenomena of distributed head properties that 
can frequently be observed in Finnic, Baltic, and Slavic languages, such as in (10) 
from Estonian: 


(10) Estonian (L. Vaba, p.c.) 
.. kõik-i | nende kolme hobuse-ga. 
all-GEN:PL that:GEN:PL three:GEN horse-coM:sG 
* ... With all those three horses’ 


The numeral kolm ‘three’ “governs” the partitive singular of the following noun 
(kolm hobus-t ‘three horse-PART.sG’) but not in oblique cases, where case is marked 
on both “head” noun and agreeing numeral (kolme hobuse 'three:GEN horse:GEN’). 
The singular number triggered by the numeral is retained but has no influence on 
attributes of the noun phrase to the left of the numeral, which show semantic rather 
than grammatical number, but all agree with the noun in case. A further compli- 
cation arises because the case in (10) is comitative, one of four Estonian cases 
marked only on the phrase-final “head” noun, requiring at the same time that all at- 
tributes have genitive case. The result is that all four words in (10) have some head 
properties: The oblique case is marked on the noun and determines case marking in 
the rest of the noun phrase; the numeral causes the noun to be singular; and both the 
quantifier and the demonstrative mark the semantic plural number of the phrase. 


5.3. Logophoric pronouns 


Finnish dialects (11) and High Latvian dialects (12) have logophoric pronouns 
(Nau 2006), a feature which is better known from African languages. Logophoric 
pronouns refer to the person whose words are expressed. 


(11) Finnish dialects (Nau 2006: 55-56) 
Se, sano, ett-ei hün;/se; voi tul-la 


3sG say[PRs3] that-NEG3 LoG/3sG can[PRs3] come-INF 
*(S)he; says that (s)he,/(s)he; won't be able to come’ 


The use of logophoric pronouns is associated with the lack of a clear distinction be- 
tween direct and indirect speech as in (12) from the Viläni High Latvian dialect: 


(12) High Latvian (Nau 2008) 
Jej sok-a taj iz ju: “Du-tu szej Heu as-t’...” 
she say-prs3 thus to he:acc give-cOND this:NOM:SG:F yOU:DAT eat-INF 
‘She says to him: ‘I [literally: this one] would give you to eat ...” 


Pronouns are very instructive because of the parallel differences between the stan- 
dard varieties and dialects in Finnish and Latvian. Standard Finnish and Standard 
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Latvian have a strict animacy distinction in pronouns (Finnish hän ‘(s)he’, se ‘it’, 
Latvian vins ‘he’, tas ‘it’). However, there are hardly any dialects of Finnish or Lat- 
vian making an animacy distinction. This is an invention of normative grammar- 
ians, independently imposed on Standard Finnish and Standard Latvian. Finnish 
hdn is used among other things as a logophoric marker in all dialects of Finnish 
(but is more grammaticalized in eastern varieties). In High Latvian (Latgalian) the 
demonstrative for speaker deixis Sys ‘this’ is used as a logophoric marker, while lo- 
gophoric pronouns are not reported for the central and Tamian dialects, which were 
in much stronger contact with Finnic. Neither are logophoric pronouns reported for 
Estonian and Livonian. This leaves us with a straightforward case of a typologi- 
cally highly marked but areally completely disconnected parallel whose origins are 
not clear. 


5.4. Evidentiality 


Estonian, Livonian, Latvian, and some Lithuanian dialects have an evidential 
mood (“modus relativus/obliquus") with past evidential forms being supplied by 
past participles (perfect without copula) and present evidential forms by a large 
range of non-finite verb forms including present participles in various case forms, 
infinitives, and nomina agentis. An evidential mood is not attested in Northern 
Finnic, Sami, and Mordvin but occurs in other Uralic languages further away, such 
as Mari. Evidentials in CB languages cannot be described appropriately without 
consideration of the register in which they occur, and the use and forms of evident- 
ials in standard languages strongly deviate from the dialects. An obvious example 
is the present evidential in Estonian, which derives from the Southern Estonian 
form (Tartu region, present participle in the partitive singular form -var), even 
though Northern Estonian is the basis for Standard Estonian. Kask (1984: 244—251) 
shows how this has come about in a complex history. In Livonian, Estonian, and 
Low Latvian dialects narrative and indirect reported evidence functions are domi- 
nant while the inferring and the admirative functions are more characteristic of the 
Lithuanian dialects with evidential forms. Nau (2008) argues that non-finite predi- 
cates in the High Latvian dialect of Viläni are not instances of evidential mood, as 
elsewhere in the languages of the region, but a voicing device, a technique to con- 
struct dialogues (together with the logophoric pronouns discussed above). 


5.5. No or only weakly grammaticalized future tense 


North European languages are known for their lack of future tense and for verbs of 
becoming used in a similar way as future copulas (Dahl 2000). Lack of a future 
tense has to be assumed for common Finnic, Germanic, and Slavic. Future marking 
is largely absent in Finnic (most markedly in Modern Estonian). Germanic lan- 
guages, except for English, have weakly grammaticalized periphrastic future con- 


The Circum-Baltic languages 337 


structions. In Russian the present tense of the perfective is used as a future and 
there is a periphrastic construction for the imperfective aspect. However, an old fu- 
ture tense derived from the Indo-European desiderative is heavily entrenched in all 
varieties of Baltic. The presence of a future tense in Latvian and the absence of a 
future tense of activity verbs in all Finnic contact varieties is one of the clearest 
structural differences. However, several varieties of Finnic have future marking 
restricted to the verb *be', orginating from a suppletive stem which is used for the 
potentialis mood in Finnish (lienee ‘might be’). (The potentialis is lost in most var- 
ieties of Finnic.) Such a future copula occurs in Veps and in (Curonian) Livonian 
and to a lesser extent in some insular dialects of Estonian adjacent to Livonian 
(Kihnu and Southern Saaremaa) but also in Old Estonian, where it is attributed to 
German influence (Ariste 1954: 287). These contact varieties of Finnic at the 
southern border are better examples for a future restricted to *be' than the parallel 
case of Hungarian discussed by Dahl (2000). In Livonian there is even a future per- 
fect formed with IT- as an auxiliary: 


(13) Curonian Livonian (Setälä 1953: 107) 


ku sind li-d sie kure jara tapp-on, siz sind 
when you FUT.AUX-2sG that:acc devil away kill-prcp then you 
rotk min jur tegiz! 


hurry:ımp2sG LGEN to again 
“When you will have killed this devil, then hurry back to me!’ 


What is remarkable in this case is that Livonian has been inspired to develop a 
more marked form of the future, the future perfect, but not the simple future (ex- 
cept for the copula). The Curonian Livonian lT- with infinitive (and optional dative 
agent) is further used for deontic necessitive modality in the same way as bis 
‘be:FUT3 ’ in Old Latvian, a function which is interestingly also attested in Veps. 
Salis-Livonian had a periphrastic future with infinitive and the same verb Ir- as 


6. Contact phenomena leading to less structural similarity 
and substrate effects 


Contact-induced change does not always result in greater structural similarity with 
the contact language. Independently, Curonian Livonian and Leivu, a Southern Es- 
tonian enclave in the northern Livonian part of Latvian without particular genea- 
logical relationship to Livonian within Finnic, have diphthongized long 6 and e 
and, what is more remarkable, short o and e (Livonian kiera ‘ball [of wool]’, k“of 
‘sack’, Estonian kera, kott). This is obviously due to the lack of a long and short o 
in Latvian, with an extension in the system also to front medial vowels. That this 
must be due to language contact can only be demonstrated because there is an in- 
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dependent parallel development in Livonian and Leivu. Latvian has no short 
diphthtongs, except in a very narrow area around Cape Kolka in immediate neigh- 
borhood to Livonian in Dundaga and Ance. There the diphthongs ie and uo in non- 
initial (unstressed) syllables are not reduced to e and a as elsewhere in Tamian Lat- 
vian but to short diphthongs ‘e and "o, which is how the change induced by Latvian 
in Livonian returns to Latvian in another form (and adds structural complexity). 
For further parallel sound changes involving Livonian see Ariste (1954). 

Other “covert” cases of contact-induced change are due to substrate effects of 
extinct varieties. There are many structural parallels between language varieties 
in Curonia which are not attested in the varieties of the same languages outside of 
Curonia (Curonian Livonian vs. Salis-Livonian, Curonian Latvian dialects vs. the 
rest, Western Lithuanian dialects vs. Eastern), at least partly due to the substrate 
of an extinct Baltic language called Curonian, which was probably West Baltic 
(like Old Prussian). An example is the use of the verb ‘live’ with the meaning 
‘work, harvest’; consider Livonian je ’lö ku’j-ainö ‘live:INF dry-hay:PART > harvest 
hay’ and Dundaga Latvian 3i:v-t soüs-sien ‘live:inF dry-hay:Acc > harvest hay’ 
(Draviniece 1995: 93, see also example 8 above). The parallel in the compound 
“dry-hay” is due to the polysemy of Livonian dina “grass; hay’, which is not par- 
alleled in Latvian, but obviously taken over from Livonian in Dundaga Latvian (but 
not elsewhere in Latvia). Another Curonism is the lack of a comparative form of 
adjectives and the use of a preposed particle juo instead. This is a Latvian form (juo 
"the more") which is borrowed by Livonian (jo/juo kougin ‘comer far > farther’). 


7. Conclusions 


The traditional approach to linguistic areas is to survey few selected features in few 
major languages (often standard varieties). Such an approach is hardly promising. 
Even in an area with few languages, such as the CB area, there are hundreds of lan- 
guage varieties whose contacts deserve to be studied, and there are hundreds 
of promising features to be considered if the lexicon and phonology are to be given 
due attention, too (see also Stern, this volume, for additional features). Itis particu- 
larly dangerous to focus on standard languages only since these often deviate in 
systematic ways from all non-standard varieties. While it is impossible to give 
a full survey of the CB area here, the aim of this article has been to draw attention 
to some minor languages and non-standard varieties and to some less prominent 
potential contact features. These serve to illustrate that, on the one hand, traces of 
language contacts are omnipresent but that, on the other hand, every contact var- 
iety, as long as it is not extinct, retains its own system and deviates in many ways 
from a simplistic code copying model in unpredictable ways. Therefore, exploring 
language contacts in the CB area cannot restrict itself to simply arguing in favour 
or against contact. Rather, the major aim should be to document the wide range of 
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varieties in the context of the structural diversity of the whole area and in connec- 
tion with the macro-area Eurasian, which the CB region is only a small part of. 


See also the following chapters in this volume: 1 by Holvoet, 8 by Laakso, 19 by 
Óstman and 20 by Stern. 


Notes 


1 While writing this paper I was supported by the Swiss National Science foundation 
(PP001-114840). Abbreviations not contained in the Leipzig Glossing Rules: ADEss 
adessive, ADVERSATIVE advers, COMPR comparative, EX existential, HABIT habitual, INEsS 
inessive, PART partitive, PREV preverb. 
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4. Conclusions 

1. Language contact without linguistic areas: the Mediterranean case 


A recent trend in areal typology (represented in works by Dahl 2001, Stolz 2006, 
and Campbell 2006 among others) tends to consider linguistic areas as largely ar- 
tificial linguistic constructs. A linguistic area is the simple accumulation “of indi- 
vidual cases of ‘localized diffusion’” (Campbell 2006: 18) that have taken place 
in a given area: many individual instances of borrowing and diffusion may create 
the impression of an “overall macro-contact among the languages, which has not 
necessarily been there” (Koptjevskaja Tamm 2002: 219). The mechanisms at play 
in these cases do not differ in any significant way from those involved in other in- 
stances of language contact and diffusion. 

All the areal-typological surveys aimed at assessing the impact of language 
contact across the Mediterranean (see, e.g., the papers collected in Ramat and Stolz 
2002) agree to conclude that we cannot think of the Mediterranean sea as a lin- 
guistic area in the same sense as we can speak of a Balkan Sprachbund or a Stan- 
dard Average European (SAE) area. While there are several contact phenomena at 
the micro-level, involving only two or three languages at a time, very few (if any) 
linguistic traits qualify as truly Mediterranean while being, at the same time, typo- 
logically not commonplace or, at least, rare or absent in the neighbouring non- 
Mediterranean languages. 

As Ramat (2002: xiv) correctly observes, “this is certainly not surprising: a 
strong linguistic integration presupposes a high number of bilingual or even multi- 
lingual speakers, as was the case in Anglo-Norman England. After the fall of the 
Roman Empire, such a situation never occurred on a comprehensive Mediterra- 
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nean scale". Contacts in the Mediterranean are, instead, those typically triggered 
by movements across the sea, which differ from those on land in that they normally 
involve a limited number of people, with fewer chances that “the newcomers will 
“sweep” through the area in the way characteristic of spread zones" (Koptjevskaja 
Tamm 2002: 218). Even language contacts involving larger communities, as in the 
case of the Arabic dominations of Spain and Sicily, have not resulted in heavy 
structural borrowing, possibly owing to the progressive marginalization of the 
Arabic-speaking population following the Islamic decline. 

Yet, under the view mentioned above, the Mediterranean sea is an interesting 
object of investigation for areal linguistics, as languages spoken in this area display 
a sum of many borrowing phenomena “of different sorts, for different reasons, in 
different settings and at different times" (Campbell 2006: 13). The current linguistic 
landscape of the Mediterranean has been intensively shaped by language contacts, 
migrations, and colonizations so that it is not inappropriate to think of the Mediter- 
ranean as made up of a number of contact zones at the outskirts of the SAE area. 

In what follows, I shall first (section 2) discuss the historical scenarios lying 
behind these borrowing phenomena across the Mediterranean, ranging from pro- 
cesses of irradiation starting from Late Latin and Koiné Greek (2.1—2.2) to lan- 
guage contacts through the Middle Ages. As regards the latter, the role of Arabic 
(2.3) and the so-called Lingua Franca (2.4) in the Middle Ages will be discussed in 
turn, and special attention will be paid to contact phenomena in “insular” Mediter- 
ranean languages such as Maltese and Pantesco (2.5). In section 3, I will focus on 
two instructive case studies in which language contact has possibly fostered the 
spread of grammatical features across the Mediterranean area: the development of 
definite articles (3.1), and the polysemy patterns of augmentative suffixes (3.2). 
Finally, I shall provide an overall evaluation of language contact in the Mediterra- 
nean in section 4. 


2. Language dynamics and language contact in the Mediterranean: 
historical scenarios 


Apart from prehistoric contacts (see the chapters by Mailhammer and Ramat in this 
volume), the linguistic landscape of today's Mediterranean appears to be the result 
of contacts that have their ultimate origins in Late Antiquity and the Middle Ages. 
Arbitrary as any periodization can be, the following stages in the history of lan- 
guage contact in the Mediterranean region may be recognized: 


(a) From Alexander the Great to the fall of the Roman Empire (4th century BC — 
4th century AD). 


i. The empire of Alexander the Great, possibly the first “trans-national” 
political entity in the history of the Mediterranean, favoured the spread of 
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the Greek language throughout the eastern Mediterranean. The major cities 
of this area at that time became part of Alexander’s empire and were active 
centers in the transmission and elaboration of Greek ideas and culture. Al- 
though this empire quickly disintegrated, Greek remained stable as the lan- 
guage of culture, administration and communication in the eastern part of 
the Mediterranean for centuries. 


. As the Roman Empire emerged as a new political power, inheriting the 


model of a trans-national imperium centred on the Mediterranean sea, Latin 
became the communication language in the western Mediterranean. The 
cultural unity of the Mediterranean was further strengthened by the emerg- 
ence and spread of Christianity, which was imposed as the official faith of 
the Empire and used both Greek (in the east) and Latin (in the west) as com- 
munication languages. 


(b) Movements of peoples from the Early Middle Ages to the beginning of the first 
millennium AD (4th—11th centuries AD). 


1. 


The east vs. west linguistic divide in the Mediterranean remained substan- 
tially unchanged until the Arab conquest. The tribes (Vandals, Sueves, 
Alans, etc.) that settled in the western part of the empire (Gaul, Iberia, and 
the Maghreb; 4th—5th centuries) were soon assimilated into the local cul- 
ture, and never seriously threatened the dominance of Latin as a communi- 
cation language in the west. On the other hand, the Byzantine Empire per- 
petuated the prestige of the Greek language in the east. Although the 
western islands (Corsica, Sardinia, Sicily and the smaller islands, including 
Malta and Pantelleria) were under the Byzantine rule, this did not imply the 
(re)introduction of Greek: immigration of Greek speakers was very low, 
and must have been balanced at any rate by incoming Latin-speaking im- 
migrants from Christian Africa, following the persecutions by Vandals and 
Arabs from the 5th to the 7th century (Brincat 2002: 70—71). 


. Starting from the 7th century, the Arab armies swept through most of the 


Middle East and Northern Africa (7th century), invaded Pantelleria (700), 
Visigothic Spain (711), Sicily (827) and Malta (870). At its greatest extent 
(8th-9th centuries), the Arab Empire controlled 3/4 of the Mediterranean 
region. The Arabs imported a rich intellectual tradition from the Middle 
East and North Africa. Muslim and Jewish scholars played a great part 
in reviving and expanding classical Greek learning in Western Europe. The 
Romanized cultures of Sicily and the Iberian peninsula interacted with 
Muslim and Jewish cultures in complex ways, thus giving these regions 
a distinctive cultural profile. Andalusian Arabic, the variety of the Arabic 
language spoken in Spain, had strong lexical influence on Mozarabic, the 
continuum of closely related Romance dialects spoken in Muslim-domi- 
nated areas of the Iberian Peninsula. A large part of Sicily is described as 


344 


lii. 


e 


Andrea Sansó 


Arabic-speaking even after 1250 (death of the Emperor Friedrich the 2nd). 
Malta was attacked by the Arabs and left in ruins (870) and was later re- 
populated by an Arabic-speaking community from Sicily, which settled on 
the island in 1048-1049. 

The Seljuk Turks advanced from their homelands near the Aral Sea first 
into Khorasan and then into mainland Persia before eventually conquering 
eastern Anatolia in 1068. 


(c) Later contacts (11th—15th centuries and beyond). 


1. 


li. 


lii. 


iv. 


The Normans, with the assistance of the Italian city-states of Genoa and 
Pisa, took Sicily from the Arabs (1068-1091), and at the same time the 
Spanish Christian kingdoms progressively expanded at the expenses of the 
Muslim domination in the Iberian peninsula (the so-called Reconquista, 
which came to an end with the conquest of Granada in 1492). By the end of 
this period, the linguistic landscape of the Mediterranean Sea was substan- 
tially similar to the present-day distribution of languages in this area. 

The Crusades (11th-13th centuries), the main historical event connecting 
and opposing the eastern and the western part of the Mediterranean, had im- 
portant consequences in terms of language contact. The need to transport 
and supply large armies led to a renaissance of trade throughout the Medi- 
terranean. More or less ephemeral “Crusader states" such as the Kingdom 
of Jerusalem (1099-1291), the Counties of Tripoli (1109-1289) and Edessa 
(1098-1144), the Principality of Antioch (1098-1268) were founded in the 
Levant as a consequence of the Crusades. The population of these states 
communicated in vernacular forms of French and Italian. Greek, Armenian, 
and even Arabic were also not uncommonly mastered by Crusader settlers 
(Metzeltin 1998: 602). 

By the very same time, trade from the East to Europe was soon in the hands 
of Italian traders, especially the Venetians, who acquired control of the 
Adriatic sea and of most of the islands in the Aegean, including Cyprus and 
Crete. Between the 15th and the 18th centuries, "se constituyó en boca 
oriental una imitación del italiano tendencial con vetas vénetas usado por 
los sábditos de la Serenísima y percibido por los extranjeros como funda- 
mentalmente italiano” (Metzeltin 1998: 605) [“A variety imitating Italian, 
with a Venetian flavour, developed among eastern speakers. It was used by 
people living under the Venetian rule and perceived by foreigners as being 
fundamentally Italian” ]. 

The Ottoman power continued to grow, and in 1453 the Byzantine Empire 
was extinguished with the fall of Constantinople. The Ottomans already 
controlled Greece and much of the Balkans, and soon began to spread 
through North Africa. The island of Cyprus became Ottoman in 1571, and a 
generation-long siege in Crete pushed Venetians out of this island in 1669. 


2.1. 
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A balance of power was then established between Spain and the Ottoman 
Empire until the 18th century, each dominating their respective half of the 
Mediterranean, making Genoa and Venice increasingly irrelevant as naval 
powers. Turkish spread throughout the Balkans as the official language of 
the Ottoman Empire. 

By the 19th century the European States began to colonize North Africa. 
France took Algeria in 1830 and later Tunisia. Britain gained control of 
Egypt in 1882. French and English came to serve as primary languages of 
commerce, administration, and economics, and their diffusion in North 
Africa has survived the dissolution of the colonial empires. 


Greek and Latin in the Late Antiquity 


Starting from Roman imperial times, Latin and Greek began developing some 
common typological features and gave rise to a sort of cultural koiné which spread 
all over Christian Europe. In most of these instances of convergence, Greek played 
the role of model language and Latin mostly that of replica language. Most of these 
developments were then inherited by the early Romance vernaculars. Leaving 
aside the impressive amount of lexical loans from Greek, the following cases are 
often cited as examples of (more or less heavy) structural borrowing (i.e. involving 
morphology and syntax) in Late (Christian) Latin: 


(i) 


(ai) 


(iii) 


(iv) 


some Greek suffixes became productive as a result of the huge amount of 
Greek lexical loans in Christian Latin, and perpetuated in Romance vernacu- 
lars (e.g. the suffix -ía, which formed abstract nouns from adjectives; the ver- 
bal suffix -izare, derived from Greek -ízein); 

the article-like use of ille in Christian Latin has possibly been reinforced by 
the existence of a definite article in Greek. In the Bible translation, ille cor- 
responds to the Greek definite article in 115 passages (Dietrich 1998: 125); 
a couple of future/prospective constructions of Late Latin (and early Ro- 
mance vernaculars) are said to depend on a Greek model: the construction 
habeo + infinitive with an obligative meaning closely matches the Greek 
construction with ékho + infinitive, while the prospective construction 
formed with a verb meaning ‘go’ + infinitive (e.g. French je vais chanter, 
Spanish voy a cantar), though already attested in Archaic Latin, might have 
been reinforced by the corresponding Greek construction with érkhomai + 
future participle (Dietrich 1998: 126); 

the development of object complement clauses introduced by quod instead 
of the accusativus cum infinitivo is said to depend on Greek complement 
clauses introduced by höti. 
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Apart from (1), these cases of convergence represent instances of universal paths of 
grammaticalization and thus, in principle, they might have developed indepen- 
dently from the Greek model. It is only on external grounds (ultimately, on the 
basis of what we know from the history of these two languages) that we can postu- 
late the dependence of these Latin developments on (Bible/Koiné) Greek. This 
fact, however, should not blind us as to the possibility that similar cases might not 
be the result of language contact. For instance, the correspondence between the 
construction habeo + past (passive) participle in Latin and the construction ekhö + 
perfect (passive) participle in Greek is possibly due to parallel independent devel- 
opments, as the Greek construction has never enjoyed wide currency in Greek 
(Dietrich 1998: 126). 

It must be emphasized that Latin and Greek, as the official languages of West- 
ern and Eastern Europe respectively, and as the languages of the holy texts, have 
probably played a non-secondary role in the transmission of these (and other) traits 
to those languages with which they were in contact: (Late) Latin as a written lan- 
guage extensively shaped vernacular varieties not only in Romance-speaking ter- 
ritories but also in western German territories, where the oldest documents are 
adaptations of religious texts such as Heliand and Otfrid; (Byzantine) Greek, on 
the other hand, was the dominant language in South-Eastern Europe. A paradig- 
matic example of the influence of the Greek culture on the development of local 
vernaculars is the enterprise of the two Greek missionaries Cyril and Methodius 
(9th century). They devised a new alphabet for the Slavic people (the so-called 
Glagolitic alphabet), and shaped a largely artificial language (Old Church Slav- 
onic) whose lexicon and syntax was heavily modelled on Byzantine Greek, and 
which served as a koiné language for Slavic populations throughout the Balkans.! 


2:2: Greek in the Middle Ages: the Balkans and beyond 


Although the sources of the various linguistic Balkanisms are much disputed, the 
ultimate origin of at least some of them is to be sought for in Byzantine Greek. 
These cases will not be dealt with in detail here (see the chapters by Tomié and Hill 
in this volume). On a wider Mediterranean scale, however, these cases are interest- 
ing to the extent that they are attested outside the Balkans in areas that were either 
in close contact with Greek-speaking communities or massively Hellenized. One 
case in point is southern Italy, where the following “Balkanisms” are found in the 
local Romance varieties:? 


(i) the substitution of the infinitive by means of a subordinate clause (the Cal- 
abrese type vogghiu mu/mi mangiu, ‘I want to eat’ [lit.: I.want that Leat]); 
(ii) the use of the remote past instead of the perfect in southern Calabria, Sicily, 
and Salento (e.g. Sicilian sta matina chiuviu, ‘this morning it rained’, opposed 
to Standard Italian/Tuscan stamattina é piovuto, ‘this morning it has rained’) 
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has developed, according to Rohlfs (1947: 17—20), under the influence of the 
Greek aorist, which is much more frequent than the perfect in Southern Ita- 
lian Greek; 

(ii) the distinction between a realis and an irrealis complementizer (the former 
used after verbs of saying, the latter after verbs of desire; e.g. Salentino cris- 
ciu ca vene, ‘I believe that he/she [will] come[s]’ vs. ogghiu cu mmancia, 
‘I want him/her to eat’) matches the Greek distinction between höti and na. 


2.3. Language contact in the Middle Ages: the role of Arabic 


The influence of Arabic on the languages of the western territories conquered by 
Arabs was never pervasive, except for a conspicuous amount of lexical loans: 


*Was ist vom Arabischen im Iberoromanischen geblieben? [...] kein arabisches Pho- 
nem in die Iberoromanischen Sprachen gelangt ist. Auch die morphologischen Ele- 
mente sind äußerst spärlich [...] Die syntaktischen Arabismen älterer Sprachepochen 
wirken nicht mehr nach. So müssen wir uns auf den Wortschatz konzentrieren" (Kontzi 
1998: 337). [ “What Arabic features have remained in Ibero-Romance? ... No Arabic 
phoneme has survived in the Ibero-Romance languages. Also morphological elements 
are extremely scanty ... The syntactic Arabisms of earlier linguistic stages have no 
currency anymore. Therefore, we must concentrate on the lexicon"] 


Kontzi's statement holds true also for Sicilian and, more generally, for Italo- 
Romance varieties. In Spanish, however, some examples of structural borrowing 
can be mentioned. One such example is the diffusion of the suffix - (forming eth- 
nonyms from place names: marroquí, ceuti, etc.). Another case in point is the 
Medieval Spanish use of a relativization strategy corresponding to the Arabic strat- 
egy relative pronoun + resumptive pronoun? (e.g. Et los rayosos son aquellos 
los cuales en ellos ... a dos forados, ‘and the striped ones are those in which there 
are two holes"): this strategy is particularly widespread in texts that are known to 
be re-workings or translations of Arabic originals, but enjoys also some vitality in 
texts that are not translations (Kontzi 1998: 334—335). 


2.4. Sailors, trades, and the Lingua Franca 


In the so-called “Crusader states" (see section 2c) different western ethnic groups 
used to settle in different quarters of the cities, so that each group could, to a certain 
extent, maintain its own language. In such a multilingual environment there is 
direct evidence that at least some people were bi- or even multilingual. Written 
documents from these states were frequently compiled in different Romance and 
non-Romance varieties: for instance, three versions (a French, a Venetian, and a 
Greek one) are known of the so-called Assizes of Jerusalem, a collection of the 
laws of the Kingdom of Jerusalem, and there are four extant versions (in Greek, 
French, Italian, and Aragonese) of the Chronicle of Morea (a 14th century text that 
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narrates the establishment of Franks in mainland Greece, specifically in Pelopon- 
nese, following the Fourth Crusade). In some other cases the amount of code- 
mixing makes it almost impossible to establish the language of some documents 
(this is the case of a Cypriot account written in 1423, analyzed by Cortelazzo 1989: 
389, in which Italian and French are so intermingled that they form a sort of mixed 
variety). 

In these multilingual communities of the Mediterranean Levant, a functionally 
restricted secondary language arose which was massively used in the domains of 
navigation and trade. This variety, traditionally known as Lingua Franca (or Sabir 
« Spanish saber 'to know, be able"), is a simplified version of Italian characterized 
by the following morphosyntactic features (Kahane and Kahane 1976: 36—38; 
Metzeltin 1998: 605-606): 


(i) the infinitive is used instead of the present and the future of the indicative; 
there are no moods: Smirne non aver bisogno di tua persona. Se voler andar 
Turchia, io ti mandar Costantinopoli (‘Izmir does not need you [lit.: your 
person]. If you want to go to Turkey, Pl send you to Costantinople’ ); 

(i) the past participle is used as a past form instead of the past indicative: mi 
visto propri occhi (‘I saw [it] with my own eyes’); 

(iii) tonic object pronouns are used as subject pronouns: mi aver tanto piacere de 
mia cara Annina (‘I am very much pleased about my dear Annina’); 

(iv) the construction ‘2nd person possessive + persona’ is used instead of the 2nd 
person singular pronoun (Smirne non aver bisogno di tua persona); 

(v) goal complements lack any preposition: se voler andar [ø] Turchia (‘if you 
want to go to Turkey’); 

(vi) human objects are frequently introduced by means of a preposition: mi bas- 
tonar per ti (tT ll beat you’); 

(vii) no articles are used: Tutta [0] compagnia venir casa mia (‘all the crew is 
coming to my house"); 

(viii) stare, ‘stay’, is generalized instead of essere, ‘be’: Tu chi star? (‘who are 
you?’). 


It is almost self-evident that many of these features are those typical of a stable 
pidgin. Overall, the Mediterranean Lingua Franca can be considered as a trade 
pidgin with Italian as its (primary) lexical source language. Different varieties 
of this language were spoken not only in the Crusader states in the East, but also 
in other areas in which commerce with the Italian city-states required the use of 
a shared means of communication (the Adriatic sea, Tunis, Algier). The lexicon 
of the northern African Lingua Franca appears to be more heavily influenced 
by French and Spanish: quando moi gagner drahem, moi achetir moukere (‘when 
I will earn money, I will take wife'; Schuchardt 1909: 457—459). 
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2.5. Language contact on Western Mediterranean islands 


The genesis and evolution of a trade pidgin had as a side-effect that the speakers 
of native languages spoken in the Mediterranean Levant did not give up their 
languages in order to adopt the language of the dominant western occupants. The 
establishing of a restricted contact variety, which was fully functional in everyday 
interactions among heteroglossic groups, somehow prevented cases of (massive) 
language shift and borrowing among languages in the eastern territories. The con- 
ditions that favour this type of change never existed in the Mediterranean Levant 
during the Middle Ages: no dominant language ever played the role of a target lan- 
guage in this region, nor did there exist intensive (and multi-lateral) language con- 
tacts involving more than the local elites who were most exposed to the dominant 
languages, i.e. sailors and traders. 

By contrast, it is in the languages spoken on the small islands of the western 
Mediterranean that we find the most significant cases of change in grammatical 
organisation driven by grammatical calquing. This type of change is the result of 
speakers’ bilingualism in other languages and typically involves the copying of 
constructional meanings from this/these language(s) and the creation of new struc- 
tures using inherited material to express these meanings. As a result, there is some 
degree of reorganization of semantic patterns (e, of “ways of saying things’) in 
the replica language, along with the restructuring of the patterns in which morpho- 
syntactic units are concatenated to form sentences, clauses, and phrases. 

Although the typically Arabic morphology of Maltese appears to be rather 
resistant to contact-induced change, there are some well-known morphosyntactic 
calques from Italian in Maltese, including, for instance, the possibility of forming 
three passive constructions with the same building blocks of the corresponding 
Italian constructions (the auxiliaries kien ‘be’, gie ‘come’, and baga’ ‘remain’). 
Another morphosyntactic feature of Maltese, the so-called differential object 
marking, is considered by some authors (e.g. Borg and Mifsud 2002; Ramat 2003) 
to be a good candidate for the status of a ‘Mediterraneanism’ (whatever value 
is attached to this term): direct objects in Maltese are obligatorily marked by 
a preceding particle (li)! if they are proper nouns or stressed personal pronouns 
(It-tifel ra lil Marija, ‘The boy saw Mary’, lit.: ART-boy saw to Mary). The particle 
(li) is connected to the preposition lil ‘to’. The same pattern is found in many 
Romance languages, ranging from Portuguese and Spanish to substandard var- 
ieties of Southern France, Corsican, and (Central and Southern) Italian dialects, 
but is also present in Old and Modern Hebrew, and in some eastern Arabic dialects 
(Lebanese, Cypriot Arabic). The choice of differential object marking is to vari- 
ous degrees conventionalized in all these varieties, but nevertheless it appears 
to be triggered by a mixture of animacy and definiteness: while in Romance lan- 
guages animacy appears to be crucial, in Modern Hebrew definiteness is the domi- 
nant factor. 
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Pantesco, the Sicilian-based dialect of Pantelleria, is another case in point 
(Brincat 2002: 81—82). Acquired by Arabic speakers over a long period of time 
in which they had become practically bilingual, Pantesco is characterized by sev- 
eral cases of grammatical calquing. Particularly significant among these cases 
are the following two: (1) the existence of a pluperfect formed with the invariable 
third person of the imperfect indicative of essere *be' (era) plus the remote past 
(era Cama, era cámásti, era cámáu, etc., ‘I had called, you had called, he had 
called, etc.’, lit.: he.was I.called, he.was you.called, etc.), very much like in Mod- 
ern Standard Arabic (and in Maltese, too): kaana Zaidun kataba ‘Zaid had written’ 
(lit.: be.PFv.3sc Zaid write.PrFv.3sc); and (ii) the coding of deontic modality by 
means of a construction with volere (‘to want’) with the meaning of dovere 
(‘must’), as in stu müru si voli autigghiári ‘this wall must be raised’ (lit.: ‘this wall 
REFL Wants raise"), a grammaticalization pattern that Pantesco shares with Maltese 
(dak li hu miktub irid jingaraftrid taqrah ‘what has been written must/should be 
read’, lit. ‘wants to be read’) and Sardinian (e.g. Logudorese cheret fattu ‘it must / 
has to be done’, lit: “wants done’; Putzu 2005: 159). 


3. Case studies 


In what follows, I will briefly discuss two instructive case studies that show the 
relevance of language contact in the development and transmission of linguistic 
features across the Mediterranean, even in the absence of a proper Mediterranean 
linguistic area. The first case study (3.1) deals with a feature that is highly con- 
tagious in situations of language contact, i.e. the definite article. In 3.2 I will ac- 
count for a polysemy pattern of morphemes in some Mediterranean languages that 
appears to be the result of (semantic) calquing from Greek and Latin. 


3.1. Definite articles 


Definite articles are particularly contagious in situations of language contact. The 
Mediterranean area is no exception to this rule. Putzu (2002; see also Putzu and 
Ramat 2001: 111—129) has traced a general history of the emergence of the definite 
article throughout the Mediterranean: the first Mediterranean language in which 
the definite article is attested is Ancient Egyptian (starting from documents of the 
New Kingdom, especially in the less formal registers of the language). In Semitic 
languages the article appears during the 1st millennium BC, whereas in the 7th 
century BC it emerges and stabilizes in Homeric Greek. The same “east-to-west 
gradient" (Putzu 2002: 254) characterizes all the "great civilization movements [in 
the Mediterranean area], including urbanism and the spread of writing" (Putzu 
2002: 254). Whether such civilization movements created the crucial socioling- 
uistic prerequisites for the borrowing of the definite article is of course undecid- 
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able, and it is perfectly plausible to hypothesize that articles have emerged as the 
result of general “requirements of discursive efficacy” (Putzu and Ramat 2001: 
129). However, there are also clues that the emergence of definite articles was re- 
inforced by contacts with model languages in which a definite article was already 
established: in more recent times, the fact that the article-like use of ille in Vulgar 
Latin is particularly frequent in holy texts, where it corresponds to the Greek defi- 
nite article, is suggestive of a process of grammatical borrowing with Greek as the 
source language. The same holds true for Slavonic languages: only those which 
have been subject to strong areal influence from Greek have developed a definite 
article. 


3.2. Augmentative suffixes 


Augmentative suffixes are used to derive typical evaluative forms whose interpre- 
tation corresponds to the paraphrase “big X". In some Mediterranean languages, 
these suffixes have an additional semantic reading which can be represented by the 
paraphrase “one who has/is/makes X to a high degree". For instance, the suffix -ás 
in Modern Greek is used to derive evaluative forms such as kefal-ds ‘big head’, and 
forms referring to human beings characterized by a showy property such as glossäs 
*chatter box, loquacious person'. The same polysemy is attested in other Mediter- 
ranean languages (e.g. Spanish caj-ón ‘large case’, and barrig-ón ‘big-bellied per- 
son’; Italian barc-one ‘big boat’, and brontol-one ‘grumbler’; Moroccan Arabic 
sbae-un ‘enormous lion’, and zolt-un ‘a very poor person’; Bulgarian Zen-iste ‘big, 
hefty woman’, and Cudö-v-iste ‘monster’, derived from Cudo ‘wonder’). Grandi 
(2002) has convincingly shown that this polysemy, which is very rare among 
the world's languages, is a feature that goes back to the Latin-Greek koiné of Late 
Antiquity. To be more precise, under the influence of Greek -iön and -ías/-ás, the 
Latin suffix -(i)o/-(i)onis (originally deriving nouns generically referring to human 
beings: centurio ‘centurion’, pellio ‘furrier’, etc.) began developing a (derogatory) 
meaning roughly paraphrasable as "one who has/is/makes X to a high degree". In 
Late Latin and Koiné Greek, these suffixes came to designate not only the posses- 
sor of an unusual property, but the property itself. The spread of this polysemy pat- 
tern has possibly been propelled by the power of this cultural and linguistic koiné, 
as demonstrated by the present-day distribution of these polysemous suffixes in the 
Mediterranean sub-area in which the pressure of Latin and Greek was stronger. 


4. Conclusions 
Some of the cases of borrowing mentioned in the present survey fall between the 


categories of “slightly more structural borrowing" and “moderate/heavy structural 
borrowing" on the borrowing scale devised by Thomason and Kaufman (1988: 
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74-75). “Slightly more structural borrowing” is a cover term for various phenom- 
ena including, for instance, the abstraction of derivational affixes from borrowed 
words and their addition to native vocabulary (as in the two cases mentioned above 
of the Latin suffix -izare, from Greek, and the Spanish suffix -£, from Arabic). 
“Moderate” and “heavy” structural borrowing represent a continuum of borrowing 
types which refer to changes in structure that involve the re-creation of construc- 
tional meanings in a target language by means of inherited material. The spread of 
the definite article from east to west, the Maltese differential object marking, and 
the Pantesco pluperfect mentioned in 2.5 may be included in this category along 
with the Late Latin temporal and aspectual constructions discussed in 2.1 — pro- 
vided that the hypothesis of their ultimate Greek origin is accepted — and the so- 
called “Balkanisms” of Southern Italian dialects exemplified in 2.3. 

These changes are all diagnostic of "reasonably intense cultural pressure" 
(Thomason and Kaufman 1988: 76) exerted, from time to time, by a given model 
language on one or more replica language(s), and confirm that the areal perspective 
is essential when studying the process of formation of the Mediterranean linguistic 
landscape. The absence of a global Mediterranean linguistic area does not exclude 
that micro-areas exist which mirror particular linguistic contacts in the course 
of time. What Koptjevskaja Tamm (2002: 212, adapted) writes about languages 
in the Circum-Baltic area holds true, all in all, for languages in the Mediterranean 
region as well: “there are no isoglosses covering all the ... languages; moreover, 
the isoglosses pick up different subsets of the languages, in many cases also ex- 
tending outside of the ... area proper. We also find that it is in the border dialects 
and minor languages that language contact has been most intensive and where we 
find the most exciting structural changes". 


See also the following chapters in this volume: 4 by Kabatek and Pusch, 7 by 
Menz, 9 by Kalluli, 11 by Tsangalidis, 12 by Stolz, 16 by Tomić, 17 by Wälchli, 22 
by Hill, 23 by Breu, 37 by Mailhammer and 38 by Ramat. 


Notes 


1 One may also note, in passing, that the aforementioned processes challenge the widely 
held view (exemplified, e.g., by Haspelmath 2001: 1507) that assigns to (Late) Latin (and 
Byzantine Greek) at most a marginal role in the transmission and irradiation of features 
that are typical of the Standard Average European. Late Latin, in particular, has possibly 
played both a passive and an active role in the dynamics of convergence that led to the 
formation of the European linguistic area: it replicated patterns of the spoken varieties 
with which it was in contact and, at the same time, it was a model language for innovations 
and processes of standardization in spoken vernaculars. For an instructive case study, the 
emergence and diffusion of impersonal man-constructions, the reader is referred to Gia- 
calone Ramat and Sansó (to appear). 


Mediterranean languages 353 


2 In Salentino, there is a progressive construction formed by means of a (frozen form of) 
‘stay’ + the conjunction and + the finite form of the main verb: sta mmangia, ‘he/she is eat- 
ing’ (lit.: stays and eats), sta mmangiati ‘you(pl.) are eating’, (lit. stays and you(pl).eat); 
the Latin conjunction ac is no longer recognizable in this construction, but may be recon- 
structed because it triggers the so-called raddoppiamento sintattico (syntactic redupli- 
cation) in the following verbal form (mmangia instead of mangia). A similar construction 
is used in Griko, the Modern Greek dialect spoken in southern Italy: estéo ce grafo, "Um 
writing’ (lit.: stay and L write). Contrary to Rohlfs (1947: 25), who hypothesizes a Greek 
origin of this construction, a Romance origin appears more probable, as the construction is 
never attested in Modern Greek outside the Southern Italian varieties. Moreover, the con- 
struction in Salentino appears to be more grammaticalized than its Griko counterpart, and 
sta has assumed the status of an invariable progressive marker. 

3 Relative clauses in Classical Arabic are introduced by a pronominal element, called 
"relative pronoun" in traditional grammars, that agrees with the head noun in the main 
clause but does not encode the syntactic role of the notional head in the relative clause. 
When the syntactic role of the notional head of the relative clause is that of a direct, in- 
direct, or prepositional object, the relative clause contains a resumptive pronoun referring 
to the notional head: 'al-uulaam-aani |-musiigiyy-aani llad-aanig 'arsala-humaa siiraa- 
nuu [the-boy-bu.NoM the-musical-DU.NOM REL-DU.NOM sent-them.pu Cyrano.NoM], 
‘the two boys musicians that Cyrano sent’. 
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1. Norden as the north of Europe 


This overview deals with language contact situations in the area locally referred to 
as Norden ‘the North’, in Swedish, Danish and Norwegian. When referring to the 
area under investigation I will use Norden, with apologies for this appropriation by 
the Scandinavian majority languages — Finnish would use Pohjoismaat, Icelandic 
and Faroese Norðurland, Northern Sámi Davviriikkat, etc. 

Norden today has some 25 million inhabitants; the area consists of five inde- 
pendent countries — Denmark, Finland, Iceland, Norway, and Sweden - and three 
autonomous territories: the Faroe Islands and Greenland, both belonging to Den- 
mark; and Aland, which belongs to Finland. Many of these geographical areas are 
discontinuous (with all the countries and territories having large islands off their 
coasts — and even further away, like the Norwegian Svalbard). Norden also recog- 
nizes non-political territories, specifically the Sámi territory, Sápmi, covering 
much of the northernmost parts of Norway, Sweden and Finland, and the north- 
easternmost parts of Russia. 

The geographically defined traditional core of the area is the Scandinavian Pen- 
insula (Sweden and Norway); together with what is today Denmark, these states 
make up "Scandinavia proper", and this area is indeed referred to as Scandinavia 
by inhabitants in Norden. Due to the close political and historical relations between 
Scandinavia proper and other parts of Norden, the term Scandinavia is also used — 
especially outside Norden - to include Iceland, the Faroe Islands, and Finland. 

Norden is not just a geographic area; for a long time there has been official Nor- 
dic cooperation in several, if not most, administrative sectors. This cooperation is 
channelled through the Nordic Council for parliamentary cooperation, and through 
the Nordic Council of Ministers for governmental cooperation. The Culture and 
Education and Training Committee, one of the five standing committees of the Nor- 
dic Council, is the one responsible, among other things, for multicultural and multi- 
ethnic issues, languages, research and research exchange programs. There is, for 
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instance, a language convention signed by the five Nordic countries, which entitles 
their inhabitants to use their own language in another Nordic country in dealing 
with official matters in court, and the convention strongly recommends the same 
policy be followed in dealings with the police, in education, in health care, etc. 

Cooperation is rapidly being enhanced between the traditional Nordic commu- 
nities and the Baltic states (Estonia, Latvia, Lithuania), and (north-western) Russia 
(especially St. Petersburg and Kaliningrad, not to mention the historically import- 
ant connection between Finland and Karelia): for example, Nordic linguistic 
graduate schools and graduate courses regularly feature joint participation from 
these regions. There are on-going attempts at establishing a larger territory of co- 
operation, by adding Poland and Northern Germany to the above, producing what 
has been called a Metaregion. 

Although many of these endeavours are recent, Norden is a fairly tight-knit 
community, and has seemingly been so for at least one millennium; and while 
Norden has been in contact with the rest of Europe and beyond, continuous 
Norden-internal cooperation has had a stabilizing effect on the local languages. 

In section 2, I will start with an overview of the different languages in Norden 
today and their present status, and in section 3 I will give an account of the main 
historical and political events that have shaped the area, arguing that Norden itself 
could well be seen as a Sprachbund. Section 4 deals with language contact inside 
this Sprachbund; here a major challenge is to weed out the causes, processes 
and results of linguistic and cultural contacts from similarities based on the close 
genetic relationship between the major languages in the area. The task is virtually 
impossible with regard to the development through time of the North Germanic 
languages, which in turn means that this section by necessity has to focus more on 
the relationship between these languages and other, genetically non-related lan- 
guages in the area. Section 5 briefly looks at the relations between the Sprachbund 
Norden of today and languages outside this area; and the final section 6 addresses 
recent issues of globalization and glocalization and the future of Norden. 


2: Overview of the languages in Norden 


Traditionally, from the point of view of the majority communities, Norden has lin- 
guistically been a very homogeneous area, with continuous Norden-internal con- 
tacts being the norm. Teasing out the give and take in contact situations is therefore 
a major challenge. During the last fifty years, there has been a dramatic increase in 
the number and kinds of languages spoken in Norden, and we are only now begin- 
ning to see the long-term effects of this influence. 

From the point of view of the languages themselves, in terms of linguistic cat- 
egorization, we note that the majority of the languages in Norden are genetically 
North Germanic languages. Some of the most distinctive features that distinguish 
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North Germanic languages from West Germanic languages are (a) the lack of an in- 
itial palatal in words like year: *jaara > Swd år; (b) the lack of initial [w] before 
rounded vowels: *wurda > Swd ord (‘word’); (c) reflexive and reciprocal forms in 
-s (and -st in the past in many areas): Swd möta ‘meet’, mötas ‘meet each other’; 
(d) postposed definite articles: Swd bilen ‘the car’; (e) V2 word order in large areas 
of Norden; and (f) reflexive possessive pronouns corresponding to the genitive -s: 
Nor Endre sin bil ‘Endre’s car’, lit. ‘Endre his car’. 

It is often said that the relationship among the North Germanic languages is 
best described as a dialect continuum, and Norden is often used as an example of 
how difficult and random the division into and distinction between languages and 
dialects is. It is true that speakers of Swedish, Norwegian and Danish can get by to 
some extent in their neighbouring countries if they speak their mother tongue — and 
speaking one’s local dialect rather than the standard often assures (some) under- 
standing. This view of inter-Nordic understanding, however, needs to be relativ- 
ized. As Delsing and Lundin Akesson (2005) show in a recent large-scale investi- 
gation of these issues, not all Norden-native speakers understand others in Norden 
equally well; Norwegians are much better at understanding the native tongues 
of their neighbours; Swedes have most difficulties with their close relatives, 
especially with Danish. Speakers of these three languages would not understand 
Icelandic and Faroese off hand. However, due to the traditional importance of lan- 
guage in primary and secondary education, where Icelanders, Faeroes, and 
Greenlanders learn Danish at school, and where Finns study Swedish and Finland 
Swedes Finnish, pan-Nordic understanding is greatly enhanced. Investigations 
show that Faeroes are the best at understanding all the other North Germanic lan- 
guages. 

In dialect research, the whole area inhabited by speakers of North Germanic 
languages — including the Swedish-language parts of Finland and Estonia — is seen 
as one large dialect area: isoglosses are always drawn irrespective of political 
boundaries, and especially in the border regions — e.g., between Norway and 
Sweden - it is impossible to say whether a dialect is a dialect of Swedish or a dia- 
lect of Norwegian. Typically, therefore, we talk about Scandinavian dialects, or 
dialects of Scandinavian, when referring to the traditional (rural) dialects. 

The North Germanic tree of genetic relationships is historically seen as being 
split up during the Viking Age (around 800-1200) into a western branch (with Nor- 
wegian, Icelandic and Faroese), and an eastern branch (with Danish and Swedish). 
Due to the close-knit relations in the area and the various political interactions over 
the years, this division has historical relevance at best. With respect to the further 
historical development of the genetically defined group of North Germanic lan- 
guages in Norden, scholars (cf. in particular Torp 1998) have found the wave- 
model to be particularly useful. The focus then is on the centres of dispersion, 
where the spread of a new feature has originated; in this respect the wave-model is 
more attuned to language contact than the genetic tree. True, if the North Germanic 
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languages were to be looked at as well-defined, static systems occupying a given 
position in a genetic tree, results of contact over the last 1,000 years could theor- 
etically be discerned, but such an account would not be in accordance with what 
has de facto taken place in the Nordic society over the same time-span. 

In addition to the 200 or so migrant languages that have reached Norden after 
1945 (the end of World War II), Norden also has a number of other traditional lan- 
guages than the North Germanic ones. There are also other ‘traditional’ Indo-Euro- 
pean languages in the area, specifically different varieties of Romani and Yiddish. 
In the south and southwest of Norden the North Germanic languages today border 
on areas with speakers of mostly West Germanic languages (varieties of English, 
Frisian and German). Further to the west, in Greenland, Danish is used as the ad- 
ministrative language and in higher education, whereas the local population are 
Inuits, who typically speak Greenlandic, an Eskimo-Aleutic language. 

The third major branch of languages spoken in Norden (in addition to Ger- 
manic, i.e. Indo-European, and Eskimo-Aleutic) is Uralic. Two major branches of 
Finno-Ugrian languages are extensively represented: Sámi and Finnish. There are 
ten — today, rather eight — different Sami languages in the area, of which Northern 
Sami is by far the most widely used, both in terms of numbers of speakers and 
because speakers of Northern Sami can be found across Sápmi in all the countries it 
covers. Northern Sami also uses the same writing system in the three Nordic coun- 
tries, and it has become something of a lingua franca for Sápmi. In Finland there 
are in addition Inari Sámi and Skolt Sámi; in Sweden and Norway we find Northern 
Sami, Lule Sámi, Pite Sámi, and Southern Sami, and in Sweden in addition to these, 
Ume Sami. In Russia, we find not only Skolt Sámi, but also Kildin Sami and Ter 
Sámi (with only a few speakers); the last native speaker of Akkala Sámi recently 
died. Many of the Sámi languages have very few speakers and are endangered. 

The other major Finno-Ugrian language in Norden is Finnish, spoken primarily 
in Finland by some five million speakers, but also harbouring some 300,000 speak- 
ers in Sweden. In addition, there are branches of Finno-Ugrian in Sweden — Meän- 
kieli — and in Norway — Kven — which have received minority status as languages in 
their own right; linguistically these two varieties are about as close to Finnish as, 
say, Nynorsk (one of the two official varieties of Norwegian) is to Swedish. 

Finnish is closely related to Karelian, its neighbor in Russia, and to Estonian, 
spoken in Estonia, to the south of Finland. Further to the south from Estonia, 
Norden touches on — over the waters of the Baltic Sea — Baltic languages (Latvian 
and Lithuanian), and to the east and south-east, Slavic languages (Russian and 
Polish). 

Like other areas of Europe, Norden is inhabited by speakers of hundreds of dif- 
ferent languages; some of the "new" languages have also concretely affected 
the traditional languages, especially in lexis. The Nordic states have over the last 
decades rewritten their constitutions as regards languages, so that Finland, for 
example, which has throughout its independence had two national languages, 
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Finnish and Swedish, now also recognizes in its constitution the Sámi languages 
(in Finland, Northern Sámi, Inari Sámi and Skolt Sámi), Romani, and signed lan- 
guages (Finnish Sign Language (FinSL) and Finland-Swedish Sign Language 
(FinSSL)). In addition, Karelian is also mentioned as a minority language. 

Languages explicitly mentioned or implied in the constitutions of the respect- 
ive Nordic nations are de facto the traditional languages in the area. New migrant 
languages have not been officially recognized to any greater extent as yet. How- 
ever, changes are taking place here, too. For example, it is now possible in Finland 
to choose among a host of mother-tongue alternatives in the population statistics, 
including signed languages. 


3. The contact area Norden 


In the prototypical case, being a separate language means for this language to have 
idiosyncratic features that are different from the idiosyncratic features of any other 
language. A definition of a set of idiosyncratic features is furthermore that it is not 
equivalent with universal language features, nor exhausted by typologically spec- 
ified features, nor solely the outcome of genetic relationships or areally defined 
features. In the case of Norden, and especially with respect to the North Germanic 
languages, these distinctions are, however, hard to use as reliable aspects to assess 
similarities and differences among languages. 

In a language contact situation we need to distinguish between the causes, 
the processes, and the results of contact (cf. Wei 2008). Norden-internally we can 
get an idea of the causes and processes, but whether the results are due to language 
contact or not is difficult to say for sure. We are on much safer ground in this re- 
spect when we look at how Norden is or has been in contact with its neighbours. 

Norden is an ‘imagined community’ in Anderson’s (1983) sense: It has a com- 
mon history, its states have a similar political climate, its peoples show similarities 
in personalities, and the majority of the population can understand and be under- 
stood by others (with different mother-tongues) in the area. 

Historically and politically, the nations within Norden have been intricately 
intertwined. For instance, today's Sweden, Norway and Denmark were in different 
unions with each other through the centuries: all of them were joined together 
in the Kalmar Union (1389/1397-1523) under Swedish rule; Norway was in union 
with Sweden both in the 14th century (1319-1363) and in the 19th century 
(1814-1905); as the union dissolved, Norway was de facto an independent state. In 
between, Norway had been one part in the joint Kingdom of Denmark-Norway 
for more than 400 years (1380-1814). Iceland declared itself independent from 
Denmark in 1944; (much of) what is today Finland was part of Sweden (as 
Sweden-Finland) from the 12th century until 1809. At present, although Norden 
hosts five nations, there are four official territories with some — but different kinds 
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of — degrees of self-determination status. The unruly, but geographically contained 
history has tried out most cooperative possibilities, and Norden today prides itself 
of being democratic and in political balance, fostering the image of the Nordic wel- 
fare states, which in turn boosts Nordic people’s sense of accomplishment. 

Ata very early stage, there were numerous Norden-internal colonisations, with 
Swedes starting to colonise Finland (Aland) and Norrland (Northern Sweden) in 
the 6th century; Norwegians colonising Northern Norway, Iceland and the Faroe 
Islands in the 9th century; and Danes starting to colonise Greenland around the 
same time. In addition, people migrated and have settled in the area throughout the 
centuries (e.g., Finnish farmers migrated to Sweden and Norway in the 17th and 
18th centuries), and groups of people from areas nearby Norden have constantly 
moved in: during the Viking Age, Slavs and Frisians came to Denmark; in the 
Middle Ages, German craftsmen and businessmen settled in the Hanseatic towns; 
and in the 16th and 17th centuries, Germans and Walloons settled in northern 
Sweden as a result of the growing mining industry at the time. Much of this expan- 
sion took place at the expense of the rights of indigenous people, especially the 
Sami. The Kvens settled in Northern Norway in the 18th century, and Meänkieli 
speakers were politically cut off from Finland in 1809, when Finland became a 
Grand Duchy of Russia. 

Areally, Norden harbours several linguistic and cultural areas: the Circum-Bal- 
tic linguistic area (geographically defined by the catchment area of the Baltic Sea); 
the North Sea littoral (focussing on contacts to the west and south-west of Norden); 
the area of the Hansa (defined by the economical, cultural and political contacts in 
northern Europe primarily in the 15th-17th centuries). But Norden itself can well 
be argued to fulfil the requirements of being a linguistic area, a Sprachbund. 

In addition to their respective national identities and patriotic feelings with 
respect to sports and other competitions, people in Norden also have a Nordic 
identity, ethnographically and conceptually. This goes back to a common sense of 
*the good old days', when Norden was a common linguistic community with com- 
mon interests and a common ideology as against foreign influence. It has even been 
suggested that since the change from the fupark with 24 runes to the *modern' one 
with 16 runes a millennium ago was so sudden and rapidly accepted throughout the 
area, there must have been an explicit common Nordic decision to this effect. 

In more recent times, we can distinguish several waves of ‘the Nordic project’ 
(cf. Ostman and Thggersen 2010). The first was part of national romanticism and 
culminated in the pan-Scandinavian Stockholm meeting in 1869, where Scandina- 
vianism was the ideal and both status planning and corpus planning, including pur- 
ism, were held in high esteem. The second wave of the Nordic project came after 
World War IL, when the Nordic language communities each set up their own lan- 
guage councils and language committees, overtly focussing on discourse planning, 
but with corpus planning as the basis to build on. These councils have since had 
joint annual meetings and have continued to coordinate their activities and duties. 
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4. Norden-internal language contacts 


As a geolinguistic area, Norden is not much different from other areas: speakers 
of powerful majority languages have more impact as they are speakers of super- 
stratum languages; the substrata are preserved in the onomasticon and in certain 
culturally defined domains. Below I will sketch the main causes and processes — 
and some results — of contact in Norden, starting in the east, and moving westward. 
The pidgin Russenorsk and some other Scandinavian-Russian contacts are dealt 
with in chapter 20 by Stern in this volume; the Circum-Baltic (linguistic) area, i.e., 
the results of contact among people living around the Baltic Sea is discussed in 
detail in chapter 17 by Wälchli in this volume. These topics will only be dealt with 
in passing here. (For some of the overviews below, the discussions in Goebl et al. 
1997 and Bandle et al. 2002/2005 have been of considerable help.) 

Finland today has a population of 5.3 million. From the times of the Christian 
crusades eastwards in the 12th century until 1809, Finland was an integral part of 
Sweden, where Swedish was the language of administration. When a Grand Duchy 
of Russia (1809-1917), Russian was used by the Russian ruling class, but with little 
concrete interaction between Russians and the Finnish- and Swedish-speaking man 
and woman in the street; Swedish remained Finland's only official language until 
1863. In the spirit of national romanticism, the idea of one-language-one-people- 
one-nation, coupled with increasing Russification activities in the latter part of the 
19th century, Finnish language and culture were upgraded in the minds of Finns, 
and many speakers of Swedish in Finland took Finnish names and changed their 
language to Finnish, teaching their children Finnish as their first language. Dis- 
cussions about the linguistic future of Finland were harsh at times in the late 19th 
century, but after Finland had declared its independence in 1917, the Language Act 
of 1922 stipulated that Finland has two national languages, Finnish and Swedish. 

Finnish-Swedish bilingualism is widespread in Finland, and the educational 
system requires both languages to be taught at least to some extent to all school 
children. As a Scandinavian language, Swedish offers Finnish speakers access 
to the rest of Norden; and varieties of Finnish have been used in Sweden (and 
partly in Norway) in some places ever since the Middle Ages. The most obvious 
exchange between the two languages can be found in the lexis. Finnish language 
planners (in Finland) have been relatively prone to *purism' in relation to borrow- 
ing and foreign influence, so that thousands of Finnish neologisms were created in 
the 19th century. But whereas the creation of words like puhelin (‘telephone’; from 
puhe ‘speech’ + the derivational -(/)in) was the winning alternative over the inter- 
national variant telefoni before World War IL, when new gadgets like television 
entered the culture and the language in the mid-1950s, the short-lived Finnish sub- 
stitute word näköradio 'sight-radio' soon gave way to the international form tele- 
visio ['televisio] that Finnish still today uses. Quantitatively, Finnish has borrowed 
most loanwords from Swedish; borrowing in the other direction is not so frequent, 
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although basic-vocabulary words like pojke (‘boy’; Fin poika) have been taken 
over into Swedish. 

Finland Swedes (finlandssvenskar) prototypically have Swedish (known as 
Finland Swedish) as their mother tongue. Most Finland Swedes live near the coast 
in the west, the southwest, and the south of Finland. The Aland Islands, with Ma- 
riehamn as their main town, are a semi-independent territory with Swedish as of- 
ficial language. There is an abundance of loan words into Finland Swedish from 
Finnish — so much so that word lists and dictionaries of these words (finlandisms, 
fennicisms), which Finland Swedes are advised not to use, have been produced 
time and again for more than a century; the most notable being Bergroth (1917 et 
passim) and af Hállstróm-Reijonen and Reuter (4th ed. 2008). In addition to lexical 
influence from Finnish on Finland Swedish, the lack of word accent distinctions 
(Acute Grave) in (Standard) Finland Swedish has been attributed to influence 
from Finnish, which is characterized by word- and phrase-initial stress. 

There are varying — and sometimes fierce — debates about whether Finno- 
Ugrian or Germanic tribes came to Finland first and chased the Sámi further north. 
Other suggestions have also been advanced, e.g., that there has been constant 
interaction between Finno-Ugrian and Indo-European tribes in the area (cf. Wiik 
2002). The prehistory of Norden as a contact area has no doubt had a considerable 
effect on why the languages look the way they do today. Individual features may 
also have been affected or may be due to a contact situation, but it is not clear in 
which direction the influence has taken place. For instance, we do have some evi- 
dence of previous stages in the immense variation we find in the more than 80 rural 
dialects of Swedish Oe, Scandinavian) in Finland. On the other hand, if the Ger- 
manic tribes and the Finno-Ugrian tribes have lived side-by-side for more than a 
millennium (at least), it is to be expected that the groups have had some degree of 
contact. 

Romani, spoken by the Roma, can be traced back to the first millennium AD 
in Finland. The three dialects of Finnish Romani have only recently started to be 
investigated systematically. Romani was until very recently a secret language with 
low prestige; in the middle of the 1950s, 70 % of the Roma knew Romani, but 80 96 
used Finnish. Today there are very few native speakers of Romani; only scattered 
words are learned by adolescents. 

The Sámi (altogether about 60,000) have organised themselves politically in all 
the three Nordic countries covered by Sápmi: the Sámi Parliament in Finland since 
1973, the Sameting in Norway (1989), and in Sweden (1993). Some 80% of the 
Sámi in Finland have Finnish as their mother tongue. We find several hundred 
Sámi words in Finnish, especially in northern varieties of Finnish, e.g., tunturi 
*mountain'. Conversely, we find several thousand loanwords from Finnish in the 
Sámi languages. Again, though, since both Finnish and Sámi languages are Finno- 
Ugrian languages, it is not always easy to say what has been borrowed from where, 
if anything. Recently, there have been attempts at creating domestic words like 
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dieda ‘science’ (cf. Fin tiede) in Northern Sámi, which has majority status within 
the Sámi population, further enhanced through the pan-Scandinavian orthographic 
reform of 1979. According to this reform Northern Sámi was to be spelled the same 
way irrespective of the political nation where it was written. 

The northern parts of North Germanic and Sámi Norden have had cultural con- 
tacts with, and been influenced by the Slavic and Finno-Ugrian languages to the 
east, and Norden-internally through frequent Northern Sámi contacts. This in- 
fluence extends to northern Norway and to the northern parts of Sweden, Norrland, 
whereas Old Norse influence is strongest on Southern Sámi in the central parts of 
Norway and Sweden. The simplified consonant system (coupled with a fairly com- 
plex system of vowels and diphthongs) of Southern Sámi — in comparison to 
Northern Sámi — might in part be due to contacts with Germanic. 

The population of Sweden is 9.3 million. Before World War IL, Sweden (as the 
geographic area it is today) was culturally and linguistically very homogeneous — 
except for the indigenous Sámi and Tornedalians in the north; until the 1960s the 
guiding principle was assimilation to the majority, Swedish, culture. The common 
Nordic labour market declaration signed in 1954 saw a considerable migration to 
Sweden from the other Nordic countries, since special work permits were no 
longer needed among Nordic people who wanted to settle down and work in an- 
other Nordic country than their home country, and passports have since not been 
needed for inter-Nordic travel. The Nordic Council's language convention, giving 
Nordic citizens the right to use their own language in another Nordic country, also 
helped to enhance migration. 

As a political power, Sweden has a long history of multiculturalism and multi- 
lingualism; Sweden was a very powerful nation in the region, especially during 
its Great Power Period (Stormaktstiden, 1611-1718). At this time, German was the 
lingua franca in the Kingdom of Sweden, but even before that, in the Middle Ages, 
German exerted considerable influence on Swedish with respect to both grammar 
and pronunciation. Only later, in the 18th century, French gained in importance. In 
the late 1960s, multiculturalism and multilingualism was made the common plat- 
form for the Swedish society; the melting-pot principle gave way to special in- 
struction for migrants. In 1976 the Home Language Reform was instituted, giving 
children the right to get education in another language than Swedish. Finnish 
speakers had a special status, since Finnish was deemed a national language and 
part of Sweden's cultural heritage. In addition, Finnish was a lingua sacra in north- 
ern Sweden, due to the considerable influence of the religious Laestadianism in the 
area, especially on the Sámi. 

Of the ca. 15,000 Sami living in Sweden, half are (still) in the north, in Norr- 
botten. Most of the linguistic impact of the Sámi languages on Swedish (and on 
Meänkieli) is in lexis. Since bilingualism is nowadays widespread among the 
Sámi, code-switching (into Finnish, Swedish, or Norwegian) can easily become 
the preferred option in situations where modern terminology is needed. 
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The Scandinavian dialects in Norden form a continuum so that the western dia- 
lects in Sweden are very similar to Norwegian dialects — partly due to the common 
political history of what is today Sweden and Norway. Similarly, the southern Sca- 
nian dialects have a number of Danish features, in spite of active Swedification 
after Denmark lost the area to Sweden. Meänkieli (or, Tornedal Finnish, tornedals- 
finska) has official minority status in several of the municipalities in northern 
Sweden. It has been strongly influenced by Swedish, to the extent that the words 
for mother and father — mamma, pappa — are from Swedish. Meänkieli pronunci- 
ation also has Swedish features, including the use of the typical Swedish mid- 
vowel [u] and the back open a-sound [a]. Influence from Swedish can also be found 
in Meänkieli syntax: for example, necessive expressions in Finnish would have 
the subject in the genitive, and the finite verb in third person singular — the form for 
impersonal constructions, but in Meänkieli the subject is in the nominative, and the 
finite verb is controlled by the subject. Further, finite verbs in 3rd person singular 
have a distinctive form with the ending -pi in Meänkieli: hdn saapi ‘she receives’ 
(cf. Fin hdn saa). 

Genetically, Swedish makes up the eastern branch of North Germanic together 
with Danish. Bonds between Denmark and Sweden are still considerable; the south 
of today's Sweden, Scania, was lost by the Danes in 1658, but similarities in pro- 
nunciation between Scanians and Danes persist. Danish is spoken mainly by 
the 5.5 million Danes living in continental Denmark, but Danish is also a strong 
language, especially of administration, in Greenland and in the Faroe Islands. Fae- 
roes — and Icelanders, too — typically learn Danish as their Scandinavian language. 
And since Norway and Denmark formed a union for 400 years, Danish also has a 
stronghold in Norway - to the extent that one of the two Norwegian national lan- 
guages, Bokmál, might to an outsider seem very Danish in its written form. 

In the south of Denmark, in Sønderjylland, German has minority status. In 
1864 Denmark lost (parts of) its southernmost part (Schleswig and Holstein) to 
Germany; in the area now known as Sønderjylland natives use three languages, 
High German, standard Danish, and Sgnderjysk (the Southern Jutland dialect of 
Danish) — and the same holds on the other side of the border in Germany. 

The idea of one-nation-one-language seems in a sense to have survived in Den- 
mark; especially in comparison to Norway, there is very little everyday tolerance 
for dialects in Denmark. There is far-reaching dialect levelling in Denmark, with 
Copenhagen as the magnet of linguistic centralisation; so much so that nowadays 
we can say that there are two linguistic points of reference for Danes: Copenhagen 
and the (geographically unspecified) Province. That is, we do not find as clear an 
expression of local regional dialect identity in Denmark as we find in most of the 
other Nordic countries. Recently, though, the traditionally esteemed ‘High’ Copen- 
hagian has given way to what Tore Kristiansen (2001) has called Low Copen- 
hagian, a variety that is spoken in the outskirts of Copenhagen. This is the variety 
that Danes generally seem to orient towards nowadays. In Denmark, what seems to 
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be taking place in view of general globalisation and democratisation processes is 
not destandardisation as such, but what Mattheier (1997) has called demotisation 
(Demotisierung). 

In 1953 Greenland became part of Denmark; it is one of the autonomous terri- 
tories in Norden. In 1979 a Home-Rights Act declaration was signed, making 
Greenlandic the main language of the island, but with the specification that Danish 
shall also be taught at school; either language can be used for official purposes. 
History has it that Erik the Red (Erik Rede) landed in southern Greenland in 982, 
and there was continuous Danish habitation in Greenland until in the 14th century. 
Danes came back in the 18th century, and after a century of Danish-Norwegian 
settlement, Greenland was declared Danish in 1814. Today some 15% (of some 
60,000) of the inhabitants speak Danish, with certain linguistic features (like 
the conspicuous Danish stød) losing foothold, and with the dialect split between 
the more prestigious western dialect and the eastern variety becoming more pro- 
nounced. 

The Danish-Greenlandic language contact seems like a textbook example of 
how the substratum language mostly influences Danish lexically through loan 
words, whereas the superstratum language Danish has more far-reaching effects 
on the language system. For instance, the Greenlandic numeral system only went 
up to 20; today the twelve first of the Greenlandic numerals are in use, but Danish 
numerals and Danish (also vigesimal-based) counting are used for higher 
numbers — and sometimes also for the lower digits. 

The Faroe Islands were colonised from the southwest of Norway in the 9th cen- 
tury; in 1035 they became a Norwegian province. In connection with the Kalmar 
Union at the end of the 14th century the islands became part of the Danish-Norwe- 
gian Kingdom. When the union was dissolved, the Faroe Islands remained Danish 
and received county status (as an amt) in 1816. Still, there has always been exten- 
sive contact between the Faroe Islands and the town of Bergen in Norway. The 
Faroe Islands is now an autonomous territory within Denmark, with a population 
of 50,000. 

Since Danish was the only writing system, a Danish translation of the Bible 
was used in Church in the Faroe Islands at the time of the Reformation. This set the 
ground for Danish-Faroese bilingualism: the Danes did not learn Faroese, so the 
Faeroes had to learn Danish in order to be able to handle administrative and clerical 
issues. Even though Danish has continued to be important, the Faeroes today tend 
to look up to Iceland as their big brother. The Faroese writing system is fairly re- 
cent, being established in the 1850s. It was created with Old Norse (and Icelandic) 
as the model and with the expressed goal to ‘purify’ the language and get rid of 
Danish and international loanwords. Although the writing system was standardised 
in the spirit of national romanticism, the different dialects on the islands are con- 
sidered equally good, and can be used also on formal occasions. 

Iceland was colonised in the 9th and 10th centuries by Norwegians — both from 
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what is today Norway and from Norwegian-populated areas in Scotland and Ire- 
land. In 1262 Iceland became part of the Norwegian Kingdom; later it was part of 
Denmark for 500 years until its independence in 1944. Today Iceland has over 
300,000 inhabitants; it is said to be ethnically and linguistically homogeneous; the 
traditional view is that “there are no dialects in Iceland" — similarities between Old 
Norse (Old Scandinavian) and modern Icelandic continue to be stressed by Ice- 
landers. Detailed analyses of the linguistic situation in Iceland (cf. Arnason and 
Práinsson 2003) have shown, though, that there is indeed dialectal variation in Ice- 
land, too, both in pronunciation and in morphosyntax, the biggest differences being 
between the dialects in the north and the Reykjavik variety. Loan words do exist, 
and they are adapted phonologically and morphologically, e.g. from Danish, bill 
(Dan bil ‘car’); from English, sjoppa ‘shop’. 

The Norwegian borders were stabilised in 1814 after the area had been in union 
with, and a colony of, Denmark for 400 years. During the Danish period the Nor- 
wegian of the Middle Ages disappeared and written Danish prevailed. During the 
period of 1814—1905 Norway shared king and foreign policy with Sweden; Nor- 
way declared independence in 1905 and today has 4.8 million inhabitants. 

During the 400 years of common history of Denmark and Norway a creoloid 
called Danish Norwegian, Dano-Norwegian, came into being. It has been charac- 
terised as a creoloid since it had mother-tongue status from the beginning and 
could be understood by Danes and dialect-speaking Norwegians. Dano-Norwegian 
is primarily the result of the contact between spoken Norwegian and written Dan- 
ish that developed within the educated classes in Norway in the 18th and early 
19th centuries. Danish was the written language in all of the Danish-Norwegian 
Kingdom, and thus set the standard for morphology and lexical elements, whereas 
prosody, phonology and syntax continued to be influenced by spoken Norwegian. 

Dano-Norwegian was the mother tongue of the upper middle class and had 
high prestige. In the times of national romanticism, Ivar Aasen (1848) forcefully 
brought forth the argument that since the language of the upper class was a result 
of language contact, and since Old Norse and the Norwegian dialects were closely 
related, the language of the upper classes does not have the prerequisite of func- 
tioning as a national linguistic symbol: the real Norwegians are the dialect-speak- 
ing peasants, which made up 95 % of the population in the early 19th century. In 
1853 Aasen presented a standard for the writing system of Norwegian that was 
based on the common denominators of the dialects; in 1885 this proposal was 
deemed to have the same status as the Dano-Norwegian standard. 

Today, there is no official, codified standard spoken Norwegian. The national 
language is “Norwegian” in two written varieties — both of which are taught at 
school. At the end of the union with Denmark, Norway was linguistically divided 
between (a) those who wanted to continue on the path set in the union, with a lan- 
guage that was greatly influenced by Danish - this was later to be called Bokmál 
(‘book language"), with its most well-known proponent in the 19th century being 


Language contact in the North of Europe 371 


Henrik Wergeland; and (b) those who felt that ‘real’ Norwegian should be based on 
the local dialects — later known as Nynorsk (*new Norwegian"), with Ivar Aasen as 
a most ardent field worker who made Norwegians respect their dialects. Because of 
the generally positive attitudes towards dialects in Norway, the country is still in 
many places hailed as a promised land for the acceptance of variation. The two var- 
ieties were given equal status in 1885, and in 1907 it was made compulsory for 
everyone to learn both varieties at school — at least to some extent (as sidemäl). 

Norway, in accepting variation and the influence of the language of the rural 
population thus took a very different turn from what had happened in Sweden and 
Denmark, where the language that developed into the standard for the written lan- 
guage was very close to the spoken language of the upper classes; other varieties 
had less status. 

As separate languages, Old Norwegian and Old Swedish are known to have 
been in contact at least in the 14th century through the influence of Saint Birgitta. 
There is even said to have been a pidgin language called Birgitta-Norwegian 
(birgittinernorsk) based on these two languages. That is, the Revelations of the 
Swedish Birgitta were written in Latin and translated not only into Swedish, but 
also into Norwegian; thus we can see early contact between Latin and Old Norwe- 
gian influencing the style of written Norwegian. 

During the time of the Hansa, Old and Middle Norwegian came in close contact 
with Low German, especially in the 14th and 15th centuries. Low German was the 
lingua franca of the Hanseatic League, and we can see its effect most clearly on the 
town dialect of Bergen, which is the only traditional Norwegian dialect with a two- 
gender system; other dialects have a three-gender system. The peculiar Bergenian 
gender system is argued to be the result of language contact, as the Hansa had a 
prominent colony in Bergen. During these centuries the Scandinavian languages 
generally changed from synthetic to more and more analytic languages. It is diffi- 
cult to ascertain that it is precisely the contact with Low German that is the primary 
cause for this change, but it is safe to say that the typological change was faster be- 
cause of this contact. 

Northern Norway has experienced language contacts for 2,000 years; here, 
multilingualism is the typical state of affairs. Still, there have been constant lin- 
guistic conflicts in the area due to the repeated attempts by the Norwegian author- 
ities to assimilate and integrate the Sámi and Finnish (Kven) minorities in the area; 
until the 1950s total assimilation was the goal. We can find some Sámi loan words, 
idioms and constructions in the Northern Norwegian dialects, but otherwise the 
influence has clearly gone in the opposite direction: the Sámi lexicon, syntax, and 
phonology have been considerably affected. For instance, retroflex pronunciation 
was formerly unknown in varieties of Sámi. Bilingual Sámi-Norwegian adoles- 
cents are said to be speaking Sámi-Norwegian, an ethnolect that is different from 
other dialects of Northern Norwegian in that it has a Sámi substrate; Norwegian 
monolinguals in the same area also speak this ethnolect. For instance, Sámi lacks 
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the Acute- Grave word accent (toneme) difference that is typical of Norwegian and 
Swedish; this feature is also lacking in Sámi-Norwegian. Sámi languages do not 
distinguish between voiced and voiceless initial plosives (p~b; t~d; k~g), and they 
lack aspiration after initial plosives; Sámi-Norwegian has voiceless initial plosives 
without aspiration. 

Norwegian and Finnish have been in contact in this area since the 18th century. 
The local Finnish variety in the area has in fact been influenced by both Norwegian 
and Sámi - the syntax mostly by Sámi languages and the lexicon by Norwegian. 
Finnish in northern Norway accepts initial consonant clusters to a much higher de- 
gree than the Finnish in Finland; in Sámi, too, we find the same feature — under in- 
fluence from Norwegian. The Sami languages — especially Northern Sámi — and 
Finnish have also been in contact for some 200 years in the northern parts of Nor- 
way. This is perhaps more of an adstratum situation, although it seems as if Sámi 
has had more of an influence on Finnish than the other way round, especially in the 
area of syntax. 

Norway is geographically spread out, and although the dialect map of Norway 
shows distinct isogloss areas, we also find linguistic ‘islands’; e.g. the Troms area 
around the university town Tromsø is considered a southern-dialect island in the 
north of Norway. We find an even more complex sociolinguistic situation and a 
multitude of varieties in Longyearbyen on the Svalbard Island. Svalbard has only had 
permanent inhabitants from the 20th century onwards, with no traditional dialect; 
today we do find dialects (cf. Mæhlum 1992) on Svalbard, but the variety spoken in 
one household can be very different from the neighbour's variety; and children in the 
same family can speak different varieties due to the friends they happen to have. 

The geographically based overview presented above has focussed on the main 
causes and processes of linguistic and cultural contacts in Norden. I have given 
examples of results of various kinds: phonetic, morphophonetic and prosodic re- 
sults, morphological and morpho-syntactic results, and lexical and semantic evi- 
dence of contact. The overview has also made an effort to consider socio-prag- 
matic and discourse aspects in order to illustrate how we need to look at the causes 
and processes behind the actual concrete results that we see in linguistic form. 
Especially after World War IL, the generally accepted one-language principle in 
Norden, where each Nordenian speaks his or her (variety of a North Germanic) 
language to speakers of other North Germanic languages, has secured enough of 
mutual intelligibility to foster, and even actively enhance, borrowing among 
speakers of different North Germanic languages; this has also been seen in the 
similarities in corpus planning in the different speech communities. 

We have also seen the influence of minority and unofficial languages, including 
dialects, on the emergence of, and variation in, standard languages. And we have 
seen the joint endeavours for standardisation activities going in similar directions 
in the different communities in Norden. This covers both spoken and written lan- 
guages (cf. the Stockholm meeting in 1869, and onwards), and signed languages 
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(with a boost in a corresponding meeting in Copenhagen in 1907), and both corpus 

and status planning. 

The overview has given the broad — and thus very shallow, or even superficial — 
causes, processes and results of contact. When we look in detail at any two varieties 
in contact, we can be more specific, but still, due to the close genetic relationship of 
the North Germanic languages and dialects, it is impossible to give definitive 
answers as to what influenced what. To round off this section, I will give some 
hints of the kinds of similarities we find in those varieties I am most familiar with 
myself; the choice of features is thus a random selection. 

1. Whereas traditionally North Germanic languages had three genders, the stan- 
dard mainland languages have two. In some areas, e.g., in the northern parts of 
the Swedish Ostrobothnian dialects on the west coast of Finland, there is only 
one gender. These varieties are geographically close to Finnish varieties, which 
lack gender differentiation, i.e. they de facto have only one gender, which is the 
same as saying that they lack the overt marking of morphological gender. 

2. Doubling phenomena are generally not as frequent in North Germanic lan- 
guages as has been reported e.g. for Dutch (cf. De Vogelaer and Neuckermans 
2002). But in certain varieties (e.g. in Ostrobothnian Solv) excessive doubling 
is the norm, especially subject doubling (as in An a han joort he "He has he 
done that’) and negative, and contrastive doubling (as in Noo a an joort he han 
noo 'Sure has he done that he sure"). The pragmatic reason for this is no doubt 
communicative explicitness, but the fact that Finnish is geographically close 
by, with the typical double marking of, say, possessiveness, and subject, might 
have had an indirect effect on doubling in Solv. 

3. Many Ostrobothnian varieties retain the imperative plural, which has otherwise 
been lost in standard and regional forms. Finnish also keeps up a formal dis- 
tinction between singular and plural imperatives. 

4. Lexical polytonicity (Accent-1 and Accent-2, Acute- Grave, stød) is typical all 
over Scandinavia; this is not, however, the case in the Swedish varieties in Fin- 
land. Since the distinction is not a general one throughout the Scandinavian 
area, it 1s clear that we cannot blame Finnish for the lack of lexical tones; 
however, the lack of such a distinction in Finnish has at least not promoted the 
Finland Swedish varieties to further enhance it. 


5. Norden in contact 


The last 50 years have seen a dramatic increase in the number of languages in the pre- 
viously fairly homogeneous Norden. This can be observed even in Finland: its first 
immigrants arrived as late as in the early 1970s, and 25 years later (2006), with a 
population of five million, Finland had over 120,000 foreign citizens from 174 
countries, who spoke 138 different languages. The non-traditional languages have 
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already had a marked influence in the bigger cities in Norden (especially in Stock- 
holm, Copenhagen, Oslo, and Góteborg), mostly starting with adolescent use. 
Long-term effects and language change are, however, hard to predict at present. 

Over the centuries there have been five languages which were the leaders of 
linguistic export to the Nordic countries, especially within the lexicon: Latin, 
Greek, French, German, and English. In addition, different spheres of life have 
enriched the native vocabularies, e.g., Italian in music and food; and particular 
geopolitical situations have left traces, e.g., Russian words and phrases in Finland. 

People from Norden have travelled extensively and constantly for more than a 
thousand years: Viking expeditions and settlements around 800-1000 covered 
most of (the coasts of) present-day Europe, Greenland, what is today the east 
coasts of Canada and the U.S.A., as well as large areas of western Russia of today, 
down to the south-east of Europe and the northern parts of Asia Minor. 

Contacts between Finnish (and Finnic languages generally) and Russian have 
left important traces. The most salient features from the Scandinavian languages in 
Russian are names, like Igor, from Scandinavian Ingvar. Further, some northern 
Russian dialects use a postposed article, and the similarity between the use of re- 
flexive clitical particles (-s, -st in Scandinavian; -sja in Russian) as intransitive 
markers has often been pointed out. 

There are different views on whether the area around the Baltic Sea, in particu- 
lar its catchment area, variously called the Circum-Baltic Area or Baltic Europe, 
qualifies to be called a Sprachbund proper (as e.g. Raukko and Ostman 1994 would 
have it), or whether it is more of a contact zone, a buffer between the east and the 
west; cf. the discussion in Dahl and Koptjevskaja Tamm (2003). Although the con- 
tact between different peoples in the area has a long history and the various areas 
have been in political unions for longer or shorter periods of time, some scholars 
feel the contact has not been continuous enough to call the area a Sprachbund. Still, 
we can see clear influence of the Germanic languages on the Baltic-Finnic lan- 
guages, of Finnish on Swedish (especially on Finland Swedish), of Estonian on 
Estonia Swedish, of German on Latvian and Polish; and a give and take between 
Eastern Finnic and Northern Baltic languages, between Slavic (especially Russian) 
and Eastern Finnic languages, between Lithuanian, Polish, Belorussian and Ukrai- 
nian, etc. Norden as such, however, has most of the ingredients to be called a 
Sprachbund: the long-standing political unions and frequent contacts, and the 
sense of an imagined community. And through the ages the kingdoms of Norden 
have also included most of the Circum-Baltic area. Arguing that Norden is not a 
Sprachbund on the basis that most of the languages it encompasses are North Ger- 
manic is opting out and neglecting other languages than the standard languages. 

Thus, the Finnish necessive contruction mentioned earlier would have a nomi- 
native-agentive correspondence both in Estonian and in Finnish dialects. Further, 
we find initial question particles in yes-no questions in the languages around the 
Baltic Sea. Sure, the tendency to put the focus on the first element in a yes-no ques- 
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tion is not restricted to languages around the Baltic Sea, but the close contact has 
had a synergetic effect and helped to retain and reinforce this structure. The same 
kind of synergy (i.e. pragmatic, multi-cause) effect can be seen in the similarities 
of the complex systems of demonstrative adverbs in Finnish and in dialects of 
Scandinavian in Finland — despite the lack of surface similarity between the sys- 
tems. (For details, see Ostman 1995.) 

Especially if we assume that language and culture go together, we can find a well 
of peculiar similarities both inside Norden and in the Circum-Baltic area. For one, 
exonyms flourish, indicating that the creators of an exonym have experienced a need 
(typically, close contact) to establish a domestic name. Thus, Stockholm is Tuk- 
holma in Finnish, Tallinn is Tallinna in Finnish and Reval in Swedish. Once the 
similarity — cf. the imagined community - is established, there is room for differ- 
ences in the similarity: you have to show both that you are similar to your neigh- 
bours, but also that you are not identical to them; cf. the notion and process of neigh- 
bour opposition (naboopposition; cf. Larsen 1917), which says exactly this. In 
a view where language, culture and mind are inseparable, contact is a process, a dy- 
namic notion that has to do with continuous adaptation, not merely with different 
stages following each other. Change is dependent not only on internal system 
changes, but also on the influence of the external processes of culture, ideology, 
speech communities, identity, and the implicit attitudes expressed between the lines 
of what is explicitly said. 

Many features (both on a macro- and on a micro-level) in a language, and es- 
pecially in language change, are affected by contact, but it is not clear in what 
direction the influence has taken place. The features we see as the result of various 
processes are seldom derived from something superficially concrete in another lan- 
guage. We have seen that semantic systems as a whole (cf. the demonstrative ad- 
verbs) are in contact; and we have discussed the possibility that something like a 
particle-question, most likely pragmatically defined, can make up a system that can 
be taken over or enhance a development in another languge. Thus, we see that dif- 
ferent kinds of particle-questions are being developed, e.g. utterance-final particles 
(tá, dà, eld, na) in Solv. Maybe what is being borrowed is, indeed, the concept par- 
ticle-question; and like all pragmatic concepts, they tend to get borrowed very 
early on in a contact situation. 

Norden has not only been in contact to the east and to the south(-east). We have 
already mentioned the North Sea littoral as a comparative area of influence to the 
Circum-Baltic area. (See e.g. the discussions of relativisation on the North Sea 
littoral in Poussa 2002.) We have also seen the tremendous impact of the Hansa in 
the area for several hundred years, and how Low German in particular spread to 
Norden as a direct result of intricate mercantile and other relationships. Influences 
from further away are also well known in history, starting with what we know of 
the Fenni from Tacitus, and especially in view of the contact the Vikings had with 
the rest of the world more than a thousand years ago. 
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At present, Norden is still a joint community internally, but with respect to in- 
ternational politics it is overtly divided in that Finland, Sweden and Denmark are 
part of the European Union, whereas Norway and Iceland are not. So far this has 
not had a major effect on everyday activities in Norden. What has been conceived 
of as a greater problem is that of globalisation generally, and in particular Norden's 
relation to the Anglo-American global village. 

Globalisation as such has probably had the same kinds of effect on Norden as 
elsewhere, in business and higher education more directly, but its effects have nat- 
urally been experienced locally as well. As Verschueren et al. (2002) point out, it is 
not only globalisation as democratisation as such that is a challenge, but the very 
idea that virtually anybody in the world can get to know what is happening in one's 
own small community will have an effect on our behaviour, also on our linguistic 
behaviour. 

In Norden the language councils and committees that were established in the dif- 
ferent speech communities after World War II have worked together to come up with 
joint strategies to counter too much ‘foreign influence’ — in practice, loanwords and 
structures from English. The language policies that have been advanced in the Nor- 
dic countries have many similarities, since the speech communities are up against 
the same kinds of processes: (a) loss of linguistic domains, i.e. that the native lan- 
guages are losing their creativity and usefulness in certain domains, most signifi- 
cantly in science, technology, and medicine, but also elsewhere; (b) language rights; 
on the one hand that minority languages are getting more recognised, but on the 
other hand that national education fails to support the learning and teaching of other 
Nordic languages to the same extent as used to be the case — the man and the woman 
in the street want their children to learn English as soon as possible; and (c) the chal- 
lenges of parallel language teaching, especially at university level, where the more 
recent goal is to a/so teach courses taught in English in the national language. 

So far the success of the work of the language planning agencies is hard to 
assess; sure, one cannot know what would have happened if the language councils 
had not existed, but it is notoriously difficult to influence an already wide-spread 
development. Josephson (2009) argues that there are at least three factors that 
influence the impact of the planning: (1) practice, Le people's language use and 
language choice in everyday communication; (ii) attitudes and ideologies, both in 
relation to English and in relation to one's mother tongue and one's neighbouring 
languages; and (iii) the ineffectiveness of laws and regulations, not to mention lan- 
guage planning as such. 

What will most likely not change is that the influence of English will get 
stronger, that the Nordic speech communities will get more and more plurilingual, 
that some (standard) languages will lose their former glory (with some minority 
languages seeming to gain in importance), and that we have a media explosion on 
all fronts, with obvious linguistic effects. But the Nordic (imagined) community 
will prevail in some form: the traditional languages have a high degree of simi- 
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larity, there is on-going interaction across the borders, and there is a continuous 
process of common-norm building and symbolic interaction. 

The Nordic today is characterised by political, academic and cultural cooper- 
ation. Ideologies and identities are negotiated in relation to forces from outside, in 
particular in relation to factors like the loss of linguistic domains, which have to do 
with globalisation. 


6. Glocalisation 


On the basis of our knowledge of history and linguistics we can learn about the ex- 
ternal causes of a language contact situation, and we can see how the language-in- 
ternal system adopts and moulds the elements that come in from another language. 
We can approach the results and processes of contact deductively by referring 
results to concepts like grammaticization and analogy. It is more difficult to show 
the direct relevance of notions like imagined community, hegemony, ideology, 
attitudes and ambivalence. Still, these are notions that we know have an effect on 
language change in contact situations today; so there 1s little reason to discard them 
in searching for an understanding of earlier contact situations. What linguistics 
talked about in earlier times as superstratum and substratum might today have to be 
couched in terms of globalisation and glocalisation, respectively. Maybe in the 
end, all grammars are glocal grammars: grammars that appropriate elements from 
other languages and make them their own. 

Contact is not a secondary phenomenon. It is too essentialist to think that L1 
and L2 exist, and then come into contact with one another. Especially in our pres- 
ent-day society of late modernity, this is not a fruitful starting point. Variation, 
variability and adaptability are central concepts for understanding contact, and 
have always been. Besides, convergence is just one aspect of contact; divergence 
as a strategy is just as important. 

In the MIN-project (Moderne importord i spräka i Norden), which was in op- 
eration over much of the first decade of the 21st century in the Nordic countries, the 
focus was precisely on how modern imported words and structures are experienced 
and dealt with by ordinary people in seven speech communities in Norden (the Ice- 
landic, Faroese, Danish, Norwegian, Swedish, Finnish-language Finnish, and 
Swedish-language Finnish). (For an overview, see e.g. Sandøy 2007.) 

Some of the results of this project are fairly surprising. A simple frequency 
count of the number of loanwords and structures that have been adopted into 
Norden shows that the number has not risen markedly during the last half century. 
But despite this, English is experienced as a threat. Informants' explicit opinions 
about English and English loanwords are generally negative, but when we look at 
language contact on an implicit level (measured e.g. in matched-guise tests and 
analyzed with discourse-analytic methods), in some communities (e.g. in Swedish- 
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language Finland; Mattfolk and Kristiansen 2006) a moderate use of English is 
highly appreciated. That is, people's perceptions might be very different from their 
implicit attitudes — and it is typically the implicit attitudes that govern language 
change. Globalisation is pressure from above. In the Nordic democratic way of 
thinking, you are thus supposed to be negative towards globalisation; but impli- 
citly you can appreciate it. The importance of the local is stressed with reference to 
tradition, but no one wants to live in a museum. Or are the MIN-results really an 
expression of self-colonisation, where the implicit goal is not to become equal to 
the invader language, but to become as good as one thinks one can be? That is, one 
is restricted by one's own conception of what is the most one can attain, and that 
one should be satisfied with that — a kind of colonisation from within. Glocalisation 
is then really a countermove against self-colonisation — as well as a countermove 
against globalisation. 

We saw earlier that there are several stages that can be discerned in the history of 
the imagined community Norden: (1) the mythical good, old days; (2) the rise of 
Scandinavianism at the end of the 19th century; and (3) the cooperative establish- 
ment of language planning agencies and committees in the different Nordic speech 
communities after World War II. What we experience in Norden at present, i.e. glo- 
calisation, is a new wave of Nordic cooperation, a new wave of ‘the Nordic project’. 

The glocal ideology incorporates and appropriates the global and makes it one's 
own. For instance, Swedish-language Finland is pressured from at least three direc- 
tions: (1) the global cultural and linguistic influence from English; (2) the influence 
from Finnish as the majority language and culture in Finland; and (3) the normative 
big-brother influence from Sweden. All these affect the structure, the usage and the 
ideology of Finland Swedish. And in the system of the language the results of these 
different types of contacts take the form of a glocalised appropriation. 

In contradistinction to hegemony (in terms of Foucault and Gramsci), ambiva- 
lence is the counterforce that shows the way for how contact situations can be 
handled by people living in contact situations. We hear a lot about dialect levelling 
and how the linguistic landscape is rapidly changing. But levelling is only one, he- 
gemonic, side of the coin — the globalisation side. At the same time, at the grass- 
roots level, regionalisation is shaping new ways of speaking. Both levelling and re- 
gionalization are the results of language and culture contacts, but they differ in the 
mentality of the speakers of the affected languages. 

Whether Norden will retain its status as an imagined community or not will to a 
large extent depend on the implicit attitudes of the people and peoples themselves 
in the North of Europe. Contact situations are always complex, but we will get a 
better understanding of Norden if we do not only see it as the North of Europe, but 
also treat it as a Sprachbund, as a linguistic area. 


See also the following chapters in this volume: 17 by Wilchli, 20 by Stern, 32 by 
House, and 33 by Pahta and Taavitsainen. 
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of Europe 


Dieter Stern 


1. Introduction 

2% The remote past — Eastern Slavs, Finns, and Varangians 

3. Trade pidgins in the Northeast of Europe: Focus on Russenorsk 
4. Surzhyk and Trasianka 

1. Introduction 


Dealing with historical and recent contacts of Russian, Harald Haarmann (1999: 
782) categorizes language contacts in general into ‘interlingual-interethnic’ and 
“interlingual’ ones. Notwithstanding possible theoretical pitfalls of this distinction, 
it provides a convenient tool for distinguishing contacts based predominantly on 
written sources (‘interlingual’) from those deriving from immediate face-to-face 
interactions on a daily basis (‘interlingual-interethnic’ ). In what follows, I will ex- 
clusively deal with face-to-face contacts, i.e. what Haarmann calls ‘interlingual-in- 
terethnic’. This will exclude the effect of learned borrowings from further con- 
sideration. The geographical restriction to East and Northeast Europe in fact means 
Russian language contacts. 


2; The remote past - Eastern Slavs, Finns, and Varangians 


The common ancestor of modern Russian, Belorusan, and Ukrainian, was an unfo- 
cussed group of East Slavic dialects, which formed a continuum with West Slavic 
dialects. It was surrounded by Baltic and Finnic languages. The contact relation- 
ship between Slavic and Baltic is hard to disentangle due to their genetic proximity 
and will not concern us here. More promising from a contact linguistic perspective 
is the close, almost symbiotic historical neighborhood of Eastern Slavs and Finns. 
Contact between both linguistic groups has up to the present day been determined 
by language shift from Finnic to East Slavic. Veenker (1967) is still a valuable 
compilation and discussion of the evidence for possible Finnic substrate influences 
in Russian. The features discussed in his book are supposed to date back to the pre- 
history of East Slavic. The most probable period would be the one from 700 AD to 
1000 AD. Veenker identifies seven structural features as unquestionable outcomes 
of substrate influence: 
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a. The merger of /a/ and /o/ into /a/ in unstressed syllables, known as akanye, 
has an almost exact match in Moksha-Mordvinic, a Volga-Finnic language at the 
southeastern fringe of the East Slavic territory. The case for a Volga-Finnic origin 
is supported by the fact that akanye did indeed spread from a southeastern direction 
across the East Slavic dialect area. There 1s, however, also a discontinuous area in 
the northwest, where akanye 1s particularly strong, which cannot be accounted for 
by a Finnic substrate. 

b. A more tenuous case is the loss of the copula throughout East Slavic with a 
few pockets of copula retention in the westernmost dialects of Ukrainian. There is 
only one Finnic language, Komi, which parallels East Slavic in this respect: 


(1) Rus ja  celovek 
Kom mie mort 

I human 
‘Tam a human.’ 


What is more, Komi probably had no contact history with Slavic prior to the 11th 
century, the time of the first Novgorodian expedition into Komi territory, which 
is situated in the remote northeastern corner of Europe at the slopes of the Ural 
mountains. The phenomenon appears to have spread into Eastern Slavic from the 
Volga-Oka area, where Eastern Slavs lived side by side with the Finnic Muroma 
and Merya. Unfortunately, nothing is known about their languages. However, the 
Russian construction need not necessarily present an exact copy of an underlying 
substrate construction. A possible alternative model would be the type of con- 
struction found in Erzya-Mordvinic, which is spoken in the Volga-Kama area 
close to the original center of irradiation. In Erzya, nominal predicates take on a 
personal affix, which agrees with the subject NP. The equivalent for (1) would 
be: Mon loman’-an, with the personal affix -an for the 1st person singular. The 
resulting construction in Russian would be due to contact-induced simplifi- 
cation. 

c. A rather obvious parallel, which is backed by the majority of Finnic lan- 
guages in contact with East Slavic, is the adessive construction for the expression 
of possessivity on the sentence level: 


(2) Rus U menja knig-a 
at ISG.GEN book-NOM 
Fin Minulla on kirja 
1SG.ADE COP book.NOM 
‘I have a book. 


Despite the different appearance both constructions are absolutely equivalent. The 
prepositional phrase u menja ‘near me, at me’ semantically corresponds exactly to 
the adessive case construction in Finnish. Belorusan and Ukrainian also make use 
of the transitive strategy common to most European languages: 
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(3) Ukr (1) U mene knyh-a 
near 1SG.GEN book-NOM 
Ukr (2) Ja ma-ju knyh-u 
1SG.NOM have-1sc.PRs book-Acc 


The intermediate position of both Ukrainian and Belorusan reflects their signifi- 
cantly weaker direct exposure to Finnic as well as their long-standing cultural ties 
with Polish. The features mentioned so far are common to all East Slavic dialects 
and have accordingly found their way into the modern standard languages. There 
are other noteworthy substrate effects which are restricted to the northern and 
northwestern dialects of Great Russian. Accordingly, these dialects may be as- 
sumed to represent the core area of Slavic-Finnic contacts. 

d. A prominent feature of supposedly Finnic origin is the merger of Common 
Slavic /ts/ and /¢/ in the northwestern dialects of Novgorod and Pskov. It is abun- 
dantly recorded in birch bark letters, which have been unearthed in Novgorod and 
surroundings and in liturgical manuscripts from Novgorod ever since the 11th cen- 
tury. The phonological opposition between /c/ and /č/ is unknown to Finnic lan- 
guages so that a straightforward case can be made for a substrate influence. 

e. A geographically more limited feature is a comparative formed from nouns 
to express relative location: 


(4) Rus bereg na bereg-u berez-ee 
shore.NoM on shore-Loc  shore-COMPAR 
Fin ranta ranna-lla rann-emma-lla 
shore.NoM shore-ADE shore-COMPAR-ADE 
Translation ‘shore’ ‘on the shore’ ‘closer to the shore’ 


This feature is restricted to Russian dialects of the Russian-Karelian contact zone. 

f. An areal feature of wider distribution, encompassing North Russian dia- 
lects, Finnic, and Baltic, especially Lithuanian, is the use of the nominative for 
the direct object in impersonal necessitative constructions (cf. also chapter 17 by 
Walchli, this volume): 


(5) Rus nado  peck-a topi-t’ 
Lith reikia ` krosn-is kurin-ti 
NECESS stove-NoM light-INF 
Fin pitää  lämmitt-ää uuni 
NECESS light-INF ` stove.NoM 
“The stove ought to be lighted.’ 


This feature is well documented in Old Russian texts. It is obviously contact-in- 
duced, its ultimate source being Finnic. Timberlake (1974) accounts for the non- 
canonical case assignment in terms of a prominency hierarchy of morphological 
case. In the impersonal modal construction, which lacks a canonical subject, the 
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object moves up in the argument hierarchy to the position of the most prominent 
argument and is accordingly assigned the nominative case. 

g. Finally, North Russian dialects have developed a resultative perfect con- 
struction, which is obviously based on the locational possessive construction dis- 
cussed in (c) and was accordingly given the label of possessive perfect. Like the 
nominative object this construction has attracted the attention of linguists for its 
unusual properties: 


(6) Rus u ego knig-a procita-n-o 
at 3sG.GEN book-NoM.F read-PTCP-N 
"He has read the book.’ 
(7) Rus u volk-ov tut ide-n-o 


at wolf-GEN.PL here go-PTCP-N 
“Wolves have walked along here.’ 


Veenker (1967) unconvincingly attempts to derive this construction from the mar- 
ginal use of the adessive as an instrumental in Finnic, which, he maintains, inspired 
the subject in the adessive in North Russian. A more probable account can be given 
by assuming the extension of the Russian-Finnic possessive construction to cover 
also the resultative perfect, copying the analogous model of the common European 
transitive perfect construction with auxiliary have. Since there is no Finnic model 
for this, a Scandinavian influence would have to be assumed, as was indeed pro- 
posed by Panzer (1984). The parallel to the auxiliary construction with have would 
also imply that the prepositional phrase should function as a proper subject. Tim- 
berlake (1976) demonstrates that the prepositional phrase indeed possesses genu- 
ine subject properties, such as being resumed by an empty subject pronoun in a co- 
ordinated construction, as in the following example: 


(8) Rus Vot udocka u tebja by--a | by vzja-t-a, 
ADV fishingrod at 2sG.GEN be-PsT-F COND take-PTCP-F 
vot nalovi-l-ó togda. 
ADV catch-Psr-M then 
‘If you had taken the fishing rod along, you would have caught [some fish].’ 


The North Russian possessive perfect leads on to yet another important ethnoling- 
uistic group in the earliest history of the Eastern Slavs, namely the Norsemen or 
Varangians. These came as traders to the lands of the Eastern Slavs and stayed as 
rulers. Their linguistic heritage is generally said to consist of hardly more than a 
few vocabulary items and a couple of personal names like Igor’ from Ingvar. 
Indeed, there seems to be little else which would corroborate the assumption of 
structural interferences from Scandinavian. Panzer (1984) points to the North 
Russian post-positive article with its obvious parallel in Swedish and Norwegian. 
Ureland (1977) demonstrates that Russian and Swedish both developed a bound 
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marker of intransitivity out of a reflexive clitic. This bound marker is used in 
both languages for passive-like constructions (cf. also chapter 17 by Wälchli, this 
volume): 


(9 Rus On  otkryvaet dver’ Dver’ otkryvaet-sja 
Swd Han oppna dorr-en Dérren Oppna-s 
He open.3sc.prs the-door The-door open.3sG.PRS-REFL 
“He opens the door.’ "Ihe door opens / is being opened.’ 


Finnic-Russian contacts are not merely a phenomenon of the remote past. In fact, 
everyday contacts between both languages have never come to an end and still con- 
tinue to produce interesting results. The most recent development is the possible 
emergence of a bilingual mixed code in Karelia. Sarhimaa (1999) gives a detailed 
account of the extensive code-mixing practices among bilingual speakers of Kare- 
lian and Russian, where a common stock of lexical items triggers unconscious 
switches, or rather glides, from one language to the other. In the more extreme 
cases of mixed conversation, which Sarhimaa labels Karussian, ethnic flagging of 
the lexical material seems to have broken down almost completely. It appears that 
the switching does no longer convey specific social values pertaining to the lan- 
guages involved. Instead the mixed code itself seems on the way of becoming a 
hallmark in its own right for the bilingual rural communities of Karelia. 


3. Trade pidgins in the Northeast of Europe: Focus on Russenorsk 


The northeastern fringe of Europe was home to three trading pidgins. The best- 
known is certainly Russenorsk, which is also the only one which has been docu- 
mented in sufficient detail and will thus be the focus of this section. Second comes 
Borgarmálet, a Swedish-Saami jargon used by Saami traders. In order to account 
for the alleged Saami features of Russenorsk, Broch and Jahr (1984: 69—71) pro- 
posed the idea that Russenorsk is in fact a relexification of Borgarmálet. However, 
almost nothing is known about Borgarmälet. The only source is a travel account 
(Hógstróm 1847), which gives the name of the language and also a short dialogue, 
which does not allow for a judgement about a possible relationship to Russenorsk, 
however. À better documented case with obvious formal connections to Russe- 
norsk is Solombala Pidgin English, which was in use between sailors of English 
and other origin and Russian residents at the port of Arkhangel'sk (Broch 1996). 
Like Russenorsk, this pidgin employs a generalized verb marker -om, which 
qualifies as conclusive evidence for a common source for both pidgins. More de- 
tailed overviews of Russian-lexifier pidgins are given in Stern (2002). 
Russenorsk (RN) emerged out of trading contacts between Russian merchants 
from the White Sea littoral and Norwegian peasants and fishermen of Northern 
Norway (Finnmark and Troms). The Russian merchants bartered flour, grain, 
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hemp, birch bark, wooden boards, etc. against fish, especially herring, which was 
in great demand in Russia as fasting food. The Russian merchants embarked on 
their journey to Northern Norway at the beginning of summer from Arkhangel'sk, 
Kholmogory, Sumskoj posad, and Kem as soon as the ice had melted away and re- 
turned late in summer. The origins of this barter trade can be traced back to the be- 
ginnings of the 18th century, but only with the Russian-Danish treaty of friendship 
of 1782 — Norway was then part of Denmark — and the permission of free border 
trade did trade activity increase significantly. At the beginning of the 20th century 
the barter trade started to decline, with World War I and the October Revolution 
finishing off what had no future anyway. 

The first to make the pidgin known to a wider scholarly public was Olaf Broch, 
who published a grammatical sketch with etymological commentaries (1927) and 
a collection of the extant texts (1930). Initially, his findings did not meet with much 
attention. Broch and Jahr (1981, 1984) present a sum total of the extant knowledge 
and pertaining theories about the pidgin with an appendix of all known source 
materials. Since their seminal work on the pidgin, no additional source material has 
been found. 

RN is a rather rudimentary pidgin, which at various times has alternatively 
been classified as a prepidgin or stabilized pidgin. It stands out as one of the few 
attested dual-source pidgins, with an almost equal share of Russian and Norwegian 
vocabulary along with English, Dutch or Low German, Saami, and French lexical 
elements. The English and Dutch/Low German elements indicate a possible nauti- 
cal source for RN. It also links RN in an obvious way to Solombala Pidgin English. 
Even though its grammar and vocabulary reveal a high degree of variability, a mor- 
phological core structure can be identified. There are four more or less stabilized 
features which constitute what Dan Slobin (1979: 43—46) has presented as the 
grammatical minimum for a language to function. Except for the fourth, all fea- 
tures are apparent in the following example: 


(10) moja tvoja på vater kast-om 
lsG 2sG prp water throw-V 
‘I throw you in the water.’ 


a. moja, tvoja as generalized forms of the pronoun for the 1st and 2nd person: The 
forms correspond to the Nom. sg. fem. of the Russian possessive pronoun. The ex- 
tant sources offer only two contexts for 3rd person pronouns. One is represented by 
the Russian Nom. sg. masc. on and the other by the equivalent Norwegian form 
han. There is an intriguing parallel to Chinese Pidgin Russian, which has also gen- 
eralized the forms moja and tvoja. Kozinskij (1973) takes this as the basis for his 
hypothesis that all Russian pidgins known to him — besides RN and the Chinese 
Russian Pidgin this is a presumed Turkish-Russian pidgin of the Caucasus, which 
is attested in one of Tolstoy's Caucasian novels — derive from a common ancestor. 
This monogenetic theory has been taken up and elaborated by Perekhvalskaya 
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(2006), who postulates a pan-Siberian protopidgin (sibirskij pidZin) as the source 
for all Russian lexifier pidgins. The parallels indeed seem hard to explain other- 
wise. Lunden (1978), however, presents a possible alternative. This feature may 
have been borrowed from Chinese Pidgin Russian by long-distance traders who 
did business in Kyakhta as well as in Arkhangel’sk. 

b. pá as a generalized preposition used for peripheral case relations: The form 
has a double etymology in the Russian preposition po and the Norwegian preposi- 
tion på, which also overlap in meaning. RN pd serves a variety of other functions, 
which have not yet been completely understood, but may in part derive from the 
homophonous Russian prefix po- denoting perfective aspect. Bakker (1987) tries 
to shed light on the somewhat enigmatic multifunctionality of pá by putting it in a 
comparative perspective. He compares the RN multi-purpose preposition and pre- 
verbal marker pá/po to similar elements in a variety of other pidgins, and con- 
cludes that the multifunctionality is due to continued interferences among a set of 
maritime pidgins. His conclusions prompted the criticism of Belikov and Perekh- 
valskaya (1989). Jahr (2003) intended to settle the question once and for all. His at- 
tempt to derive the preverbal usage of po from the prepositional usage in temporal 
expressions like po morradag ‘tomorrow’, however, misses the point. Most of the 
preverbal uses can easily be related to Russian usage, where po- can be prefixed to 
almost any verb to denote a short period of time: Russian spat’ ‘to sleep’ > pospat’ 
‘to take a nap’. RN would have slipom ‘to sleep’ > på slipom ‘to sleep a little’, 
analogously to Russian. 

c. The general and invariant verb marker -om, as in betal-om ‘to pay’, 
drikk-om ‘to drink’, smotr-om ‘to look’, kral-om ‘to steal’: This marker has at- 
tracted by far the greatest attention but its etymology has remained unsolved to this 
date. Broch (1927: 249) was the first to give it a try. Discussing the probable pro- 
nunciation as well as various etymological options, he ultimately derives -om from 
the Swedish adhortative as it is found in sjungom ‘let us sing.’ His etymology did 
not meet with approval because it involved a peripheral morphological element 
from a language which had no share in the emergence of RN. A more reasonable 
alternative was at hand with the Russian 1st prs. pl. pojdjom ‘we go’, which is very 
frequently used as an invitation to move to another place (‘Let’s go.’). But the 
problem with this isolated verb form remains basically the same. It is hard to im- 
agine a grammaticalization path to a general categorial marker. Further etymologi- 
cal suggestions are summed up by Broch (1996: 96—97). Laakso (2001) once again 
looks for a possible Saami substrate influence and comes up with the marker for 
verbal nouns -mA. However plausible a verbal noun marker may seem as a starting 
point for the grammaticalization of a general verb marker, the complicated mor- 
phophonetics of the form stands against this evolutionary path. The most recent at- 
tempt is Stern (2007), who traces the marker back to the Cape Dutch pidgin, which 
has a marker of identical form and meaning. The marker may have made its way 
into RN through Dutch whalers who went out for whales around the Cape of Good 
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Hope as well as the North Cape, and had regular contacts with the resident popu- 
lation on the littoral. It should be noted that by far not all attested verbs of RN show 
the ending -om. 

d. Finally, there is the noun marker -a. This is even less grammaticalized than 
its counterpart, the verb marker -om. The noun marker is found with nouns 
(kleb-a < Russian chleb, daga « Norw. dag) and adjectives (bela « Russian 
belyj). The a-ending of the generalized pronouns moj-a, tvoj-a could also be in- 
terpreted as representing the noun marker, thus giving it an alternative expla- 
nation as an internal development of RN. The noun marker itself can be derived 
from various Russian nominal endings, which may in fact have formed a com- 
bined input to set the grammaticalization process going. Russian feminine nouns 
have an ending -a in the nom. sg. as e.g. vydra ‘otter’ (attested as vudra in RN), 
whereas masculine and neuter nouns have a homophonous ending -a in the gen. 
sg. as platka (nom. platok ‘cloth, scarf, handkerchief’ ). Interestingly, the latter is 
attested as platka in RN, with the stem alternation for oblique cases platk- instead 
of the nominative stem platok. This confirms the participation of the Russian case 
endings in the grammaticalization process. An additional source is possibly Nor- 
wegian foreigner talk. As Broch (1927: 227) points out, nouns ending in -a sound 
exotic to Norwegians. A Saami influence has also been discussed. Saami adds 
an -a to monosyllabic nouns borrowed from Swedish, as in biila ‘car’ « Swedish 
bil. In this case, however, etymologically monosyllabic nouns in RN would be 
expected to take on the a-ending more often than polysyllabics, which is not the 
case. 


4. Surzhyk and Trasianka 


The newly independent states of Belarus and Ukraine witnessed in the past two 
decades the renegotiation of the status of the new state languages Belarusan and 
Ukrainian in relation to the former official language of the Uniate state, Russian. In 
contrast to the other former Union republics, matters have been additionally com- 
plicated by the close genetic proximity of the languages in question. All of them 
belong to the eastern branch of the Slavic language family. This has facilitated the 
emergence of a continuum of fused lects on the level of everyday communication, 
which combine elements of Belarusan and Ukrainian with Russian elements. In the 
process of this fusion of language varieties Russian is unconsciously treated as the 
acrolectal roof language despite the fact that both Belarusan and Ukrainian have 
their own officially recognized and institutionalized standards. The typical out- 
come of mixing are Russian-lexified speech varieties with a mixed Russian-Bela- 
rusan/Ukrainian phonetic and morphological matrix, which are known as Surzhyk 
in Ukraine and Trasianka in Belarus. The following example contrasts Surzhyk 
with its Ukrainian and Russian standard equivalent: 
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(11) sur brat moj mnoho rabo:ta-ó u horod’-e. 
UKR brat mij bahato prac'u-je v mist’-i. 
RUS brat moj mnogo raboto-jit v gorod i. 


brother my much  work-3sc.PRs in city-Loc.sG 
*My brother is working a lot in the city.' (Bilaniuk 2005: 127) 


Lexically, the Surzhyk sentence shows a complete match with Russian. On the 
phonetic level, Ukrainian /h/ is used instead of /g/, which is the etymological Rus- 
sian equivalent. On the other hand, Russian vowel reduction shows up in mnóho, 
though not in the second syllable of hórod'e. A similar example can be given for 
Trasianka: 


(12) TRA jana taryu-je cv 'at-ámi na rynk’-i 
BEL jana yandlju-je | kv'étk-ami na irmas-y 
RUS ana targu-jit cv 'itámi na rynk’-i 


she  trade-3sgprs flower-iNsTR.PL on market-Loc.sG 
*She trades in flowers at the market-place.' (Lishovec 2002: 330) 


Here again there is a complete lexical match between Trasianka and Russian with 
distinct Belarusan phonetic features, as /y/ instead of Russian /g/ and the realiz- 
ation of unstressed /e/ as /a/ (yakanye) instead of Russian Ju (ikanye) in cv’atami. 

Surzhyk and Trasianka are only beginning to attract scholarly attention. For 
Surzhyk, Flier (1998) has undertaken the first attempt to identify the formal par- 
ticulars. The sociolinguistic side of Surzhyk has been given in-depth treatment by 
Bilaniuk (2005), who illuminated the wider background of language ideology and 
power relations, which are negotiated through language in Ukraine. An overview 
of the formal and social aspects of Trasianka is given by Liskovec (2002). For both 
Surzhyk and Trasjanka see also Hentschel and Zaprudski (2008). 

Language mixing in Surzhyk and Trasianka, which from the perspective of dia- 
lect levelling and urban koineization in other European languages appears as the 
most natural thing to happen, has become emblematic of the present Ukrainian and 
Belarusan language debate and is interpreted as a symptom of cultural decline fol- 
lowing the dissolution of the Soviet Union. Thus, the derogatory terms Surzhyk 
and Trasianka function in public discourse more often than not as instruments of 
cultural polemics than as descriptive categories. Most speakers still adhere to an 
ideal of linguistic purity as a key concept for language choices but due to the break- 
down of institutional authority following the political turnover of 1989, a process 
of active participation in the negotiation of what makes a pure language has set in. 
Ever since 1989 the political readjustment of power has been paralleled by a genu- 
ine democratization of language. In the search for new or re-newed identities, Uk- 
rainians, and to a lesser extent Belarusans, have set out on a personal quest for the 
linguistic form which most suits their identity, and it seems that traditional institu- 
tions of language instruction have lost much of their former almost absolute power 
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over the speech community at large. Attempts at regaining institutional control 
over language use have resulted in manuals of proper Ukrainian usage. The process 
of renegotiation created a multi-layered pattern of language use oscillating be- 
tween the extreme poles of Standard Russian, which still looms large, and Standard 
Ukrainian or Belarusan. There may be as many varieties of Surzhyk or Trasianka 
as there are patterns of ethnolinguistic and social reorientation. The often repeated 
simple story of Ukrainian or Belorusan boors coming into the cities and trying to 
speak proper Russian is in no way capable of capturing the complexity of the pro- 
cess. 
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1. Introduction 


The Southwest of Europe, which for the purpose of this chapter is defined as em- 
bracing the Iberian Peninsula, the Balearic Islands and the Southern and South- 
western parts of France between the Gironde and the Rhóne estuaries, is a highly 
interesting area for the study of both historical and contemporary language contact 
situations. This is mainly due to five reasons: the early and intense Romanization 
of large parts of the territory; the historical and continuing presence of Basque, the 
only language isolate in Europe and, at the same time, the only surviving non- 
Indo-European language in Western Europe, in the core area of this territory (see 
chapter 10 by Haase); the cultural impact of the Arabic culture during the long- 
lasting presence of Muslims on the Iberian Peninsula; furthermore, in more re- 
cent times, the political and demographic changes in Spain, which, due to efficient 
language planning in favor of the Spanish regional languages, have led to new 
types of language contact situations; and, finally, the fact that the urban and coastal 
areas of this territory have attracted migrants from different linguistic and socio- 
economic backgrounds, a fact from which still more new types of contact situ- 
ations ensued. 

This chapter will briefly address the issue of the most important and — as far as 
the present linguistic landscape is concerned — most decisive historical language 
contact situations from Antiquity to modern times (section 2), but the main focus 
will be on different facets of the current multilingual constellations found in south- 
western Europe and concomitant situations and processes of language contact 
(section 3). 
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2. Historical language contact 


In the entire area under consideration here, Romance languages are present and 
spoken by the vast majority of the population as primary languages. These lan- 
guages developed from the spoken varieties of Latin, which were brought into this 
territory from the late 3rd century BC onwards by soldiers, merchants and colon- 
ists through the military, economic and cultural expansion of the Roman Empire. 
Coastal areas along the Mediterranean Sea were romanized first (from 197 BC 
on the Iberian coast and from about 120 BC on the Gaulish coast), with inland 
Romanization advancing at a slower pace, while some mountainous zones in the 
Pyrenees resisted thorough Romanization until the Middle Ages. For a long time, 
scholars of Romance linguistic history have emphasized the impact of substrate 
and superstrate languages for the divergence of the Vulgar Latin spread by the 
Roman settlers, and for the configuration of today's Romance-speaking Europe. 
The definition of substrate and superstrate languages implies a period of language 
contact of variable length that ends in the loss of one of the languages involved, the 
conquerors' tongue in the case of the superstrate language and the conquered popu- 
lation's tongue in the case of substrate languages. A third term, adstrate, has been 
coined for a language that is in (long-lasting) contact with (an)other language(s) 
and exerts influence on it (them). This is a rather infelicitous terminological triad 
since both superstrate and substrate languages function as adstrate languages for 
a significant stretch of time, and after territorial retrenchment without complete 
extinction, a language may be considered a substrate and adstrate language at 
the same time. Take the case of Basque which is a substrate to Castilian (Spanish) 
in those areas that it has lost since Romanization, and an adstrate to Castilian 
nowadays. 


2.1. Pre-Roman languages and Romanization 


Already before the arrival of the Roman conquerors and settlers, Indo-European 
and non-Indo-European languages co-existed on the Iberian Peninsula. Apart from 
isolated coastal areas where Phoenician (a Semitic language) and Greek had been 
brought in through commercial expansion from Greece and Punic Carthage, Celtic 
languages covered most Western and Central parts of the Peninsula. Iberian and 
Basque were common in the Northern and Eastern areas, and Tartessian, a non 
Indo-European language about which very little is known, was spoken in the 
South. North of the Pyrenees, the Aquitanian language, which is now generally 
considered to be a precursor of Basque, met with Celtic languages in the North and 
with ancient Ligurian, probably an Indo-European language which again is 
scarcely documented, in the East. Two controversial issues have long been dis- 
cussed by scholars of the language history of Iberia, and both are related to lan- 
guage contact. The first of these is the status of the Celtiberian language, which 
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covered mainly the Ebro basin and adjacent regions. According to recent advances 
in the interpretation of archaeological testimonies, Celtiberian can be ascribed to 
the Celtic language group and, more exactly, to the most ancient stratum of Celtic 
(Echenique Elizondo 2003: 611—612), even if ethnic, cultural and linguistic con- 
tact between Celtic and Iberian groups, as suggested by the established denomi- 
nation of the language, may have been common in the heartland of Iberia, where 
this language was located. The second controversial issue is the status of the Ibe- 
rian language and its relation to Basque. From the 19th to the mid 20th century pro- 
ponents of the so-called *Vasco-Iberian hypothesis’ (which, in nuce, was brought 
forward as early as in the 16th c. and developed scientifically e.g. by, among others, 
Wilhelm von Humboldt [1821] and Hugo Schuchardt [1907]) claimed Iberian to be 
a direct predecessor of Basque. Nowadays, however, a more cautious position pre- 
vails according to which Iberian and Basque share certain features, but probably 
more as a result of prolonged language contact than of common genetic origin. The 
concept of Sprachbund is sometimes advocated in order to describe the Iberian- 
Basque relation. Iberian may have served as a vehicular language well beyond its 
traditional Eastern-Iberian territory (Echenique Elizondo 2003: 610) and may have 
transmitted some of its features via Basque to Ibero-Romance, namely to Castilian. 
Much of this remains speculative due to the fragmented knowledge of the Iberian 
language system. Basque influence on contemporary Castilian, however, is con- 
sidered to be beyond doubt. Apart from lexical items, some structural features, 
such as the sound change of initial etymological /f-/ to /h-/ in Castilian, are 
attributed to Basque influence, just as the reduced inventory of only five vowel 
phonemes. 

Contrary to the not insignificant impact of substrate languages on Romance 
tongues in Southwestern Europe, Germanic superstrate influence has been very li- 
mited, despite the migration (from the early 5th century onwards) and the domi- 
nant political position of different Germanic tribes in the area, such as the Vandals, 
the Suebi and, most prominently, the Visigoths. Two main reasons can be adduced 
for this fact: first, unlike what happened in Gaul, the Germanic tribes were already 
strongly romanized when they arrived in the Southwest of the continent; and, sec- 
ond, again unlike later developments in Gaul, the linguistic landscape and distribu- 
tion of varieties was heavily reshaped between the 8th and the 15th centuries due to 
the conquest of large parts of the Iberian Peninsula by Arab Muslims (711-732) 
and the subsequent Christian Reconquest (718—1492). 


2.2. Language contact from the Arab conquest until the Early Modern period 


The influence of Arabic (both Classical/written and vernacular/dialectal) on the 
languages spoken in Southwestern Europe south of the Pyrenees has been studied 
in detail as far as the lexicon is concerned. On the phonological, morphological 
or syntactic level, by contrast, only very few, if any, structural features have been 
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identified in contemporary Ibero-Romance languages and in Basque which are 
clearly attributable to language contact with Arabic (or Berber, since many of the 
conquerors from beyond the Strait of Gibraltar were of Berber origin). During 
the period of Arab dominance, a situation of multiple language contact and multi- 
lingualism emerged which involved Classical Latin, late Vulgar Latin and Proto- 
Romance varieties, Hebrew (used as a ritual language by the Iberian Jews) and 
Arabic, leading to a complex constellation of medial and conceptual diglossias. 
The (Proto-)Romance vernaculars used by autochthonous Iberians, Ibero-Goths 
and Jews in the Central and Southern parts of the Peninsula continued to evolve 
under Arab rule; in the 19th century the rather misleading term Mozarabic was 
coined as a cover term for these varieties. With the advances of the Reconquista, 
the Romance dialects of the mountainous Northern fringe of the Peninsula, from 
where the Christian reconquest started, and the subsequent distribution and settle- 
ment of speakers of these varieties, ‘Mozarabic’ was gradually phased out. At the 
same time the Northern dialects evolved into the Romance languages with their 
elongated North-South-stretching territories characteristic for the linguistic map of 
the Iberian Peninsula up to Modern Times, followed by the lateral expansion of 
Castilian (mainly to the detriment of Asturian-Leonese and Aragonese). Interest- 
ingly, among the earliest textual testimonies of Ibero-Romance (as opposed to 
Latin) are some of the so-called kharjas, final verses of muwashshah poems other- 
wise written in Arabic or Hebrew (and in Arabic script). They show salient forms 
of lexical and structural code-mixing, as illustrated by the following transliterated 
example from the 11th century (García Gómez 1965: 320; Romance elements are 
italicized): 


ifen, "indi, habibi! come beside me, my friend! 


seyas sabitore: you should know 
tu huyda samaya your flight is a bad thing 
jim8i, adunu-ni! come on, come together with me! 


Both the copulaless predicative construction in the third line and the imperative 
form based on the Romance verbal root adunar in the last line correspond to 
the morphosyntax of Arabic. Obviously, literary texts like the kharjas cannot 
be considered representations of actual oral practice in Muslim-ruled Iberia. But 
they may well be read as an indication of intense language contact at the time, the 
precise nature of which unfortunately remains unknown and insufficiently docu- 
mented. 

Due to the Reconquista, the (Proto-)Romance dialects of the Northern Iberian 
Peninsula extended their territories of use far southwards, while simultaneously 
forging themselves an identity as autonomous Romance languages different from 
Latin. Basque, by contrast, did not profit from these expansive moves territorially — 
despite the significant presence of Basque speakers in the Castilian troops. This is 
probably indicative of an already high degree of bilingualism within the Basque 
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community at this early medieval stage. Another important facet of post-Recon- 
quista migration and linguistic contact is the increase of pilgrimages to the tomb of 
St James (Santiago) in Galicia in the Iberian Northwest, which developed from the 
9th century and attracted pilgrims from across Europe from the 11th century on- 
wards. Along the main routes on the Northern peninsula, many Gallo-Romance- 
speaking clergymen, traders and craftsmen settled and constituted the — frequently 
economically privileged — social group of Francos. Among these, Occitan (langue 
d'oc) and French (langue d'oil) varieties were current both as mediums of oral and 
written communication (Martínez González 2006). Furthermore, Occitan as a lead- 
ing high variety for literary (especially lyrical) use during the 12th and 13th cen- 
turies had a significant impact on the Romance varieties on the Eastern edge of the 
Peninsula (now forming the Catalan diasystem). Significantly, the first known 
‘grammar’ of a Romance language, the Razós de trobar written by the Catalan poet 
Ramon Vidal de Besalü in the early 13th century, is dedicated to a description of 
(literary trobadoresque) Occitan, and this language remained influential in the 
Catalan-speaking zones until at least the 15th century. 


3. Modern and contemporary language contact in Southwestern Europe 


Today, apart from the small territories of the Principality of Andorra and the British 
colony of Gibraltar (see section 3.2), the Southwest of Europe is divided between 
the three nation-states of France, Spain and Portugal. Linguistically speaking, 
these three countries have in common that their dominant national languages are 
considered as world languages on the basis of their geographical distribution out- 
side of Europe and their high numbers of L1 and L2 speakers. This gives consider- 
able prestige to these languages, a fact that has an obvious influence on the lan- 
guage contact situations and linguistic attitudes in these countries. At the same 
time, all three states have been important colonial powers (in the case of France 
and Portugal until the mid/late 20th century), a fact that has an impact on current 
immigration patterns in these countries. 

On the other hand, France, Spain and Portugal differ significantly as far as lin- 
guistic plurality and language policy are concerned: France has traditionally been 
considered a politically and linguistically centralized state with low esteem for dia- 
lects or ‘regional’ languages and little interest in the promotion of linguistic diver- 
sity, emphasizing the unifying role of but one linguistic variety, namely Standard 
(or near-Standard) French. This attitude was given legal expression in the 1992 ad- 
dition to the French constitution stating that the language of the French Republic is 
French, a clause preventing the ratification of the European Charter for Regional 
and Minority Languages by France (cf. Ager 2008). Along with its proclaimed 
national monolingualism, the French government has pursued a rather ambiguous 
and contradictory language policy during the last decades as, to a certain extent, 
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linguistic plurality has been acknowledged, after all, through measures in the realm 
of education and cultural affairs. Portugal, too, has claimed national monolingual- 
ism for a long time, but contrary to France, where this linguistic homogeneity has 
always been ideologically motivated and ‘constructed’, it corresponds largely to 
the linguistic reality of Portugal. Dialectal divergences are moderate in European 
Portuguese — as in most Ibero-Romance varieties outside the northern fringe of the 
Peninsula — and Mirandese, an Astur-Leonese dialect spoken by some 7,000 
people in the Northeast of the country, is the only territorialized non-Portuguese 
language in Portugal and legally recognized as such since 1998. Post-Franquist 
Spain, on the other hand, explicitly recognizes its linguistic diversity in the 1978 
constitution which, in art. 3.2, declares “the other languages of Spain” (i.e. Basque, 
Catalan, and Galician) as co-official in the corresponding autonomous regions 
alongside with Castilian/Spanish, the official and obligatory language of the whole 
state. However, linguistic plurality is far from being an uncontroversial issue in 
contemporary Spain. According to the Spanish constitution, official and social 
multilingualism is restricted to territorially (and, to a certain extent, ethnically) cir- 
cumscribed areas. In contrast to federally organized states such as Belgium or 
Switzerland, language contact situations between autochthonous languages and 
the state language are therefore limited to these (officially or non-officially) bilin- 
gual regions. Vertical language contact is found in the entire country between the 
autochthonous language(s) and non-autochthonous dialects/languages of migrants, 
but again these processes are particularly salient and problematic (and best studied) 
in the bilingual regions. In Southwestern France and in Portugal, the situation is 
less intricate; vertical and horizontal language contact phenomena (the latter being 
almost nonexistent in Portugal) evolve independently from each other. 


3.1. Language contact between autochthonous languages 


Contrary to what happened in the central and southern parts of the peninsula due to 
the effects of Christian reconquest and resettlement, the autochthonous languages 
in Southwestern France and Northern Spain have evolved continuously and there- 
fore developed as regular dialectally diversified continua. Dialectalization namely 
of the Asturian varieties of Astur-Leonese, Basque, Aragonese and Occitan was 
fostered by the absence of a standardized written form of these languages. Gen- 
erally accepted standard varieties currently exist for Catalan (on the basis of Pom- 
peu Fabra’s early 20th century codification) and Basque (in the form of Euskara 
batua, introduced in the 1960s). The conflicts on the written forms of Galician and 
Occitan — at least in the areas focused on here — were settled in favor of the codifi- 
cations of the Royal Galician Academy and of Louis Alibert, respectively. How- 
ever, the weak institutional support for Standard Occitan and for a coherent lan- 
guage policy has allowed the situation to deteriorate towards advanced language 
decay in many Occitan-speaking areas of Southwestern France. Traditional Occitan 
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dialects have been replaced by Occitan varieties heavily influenced by Standard or 
near Standard French (these are known as francitan) or by tertiary dialects of 
French characterized by Occitan substratal features (Known as Regional French). 
Contact phenomena in French-influenced hybrid Occitan varieties are found in the 
lexicon, of course, but also on the phonetic level (i.e. substitution of apical [r] by 
uvular [R]) as well as on the morphological and morphosyntactic levels (e.g. the 
decay of simple past forms and the rise of periphrastic present perfect forms anal- 
ogous to the French passé composé). Peculiar features of the Gascony Occitan var- 
ieties are generally well conserved since they constitute emblematic characteristics 
of these varieties (e.g. the preverbal evidential-like *enunciative' particles or initial 
/h-/, which originates from Latin /r-/). A special process of dialect leveling is tak- 
ing place in the Gascony Occitan variety of the Aran valley in Northwestern Cata- 
lonia. This variety is exposed to influence from Catalan, Aragonese and, in recent 
periods, Spanish. The joined effects of officialization of the dialect on the basis 
of a slightly differential codification, increased mobility and a different perception 
of the political frontier with France seem to strengthen linguistic individualization 
of Aranese Occitan despite its declared affiliation with Occitan in France. Another 
situation of convergence has arisen in the French part of the Basque Country: 
traditionally, language contact took place between the Iparralde dialects of Basque 
and the Gascon dialects of Occitan. In modern times, borrowed features from 
Occitan have facilitated transfers from the new prestige language French (cf. 
Haase 1992), and the horizontal dynamics of language contact between geographi- 
cally adjacent languages have been converted into vertical language contact be- 
tween traditional regional tongues and the dominant national language. However, 
such a view inevitably implies much simplification, since today French in South- 
western France is also a deeply rooted language, which is neither any longer con- 
sidered as non-autochthonous nor as ‘imposed’ by the vast majority of speakers, 
not even in rural areas, where French constitutes the primary means of communi- 
cation. 

The situation is different in the bilingual regions south of the Pyrenees and, 
more specifically, in the regions known as ‘historical autonomous communities’, 
i.e. the Basque Country (Euskadi), Galicia and Catalonia. Their rural areas are 
characterized by a generally high degree of dominance of the regional languages 
in informal conversation and partly also in formal contexts, with the national lan- 
guage Spanish frequently being perceived as 'imported' and alienating, despite a 
generally sufficient mastery of it. Zones of intense language contact are therefore 
mainly found in the border areas of the language territories, on the one hand, and in 
the urban and semi-urban areas, on the other hand, where regional language and 
national language clash most saliently. A dialectologically emblematic example of 
the former is the contact zone between Catalan, traditional Aragonese and the ter- 
tiary dialects of Aragonese Spanish (which tend to replace the primary Aragonese 
dialects in the modern age) in Western Catalonia and Eastern Aragón. For Catalan 
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speakers in Aragón, the position at this linguistic crossroad leads to conflicts of 
identity (cf. Fort-Cafiellas 2007). These, in turn, contribute to phenomena of lin- 
guistic hybridization comparable to francitan, whereas such hybrid varieties do not 
exist in other parts of the Catalan-speaking area. A situation comparable to that of 
Eastern Aragón is found at the border zone between Galician and Asturian dialects 
in the western Principality of Asturias. In the heartlands of the historical auton- 
omous communities, however, considerable efforts have been made in both status 
and corpus planning. As a result, the regional languages have been consolidated 
and the former patterns of functional and medial diglossia have been modified or, 
to a certain extent, even reversed, especially in the case of Catalan in Catalonia. In 
all officially bilingual autonomous regions there is an increasing number of people 
who declare to master the basic skills of understanding, speaking, reading and writ- 
ing the regional language involved (i.e. Catalan in Catalonia, the Valencian Region 
and the Balearic Islands; Basque in Euskadi and Northern Navarra/Nafarroa, and 
Galician in Galicia). Recent sociolinguistic research has shown, though, that such 
figures tell us only part of the truth: while competence in using the regional lan- 
guages in written form has indeed gone up remarkably, actual (oral) use of these 
languages does not follow at the same pace or even stagnates (cf. Casesnoves 
Ferrer, Sankoff and Turrell 2006 and Vila 1 Moreno 2008 for data on Catalan; 
Aizpurua Telleria and Aizpurua Espin 2005 on Basque, Monteagudo and Bouzada 
Fernández 2003, on Galician). Generally, and again most prominently in urban 
areas, the numbers of speakers who declare themselves to be bilingual have 
increased. The interpretation of proclaimed bilingualism differs according to the 
region: whereas in the Basque-speaking area it is considered as a positive develop- 
ment by language planners and activists, in Catalan-speaking regions it is often 
seen more critically as a symptom of declining loyalty towards the regional lan- 
guage. 

The bilingual historical autonomous communities in Spain present an array of 
classical contact phenomena in the vernaculars of these areas. Apart from the no- 
toriously frequent borrowings on the lexical level (e.g. with regard to syntactically 
loosely integrated elements such as discourse markers), transfers and calques exist 
on all other levels of linguistic structure, too. Hitherto, such contact phenomena 
have mainly been studied from the perspective of the regional language, but the 
perspective taken on these phenomena has widened in recent years, probably as a 
side-effect of the sociolinguistic consolidation of these languages. So now there is 
an increased interest in studying contact phenomena, language dynamics and 
underlying attitudes from the perspective of the national language Spanish, too 
(cf. the thematic issues 184 [2007] and 193/194 [2008] of the International Jour- 
nal of the Sociology of Language, or Sinner and Wesch 2008 with a focus on the 
Catalan-speaking regions). In accordance with the sociolinguistic finding that the 
relations of diglossic super- and subordination can no longer be described as easily 
as they were in pre-democratic Spain, contact-induced features also seem to evolve 
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more strongly in both directions. For instance, the loss of the distinction between 
voiced and unvoiced /s/ — a phonemic distinction in Catalan but not in Castilian — 
has long been described in terms of Spanish influence on Catalan. But the reverse 
process of introducing voiced /s/ — albeit, for the time being, as a mere allophonic 
variant — is now also attested and described as a Catalan influence on Catalonian 
Spanish. Such insights lead to a new perception of the contact varieties of Spanish 
with autochthonous languages, in that Spanish spoken with a Catalan, Basque 
or Galician ‘accent’ is no longer denigrated as deviant or "bad" Spanish, but as 
full-fledged tertiary Spanish dialects or instances of Regional Spanish. 


3.2. Language contact involving dialects 


The term dialect is notoriously difficult to apply, especially in opposition to lan- 
guage, and this holds for France and Spain in a particularly acute way. Just take As- 
turian and Aragonese (see e.g. Kabatek 2006), for which language status is claimed 
by local activists and some linguistic scholars, while other scholars advocate dia- 
lect status for them (in the sense of: dialects of Spanish). Among the uncontrover- 
sial European Spanish dialectal varieties, the case of Andalusian as the most dis- 
tinctive dialect has attracted special attention (cf. Narbona 2009). A very peculiar 
contact situation, bringing together Andalusian and English, is found in the British 
colony of Gibraltar, which had been occupied by British troops in 1704 and 
integrated into the United Kingdom in 1713. Despite British presence (mainly 
military) for more than 300 years, Andalusian Spanish has remained the dominant 
oral language on the “Rock”, despite the existence of a structurally still Spanish- 
dominant contact variety called Yanito (or Llanito; cf. Levey 2008). This variety is 
characterized mainly by lexical loans from English but also from Caló (the Spanish 
Gipsy language), Italian and other languages of the Western Mediterranean. Thus, 
English terms for realia unknown in or uncommon for Spanish culture up to the 
mid-20th century have been adapted phonologically, yielding loanwords such as 
chinga (« chewing gum) or arishu (« Irish stew) (Levey 2008: 5). Language use 
and language attitudes in Gibraltar are as controversial in research as is the whole 
Gibraltar issue in politics. Many Spanish scholars emphasize the linguistic loyalty 
of Gibraltarians towards their Romance language due to consciousness of their 
Hispanic roots (Martínez González 2006: 1875). British and other non-Spanish 
authors, on the other hand, claim that the use of Spanish is dwindling in favor of 
higher competence and esteem for and use of (British) English as an expression of 
non-Spanish identity and pro-British feelings. The latter have been strengthened 
significantly since the vociferous claims for a return of Gibraltar to Spain by Fran- 
co's government in the 1960's and the Spanish blockade of the terrestrial access to 
the *Rock" between 1969 and 1982. 


402 Claus Pusch and Johannes Kabatek 


3:3. The importance of urban areas and neo-speakers 
for current language dynamics 


It is common sociolinguistic knowledge that urban areas constitute poles of attrac- 
tion and, at the same time, centers from which linguistic innovations and ‘fashions’ 
irradiate, and that cities and conurbations serve as ethnic and cultural foci and 
melting-pots. Traditional language varieties are assumed to be much better pre- 
served in the linguistically conservative rural areas, whereas the urban centers are 
generally associated with dialect leveling and loss of traditional ways of speaking. 
However, language preservation movements and language standardization ini- 
tiatives often originate in urban societies among intellectual elites frequently 
linked to cultural and academic institutions. 

A recurrent figure in urban linguistic activists’ movements is the neo-speaker. 
This is a person whose first language has not been the regional variety but the 
dominant national tongue and who has grown up, and frequently continues to live, 
in a social surrounding where the dominant language is the current means of ex- 
pression. At a certain point in her or his life, this person has decided, however, to 
use the regional variety or language regularly and as consistently as possible for 
daily communicative purposes. Neo-speakers are particularly numerous in the 
Basque-speaking areas, where they are known as euskaldun berriak and tend to 
outnumber native (L1) speakers of the language (Aizpurua Telleria and Aizpurua 
Espin 2005: 51—52), but they are also common in Galicia, in Occitan-speaking 
France, and other areas, too. These neo-speakers, due to their urban background 
and educational qualifications, are frequently active in professional areas that con- 
tribute to language diffusion, such as the media or education (cf. Kabatek 1996 for 
Galician cases). When expressing themselves in their language of deliberate 
choice, they tend to apply the ordinary strategies of L1-to-L2 transfer. Yet at the 
same time they tend to exhibit strong — excessively strong, as some would claim — 
attitudes of linguistic purism, which manifests itself in the over-representation of 
emblematic and often highly conservative features in their speech and in the avoid- 
ance of linguistic elements that could be interpreted as a loan or an interference 
from the dominant language. L1 speakers of the regional variety or language, 
especially elderly people, are usually rather critical concerning the neo-speakers’ 
linguistic output. At the same time, as the situation of Basque suggests, these 
speakers and their innovative and at the same time conservative way of speaking 
will most certainly be decisive for the future of many regional languages, not only 
in the geographic area presented in this contribution. 


3.4. The role of education and media 


The educational sector is known to be a key area of intervention in policies aiming 
at reversing language shift, and it has also become a primary arena for language 
contact. Together with the media, schools are the platforms for diffusion of stan- 
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dard varieties and, as such, contribute significantly to linguistic convergence and 
integration. The ways in which linguistic diversity is handled in the school systems 
differ significantly in the area under scrutiny here. In France, the almost exclusive 
language of teaching is Standard French but regional languages can be taught on a 
voluntary basis since the Deixonne law of 1951. In some regions, including South- 
western France, bilingual programs are now available in public secondary schools, 
as well as (theoretically) monolingual teaching in the regional languages in private 
primary schools (so-called calandretas in the Occitan-, iskastolak in the Basque- 
and bressola schools in the French Catalan-speaking areas). In Spain, Catalan is 
the dominant school language in Catalonia (cf. Mufioz 2005; Vila i Moreno 2008) 
and the Balearic Islands, where immersion methods are applied, while in the Val- 
encian Country predominantly Spanish- or Catalan-taught programs exist side by 
side, with the teaching of Valencian Catalan remaining compulsory though. In Eus- 
kadi a more diversified system with different degrees of Basque presence exists, 
the main tendency nowadays being the preference for the dominantly Basque op- 
tion (cf. Cenoz 2009). In Galicia, Galician is dominant even though Spanish is 
strongly present. As mentioned before, these educational policies have improved 
the basic language skills in the regional languages, generally without reducing the 
students' competence in the national language. However, this heavy presence of 
the regional language(s) as object and medium of teaching and as regular medium 
of communicative exchange within the school has not led to an equal presence of 
these languages in the schoolyard and outside, neither in areas with strong social 
support for this educational policy (like Catalonia or Euskadi) nor in zones with 
weak social backing (such as the Valencian Country or Southwestern France). 
Many children for whom the regional language is not the L1 and/or the ordinary 
means of expression in daily life (and apart from Catalonia and Galicia, these 
children tend to constitute the majority in the classrooms) perceive this language as 
"the language of school", as opposed to the language in usage for “life outside 
school", which for many of them is the national language. This does not imply 
a negative attitude towards the regional language; on the contrary, in the eyes of 
the schoolchildren this language is generally the more prestigious one. Recent 
research in Catalonia (Woolard 2009) has shown that secondary school students in 
the Barcelona area consider Catalan the more cultivated and ‘elegant’ language, 
whereas Castilian is qualified as ‘coarse’ and vulgar, and this independently from 
the L1 background of the students. This means that students know register vari- 
ation — including casual speech and hybrid, contact-influenced forms of speaking — 
for Spanish only, but that no such diastratic or diaphasic variation is perceived for 
Catalan. This partly explains the reluctance to use Catalan in ordinary communi- 
cative contexts. Furthermore, the normative impact of the standard variety, which 
is dominant in the educational system, may also have a surprising effect in that 
children with an L1 background in the regional language experience increasing 
linguistic insecurity with greater knowledge of the norm, while students whose L1 
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is the national language do not feel insecure in the same way when expressing 
themselves in the regional idiom (cf. Baldaquí 2009). 

Another important area for language planning and diffusion of standard var- 
ieties is the media, most prominently the electronic media, which is considered 
to contribute strongly to processes of linguistic convergence. Regional languages 
have a merely anecdotal presence in the media in France, while their role in the 
Spanish media landscape is more prominent. Broadcasting companies in the auton- 
omous communities in Spain are generally extremely aware of the issue of linguistic 
diversity and language contact phenomena, and try to keep variants apart as neatly 
as possible with sometimes remarkable expenditures on monitoring devices (cf. 
the contributions in Kabatek and Pusch 2009). This applies both to the contrasts 
between regional and national language and between dialectal varieties of the re- 
gional tongues. 


3.5. The impact of migration and globalization on language contact 


Internal (national) migration has had important effects on the linguistic composi- 
tion of the bilingual regions on the periphery of the Spanish state during many dec- 
ades of the 20th century, generally strengthening the presence of the national lan- 
guage. This holds particularly for the industrial centers in Catalonia, Euskadi 
and, to a somewhat lower degree, Asturias. Galicia, as well as many other regions 
in Spain and Portugal, and Southwestern France have been zones of (internal or 
external) emigration rather than immigration. This has changed drastically with the 
development of mass tourism from the 1960s onwards and with the so-called 
‘new’, globalization-driven migration initiated at the end of last century. These 
waves of migrants are obviously very diverse concerning origin, socio-economic 
background, and motivation, but they converge in creating even more complex 
scenarios of functional language distribution and language contact. 

The new migrant minorities in Spain and Portugal consist, roughly speaking, of 
groups from Latin America, Western and Northern (Maghrebian) Africa, and East- 
ern European and Balkan countries. Within these groups, migrants from Ecuador 
and Colombia, Morocco, and Romania are demographically dominant in Spain, 
whereas in Portugal the most prominent countries of origin are Cape Verde and 
other former African colonies, Ukraine and — although with some distance — Bra- 
zil. In France, by far the most immigrants are of Maghrebian/African origin; Latin 
American immigration is insignificant and Asian immigration is more important 
than on the Iberian Peninsula. The preferred destination of immigrants to Portugal 
is the Lisbon capital region, whereas in Spain immigrants cluster in Madrid and 
along the Mediterranean coast, with a particularly high concentration in the Cata- 
lan-speaking regions. Many, if not most Africans migrating to France and Portugal, 
as well as Latin Americans coming to Portugal and Spain, know and master the 
national language of their host country either on L1 or on comparatively high com- 
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petence levels, a fact that despite undeniable dialectal divergences facilitates com- 
munication and integration. There is a risk that matters will evolve less smoothly 
in the bilingual regions, especially when immigrants ignore the multilingual reality 
of these areas upon their arrival. This is why Catalonia, for example, tries to give 
information on this issue in advance through information campaigns in the most 
important countries of origin of the immigrants, an effort that is frequently in vain 
due to the proportion of clandestine immigration. The picture of language attitudes 
and linguistic behavior of Spanish-speaking Latin Americans who settle in the bi- 
lingual autonomous communities in Spain remains unclear for the time being, and 
results from research on this issue are sometimes contradictory. However, as far as 
Catalonia is concerned, it seems that the stereotype of the Spanish-speaking immi- 
grant who is reluctant or totally unwilling to adapt to and communicate in Catalan 
is not (or no longer) valid. Latin Americans, as well as the other immigrants con- 
sidered here, come to Spain in search for better economic conditions. Thus they 
all realize that knowledge of the regional language enhances the opportunities to 
achieve this goal, which may in turn justify the additional effort of getting ac- 
quainted with it. At least among Latin American students and youngsters, attitudes 
towards institutional and social bilingualism and towards Catalan seem to be posi- 
tive (cf. Huguet and Janes 2008). For immigrants with other linguistic back- 
grounds, especially those speaking primarily African languages and French, learn- 
ing Spanish or Catalan represents an equally huge extra effort, so that these persons 
are less biased concerning the initial language of integration into the host society. 
Depending on age, place of residence, and the professional situation, these persons 
start to integrate via Catalan, Spanish or both at the same time (cf. Juarros-Daussà 
and Lanz 2009). Not surprisingly, many school teachers in the bilingual regions 
have a particular empathy towards language diversity and welcome the presence 
of additional languages in the classroom as an enrichment (Palou and Fons 2009). 
If such attitudes, in turn, are communicated to immigrant children who (also) speak 
rather ‘exotic’ (regional, tribal or indigenous) languages from their country of ori- 
gin, a feeling of pride can arise which leads to beneficial effects on their integration 
into the bilingual host society and a full acceptance of the regional language as a 
means of normal daily expression. Empirical findings show, however, that since 
these students adapt gradually to the linguistic behavior of their autochthonous 
classmates, an increase in the use of Spanish takes place by the time they reach 
adolescence. 

The situation is different for immigrants from Central and Northern Europe, 
who migrate to the southwest of the continent either as (mostly young) profes- 
sionals or as retirees. These waves of migrants are particularly strong in the case 
of Mediterranean Spain (where residents of British, German and French origin pre- 
dominate) and Southern Portugal (with Britons, Germans, and Dutch residents), 
while (as yet) being of marginal importance in Southwestern France. Generally 
speaking, the ethnic and socio-economic background of these migrants gives them 
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more freedom to decide if, and to what extent, they wish to integrate linguistically 
into the receiving society. These migrant groups frequently organize in relatively 
hermetic, ghetto-like communities with their own service and communication net- 
works, and the above-mentioned professionals usually find their customers among 
(mainly the elderly of) their compatriots. For professional activity in a foreign 
country, a basic knowledge of (one of) this country's language(s) would seem 
unavoidable, but the increasing availability of English as a lingua franca in 
communicative exchanges with the autochthonous population must be also taken 
into account. As far as 'vacational' and retirees' immigration is concerned, 
these migrants are generally perceived as indifferent towards language questions 
and linguistic integration, and as hostile to any political measures implying the 
obligatory use of regional languages. But actually, little research has been carried 
out in this field so far and the reality might well turn out to be quite different than 
this general perception suggests (for some first case studies in Spain, cf. Turrell 
2001). 


See also the following chapters in this volume: 10 by Haase, 24 by Troester-Mutz, 
25 by Extra, 29 by Darquennes, 38 by Ramat and 44 by Wright. 
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1. Introduction 


Our knowledge of language contact in South-Eastern Europe (SEE) in the pre- 
modern era is fragmentary and to some extent based on speculation, but it was 
undoubtedly widespread and intense and led to the rise of the so-called “Balkan 
Linguistic League” (Balkansprachbund). In Macedonia, for instance, in the pre- 
national period adult men typically spoke a number of different languages or dia- 
lects. In SEE as elsewhere in the world, dialects of the same language can influence 
one another: peasants from different villages meet at the market, on the way to the 
market and at fairs and at festivals and this sometimes leads to the creation of 
interdialects. In SEE under Ottoman rule, Turkish was the language of the towns, 
especially in commerce and administration, as well as that of some villages. Turkish 
retained its prestige among urban dwellers well into the 20th century and even up to 
today in Macedonia and Kosovo (Friedman 2002). In the Bulgarian-speaking ter- 
ritories there was contact between Bulgarian as the indigenous language, Greek as 
the language of trade and culture and the language of instruction in secular schools 
(Goebl etal. 1997: 1489; Hinrichs 1999: 812) and Turkish as the language of the Ot- 
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toman administration. Greek was also the language of diplomacy in the early period 
of the Ottoman administration (Hinrichs and Büttner 2000: 233). Turkish colonists 
in different parts of the Balkans would have communicated with their indigenous 
fellow peasants partly in Turkish and partly in the indigenous dialects. In Albania, 
Greek was the liturgical language for the Albanian Orthodox and the language of 
the Greek or Hellenized commercial upper class of society. Greek was also the lan- 
guage of instruction for Orthodox Albanian children (Goebl et al. 1997: 1455). 

There was also intensive language contact where language territories met, e.g. 
Srb brindza ‘cheese’ in the Banat < Rum brînză (Solta 1980: 156-157; Popovic 
1960: 572, 590—591). Apart from lexical borrowings, Vidoeski (1993) notes a ten- 
dency for the phonemic systems of Macedonian, Albanian and Greek dialects in 
border areas to converge. West-Roumelian (i.e. western-Balkan) Turkish dialects 
as spoken by Islamicized autochthonous populations display changes due to this 
language contact on all levels — phonological, morphological, syntactic and lexical 
(Goebl et al. 1997: 1448). 

Northern-Macedonian dialects were heavily influenced by Serbian when 
Macedonia was part of the mediaeval Serbian empire (as evidenced by, e.g., kukja 
‘house’ or sudija ‘judge’, rather than kašča, sadija, which we would expect ac- 
cording to East-South-Slavonic historical phonology). 

The different languages of SEE were also influenced by non-SEE languages: 
Slovenian and Croatian were heavily influenced by Austrian German, Bulgarian 
by Russian. In the 19th century most SEE languages were influenced by French. 
The different languages of SEE were codified in the 19th and 20th centuries, 
usually following a purist model (the exception was Vuk KaradZi¢’s codification of 
Serbian), but this often meant replacing old borrowings with new ones considered 
to be more acceptable, e.g., in Bulgarian Turkish borrowings were replaced with 
Russian (or Church-Slavonic) ones (cf. Damerau 1960: 21-23), in Rumanian Tur- 
kish and Slavonic borrowings with French, Italian or Latin lexemes (bursd, elev, 
jurnal, amical, deposeda, comerciant — Goebl et al. 1997: 1467-1468). Slovenian 
and Croatian often substituted Czech or Russian borrowings for German ones (e.g. 
časopis ‘magazine’ or naslov ‘title’ < Czech or točka ‘point; full stop’ < Russian). 
Many activists of the "national revival" in various SEE countries and others spent 
their formative years in a Russian-speaking environment such as Odessa or St Pe- 
tersburg and introduced many Russian loans into their own speech. On the other 
hand, German-speaking universities attracted many students from SEE in the 
19th and early 20th centuries. All the languages of SEE have adopted so-called 
"internationalisms", e.g. Rum /aboratoriu, infecție, vertebral, concret, Cro and Srb 
abolicija, antika, indikator, individualizam (Hinrichs 1999: 547). In a field like 
philosophy we have borrowings of Latin or Greek terms, e.g. via French, as well as 
calques e.g. from German. 

With the triumph of the nation state on the French model and the disintegration 
first of the multilingual empires and then of multinational states such as Yugosla- 
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via, there is less of the traditional language contact. (On the latest developments cf. 
Hinrichs and Büttner 2000: 69-82.) Nonetheless, there are still linguistic minor- 
ities in all states in SEE, e.g. in Rumania 2 million Hungarians and 100,000 Ger- 
mans (Goebl et al. 1997: 1463; 1473). Thus, one-sided language contact has 
replaced the traditional multilateral language contact. This one-sided language 
contact normally leads to massive loans from the majority into the minority lan- 
guage, as, for example, in the Macedonian dialects in Greece, which use Greek lex- 
emes to denote anything that goes beyond the realm of folklore and agriculture 
(Hill 1991: 168). Today, on the other hand, there is throughout SEE intensive “in- 
termediary borrowing" (Filipović 1988: 347) from English, which results in a huge 
number of (mostly unassimilated) loans (cf. 4.2 below). 

This article will concentrate on loans as evidence of language contact. Lack of 
space precludes citing all sources in detail. 


2. The history of language contact in SEE 


2.1. The Roman conquest 


The Roman conquest began when the Romans expelled the Illyrians from the Ad- 
riatic coast in 229 BC and continued for some 300 years until Dacia was established 
as a Roman province in 106 AD. Roman roads and commerce as well as schools fa- 
cilitated Romanization. Romanization was intense along the Via Egnatia from Dyr- 
rachium (Durrés) via Thessalonica to Byzantium (Constantinople), where veterans 
were settled in 42-35 BC. The administrators would have used classical Latin, but 
the soldiers, colonists and merchants spoke sermo vulgaris/plebeius/rusticus, so- 
called “Vulgar” Latin (cf. Desnickaja 1990: 125; Hinrichs 1999: 545—584). In con- 
tact with Latin, the prestige language, no doubt after a period of bilingualism, the 
Old Balkan languages (Illyrian, Thracian, Dacian, Brygian, Paeonian, Macedonian 
[not to be confused with contemporary Macedonian, which is a Slavonic lan- 
guage]) disappeared, except for Albanian, which continues either Illyrian or Thra- 
cian, although it was heavily influenced by Latin (Hinrichs 1999: 277-299). On the 
old Balkan languages cf. Solta (1980: 11—39). Otherwise, the speakers of these Old 
Balkan languages shifted to Latin, spawning new languages such as Rumanian, 
Arumanian (Macedo-Rumanian, Vlach), Megleno-Rumanian and Istro-Rumanian 
(Solta 1980: 64-108; Hinrichs 1999: 149-150; 551—552). The many elements that 
are common to Albanian and Rumanian indicate a period of mutual contact at 
this time. Another Romance language, Dalmatian, was spoken along the western 
(Adriatic) coast of the Balkan Peninsula and on the islands from the early middle 
ages, but over the centuries its speakers shifted to Croatian (on Dalmatian cf. Solta 
1980: 142-154; Muljačić 2000). Dalmatian words in Croatian and Serbian include 
sapun ‘soap’ « saponem, racun ‘bill, account’ « rationem and banja ‘bath’ « balneae 
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(Solta 1980: 155). The Croatian coastal and island dialectal govorin (1st p. sg. « go- 
voriti ‘to speak’) or osan ‘eight’ is attributed to language contact with Dalmatian 
(Solta 1980: 155—156) or Venetian (cf. 2.7 below). 

The many supposedly substratum words in Rumanian include baci 'cheese- 
maker; dairyman; herdsman’ (Arum baciu, Megleno-Rum bats, Alb bag), cătun 
‘village’ (Alb katund) and minz ‘young animal, foal’ (Arum mändzu, Megleno- 
Rum mondz, Alb més — Solta 1980: 39; 46-47; 86; Popovic 1960: 199). The oldest 
Slavonic loanwords were apparently adopted through the intermediary of the Da- 
cians, e.g. baltd ‘marsh, bog’ from pre-9th-century Proto-Slavonic *bolto, or rind 
reflecting the nasal vowel in Old Bulgarian red». More recent borrowings reflect 
contacts with Serbian speakers in the Banat, with speakers of Hungarian and Ger- 
man in Transylvania, with the East Slavs in Moldova, Maramures and north-east- 
ern Transylvania, and with Turkish speakers in Muntenia and Moldova (Des- 
nickaja 1990: 167). The basic Christian terminology in Rumanian is of Latin origin 
or was transmitted via Latin from Greek: bisericd ‘church’ « basilica < Grk 
vasiliki, a boteza ‘to baptize’ « baptizdre < Grk baptizein, credintd ‘creed’, crestin 
‘Christian’, cruce ‘cross’, inger « angelus, pdcat « peccatum etc. (Solta 1980: 90, 
Hinrichs 1999: 545—546). Later Old Church Slavonic served as a mediator for 
words from Greek (aghiasmd ‘holy water’, dascăl *deacon') but there are also 
genuine Slavonic words such as sfint ‘saint’ and duh ‘spirit’ (Solta 1980: 90). By 
the 12th century we see such Graeco-Slavonic loans from many fields, and by the 
16th century we note the formation of court terminology under Slavonic influence 
(Goebl et al. 1997: 1460; Hinrichs 1999: 261-276). 

Whether Albanian is descended from Illyrian or Thracian is a vexed question 
due to the paucity of our knowledge of these two languages (on the Albanian 
ethnogenesis cf. Solta 1980: 109-123; Hinrichs 1999: 281—283), but loanwords 
can give us hints about the original homeland of the Albanian language. There 
are some 30 ancient-Greek loanwords in Albanian, of which some relate to 
north-western Doric Greek, e.g. mokér, mokén ‘millstone, grindstone’ « machaná 
(Ionian-Attic mechane), Tosk drapër, Geg drapén ‘sickle’ < * drápanon (Ionian- 
Attic drépanon), mollé ‘apple’ « málon (Ionian-Attic melon). There were ancient 
Doric colonies on the Albanian coast, such as Lissos (Lezha) and Epidamnos (near 
Durrës [Aveedy.ov]), and there were also settlements further inland — Amantia, 
Buthroton, Byllis, Fenike. It is natural to assume that these Doric items were bor- 
rowed into Albanian (that is, Illyrian or Thracian) in contact with the Doric Greeks 
living in these colonies. 

Latin was the official language of the Byzantine Empire until 7th century and 
remained the language of justice and the military even after that. Some 3000 Latin 
terms penetrated Byzantine Greek, of which 207 are still alive in Modern Greek 
(Desnickaja 1990: 129; Hinrichs 1999: 150-151; 565—568). Solta (1980: 164) 
mentions kitron ‘citron’ < citrus, karina ‘keel’ < carina, skúpa ‘broom’ < scopa 
and makelláris *butcher' « macellarius. The Crusades brought renewed Latin 
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influence (cf. Hinrichs 1999: 834—835). Many Latin words in the Balkan languages 
were borrowed via Greek. 


2.2. The arrival of the Slavs 


At the beginning of the 7th century Slavonic tribes crossed the Carpathians and the 
Sudetan passes into SEE, reaching the Adriatic around 700 (Desnickaja 1990: 133; 
Hinrichs 1999: 568). In the 7th and 8th centuries the Slavs settled throughout 
Greece, but most of the area was re-Hellenized from 9th century. Slavs settled in 
central and south-eastern Albania from the 6th and 7th centuries, but most of this 
area was Albanianized from 16th century onwards (Desnickaja 1990: 52). 

The Slavs encountered Latin and Greek speakers and — as we can assume — a 
partly Romanized form of Illyrian or Thracian, the ancestor of Albanian. The Slavs 
adopted many elements of the superior Roman civilization, as evidenced in the 
terminology relating to housing that was borrowed from Latin, such as Srb Cro 
Blg komin ‘chimney, hearth’ < caminus, or gastronomic terms such as Blg ocet 
‘vinegar’ < Lat acetum. 

There has been constant language contact between Slavs and Albanians from 
the time when the Slavs arrived in SEE up to the present day, which has led to a 
considerable number of Slavonic loanwords in Albanian, the oldest being from 
Proto-Slavonic, e.g. druvar *woodcutter' < *drovarb (Blg dial drbvár, Srb drvar), 
balté ‘mud’ « *bolto ‘swamp’. 


23: The arrival of the Proto-Bulgarians 


In the 7th century the Proto-Bulgarians under Khan Asparukh (Isperikh) crossed 
the Danube from the north into what is now Bulgaria. Within a few generations 
they were assimilated by the Slavonic majority. Only a few Proto-Bulgarian loan 
words have survived in contemporary Bulgarian, attesting to the relatively small 
number of the Proto-Bulgarian élite and hence the relatively restricted scope of 
language contact in this case. In Old Church Slavonic (Old Bulgarian) let us note 
boljarin® ‘boyar’ , kap® ‘idol, picture’, kumiro ‘idol, picture’, Ervtog® ‘dormitory, 
room’, as well as the agentive suffix -cii (later borrowed again into modern Bul- 
garian from Ottoman Turkish), as in konigocii ‘scholar’. Some supposedly proto- 
Bulgarian words in contemporary Bulgarian are san ‘rank’, bäbrek ‘kidney’, biser 
‘pearl’ and beleg ‘note’ (Mirtev 1963: 73-74; Hinrichs 1999: 151). 


2.4. The arrival of the Magyars 


In 896 the Magyars (Hungarians) moved into Pannonia, assimilating the Slavonic 
population and adopting many words from them. The many Hungarian place 
names of Slavonic origin such as Debrecen or Czongrád reflect this substratum 
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(Popovié 1960: 111), as do the many Slavonic loan words in contemporary Hun- 
garian such as asztal ‘table’ < stol», bab ‘beans’ < bobs and palota ‘palace’ < 
polata. 

The Magyars moved into Transylvania in the 10/11th century (Goebl et al. 
1997: 1479). The Hungarian-Rumanian symbiosis has led to a host of linguistic 
interferences and borrowings, e.g. Rum (Transylvania) sobd ‘room’, (Walachia 
and Moldau) ‘stove’ < Hng szoba ‘room’ (< OHG stuba ‘heated room; stove’ < VL 
*extufa or Proto-Slavonic *(jb)stoba < OHG). This word was borrowed by Serbian 
and Turkish and wandered as far south as Macedonia and Greece (‘room’ in stan- 
dard Macedonian, ‘stove’ in central and southern Macedonian dialects, ‘stove’ in 
Modern Greek). 

The earliest Rumanian loans in Hungarian were borrowed from transhumant 
shepherds and are registered at the beginning of the 14th century: cserge 'carpet, 
coarse woollen cloth’ (Rum cergd), furulya ‘pan flute’ (Rum fluerd). Since 
Transylvania was incorporated into Rumania the Hungarian minority have been 
speaking their language in a Rumanian-language environment, which has led 
to many interferences such as the semantic calque kékek 'aubergines' (lit. “blue 
[vegetables]’) < Rum vinete or pätlägele vinete. Contact with Rumanian has led 
also to phonetic changes in Transylvanian Hungarian. 

The Magyars have lived in contiguity with Slavonic peoples ever since they 
moved into Pannonia in the 9th century. Loans from Serbian such as gatya ‘pants’ 
bear witness to this longstanding language contact. Hungarian words in Serbian 
include čizma ‘boot’, soba ‘room’ (orig. from Slavonic), lopov ‘thief’ and gazda 
‘wealthy peasant’ (orig. from Slavonic). Since the Croatian heartland was ruled 
from Budapest for a long time, Hungarian loans are more numerous in the Kajka- 
vian dialects, e.g. orsag ‘empire, state’ (Popovic 1960: 602-603). On Rumanian- 
Hungarian language contact see Goebl et al. 1997: 1478-1486. Hng város has been 
borrowed into most of the languages of SEE, reflecting the widespread adoption of 
the Hungarian concept of 'town'. 


2:5; Byzantine and Modern-Greek influence on Bulgarian and Serbian 


The first Slavonic literary language was Old Church Slavonic, created by Saints 
Cyril and Methodius for the Moravian Mission. The oldest Slavonic Christian 
terminology is from Latin via Old High German, e.g. oltar (« OHG altári « Latin). 
Old Church Slavonic was based on the Old Bulgarian dialect of the Salonika 
(Thessalonica) district but was heavily influenced by Byzantine Greek. Bulgarian 
and Serbian Christian terminology is predominantly of Byzantine Greek origin, 
e.g. Srb davo ‘devil’, andeo ‘angel’, manastir ‘monastery’, iguman ‘abbot’ and 
calques such as Blg blagoslavjam ‘to bless’ « evloyín (eulogein), blagodarja ‘to 
thank’ « efcharistin (eucharistein) or blagodetel ‘benefactor’ « everyétis (euer- 
gétés) via Old Church Slavonic. Srb Blg vladika ‘bishop’ < Óóespótis is a semantic 
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calque (cf. also Hinrichs 1999: 831). However, most of the Greek loan words in 
Bulgarian, Macedonian and Serbian are from the popular language and testify to 
the intensive language contacts between Slavs and Greek speakers, e.g. Srb pana- 
dur ‘fair’, angarija ‘forced labour’, ceramida ‘tile’, Blg evtin ‘cheap’, Mcd oti ‘be- 
cause, for’, kata(den) ‘every (day)’, ela! ‘come!’ (suppletive imperative of the 
verb dojde ‘to come’) or the popular interjection more, mori < more, which have 
developed the allegro forms bre, vre (Popovic 1960: 598; Miréev 1963: 65; Ko- 
neski 1965: 183-184). 


2.6. Latin and German influence 


The Croats were converted to Christianity by the Roman church (642-731). Like 
the Serbs they received Germanic-Romance terms such as post and oltar. Apart 
from a short interlude in the second half of the 9th century, when the Byzantine 
mission succeeded in supplanting Rome for a short time (Hinrichs 1999: 832), the 
Croats have been linked with western and central Europe. Latin was the official 
language until 1847 (at times also German and Hungarian — Hinrichs 1999: 818), 
while the lingua franca in Central Europe was German — cf. in contemporary 
standard Croatian kofer ‘case’, cigla ‘tile’, logor ‘camp’, though many such Ger- 
manisms have been replaced by puristic calques (Golubovié 2007; Rammelmeyer 
1975) or borrowings from Czech or Russian. But in Bulgaria, too, German in- 
fluence was strong (šuko ‘power point’, polir ‘polish’, kulturtreger ‘person of cul- 
ture', cf. Parveva-Kern 1999). 

The Hungarian kings Geisa II and Andrew II settled German speakers in 
Transylvania in the middle of the 12th century, which led to the establishment of 
some 250 German-speaking towns and villages. German-speaking colonists were 
settled in the Banat following its reconquest by Austria at the end of the 17th and 
the beginning of the 18th century. When the Ottomans were driven out of the area 
between the Danube and the Transylvanian Carpathians German-speaking colon- 
ists were settled there by the Austrian military under Emperor Charles VI from 
1716/18, under Maria Theresia from 1744 and under Joseph II from 1782; they 
came mainly from the Upper Rhine (Swabia), but also from other areas (Styria, 
Tyrol, Bohemia, Hesse, Rhine Palatinate). The Austrian military settled German 
speakers in Bukovina after it was liberated from the Ottomans in 1774. German in- 
fluenced all languages in the area. In Bukovina, German was not only the language 
of administration but also served as a lingua franca for Ukrainians, Rumanians and 
Serbs. German lexemes were borrowed from the semantic fields of the military, 
schooling, jurisprudence, trade, railways, machinery etc., mainly during the time 
of the Banat military border area (1764-1873), where German was the language 
of command, but also from the beginning of the colonisation. Examples from 
Rumanian (dial.): l'enunk < Lóhnung, Silbu6k < Schildwache, grumbük < Grund- 
buch, firáng « Vorhang. On the other hand, the German dialects were also in- 
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fluenced by the languages of SEE, e.g. in WeiBkirchen (Bela Crkva) from Serbian: 
katsihozn ‘underpants’ < gaće, kolá:c(n) ‘cake’ < kolač, petjar ‘good-for-nothing’ 
< becar; from Hungarian: ra:mpas ‘new wine’ < rámpás, képanek ‘sheepskin coat? 
« kópónyeg; and from Rumanian: pasók traim ‘to make jokes’ < bate joc ‘to make 
fun of’, nubün ‘good-for-nothing’ < nu-i bun (cf. Goebl et al. 1997: 1413-1414; 
1470-1486). 

In Transylvania, contacts between Rumanian and Saxon speakers led to bor- 
rowings into Rumanian in the following semantic fields: mediaeval urban culture 
(pirgar ‘member of urban administration, councillor’, turn ‘tower’ ), carpentry (joa- 
gdr ‘type of saw’, şindrilă ‘shingle’), animal husbandry (şură ‘shed’), gardening 
and agriculture (ciuhd ‘scarecrow’, strut ‘bouquet of flowers’) (Desnickaja 1990: 
166). Mining terminology in most of SEE reflects the influence of Saxon miners. 


2.7. Trade 


Venice was a major power in SEE in the Middle Ages and early modern times. Ve- 
nice held Crete, Cyprus and points in the Peloponnesus and the Aegean islands 
until 1714, when, under the terms of the Treaty of Passarowitz, the area was re- 
turned to the Ottoman Empire; but Venice held on to the Dalmatian coast and the 
Ionian Islands, which still left her the strongest power in the Adriatic. Venetian dia- 
lects were spoken along the Eastern Adriatic coast until the 20th century and they 
influenced Slavonic speech in phonology, morphology and syntax (Bidwell 1967). 

Merchants from SEE set up offices in Venice and they and Italian (Venetian) 
traders brought many goods to SEE. The names of these goods passed into the local 
languages, even the word roba 'goods' in Slovenian, Serbian and Croatian, Grk 
róba (Alb rrobé ‘cloth’). Imports into SEE by Ragusan merchants resident in such 
towns as Novi Pazar, Skopje and Janjevo included textiles, sugar, spices and exotic 
manufactured goods such as mirrors. Arumanian has a number of Italian (Vene- 
tian) borrowings, which, however, passed through Greek, the language of com- 
merce and lingua franca in the Balkans: giubbone /Venetian zipón > modGrk 
zipuni > Arum (ipune, cf. Srb zubun, Rum zdbun; capelă ‘hat’ < modGrk kapéla< 
Italian cappello; carota ‘waggon, coach’ < modGrk karötsa < Italian carrozza. 
Apparently Arumanian shepherds learned the technique for producing the famous 
cäscäval (dry yellow cheese) in Italy (< Italian caciocavallo or dialectal cascaval — 
Desnickaja 1990: 215). However, the word occurs in all the languages of SEE (Trk 
kaskaval, Grk kaskavali, Blg kaskaval etc.) and it is unclear which language trans- 
mitted it to which. Note that Alb kackavall and Srb kackaval (besides kaskaval(j)) 
appear to derive from a form closer to standard Italian. Venetian institutions were 
also borrowed, along with their designations: the governing council of an Ottoman 
guild was called a lonca (Alb Ilonxha), which is derived from the Venetian loggia. 

Traders from SEE were present in other centres such as Trieste, Vienna and 
Odessa. Bulgarian traders visited the Leipzig fair in the 18th century (Parveva- 
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Kern 1999: 22), and western Europeans visited the Ottoman Empire, e.g. French 
merchants in the 16th, 17th and especially 18th century (Faroqhi 2004: 63; 70; 
145). Thus we find words in Ottoman Turkish such as magaza ‘store’ « French 
magasin (itself of Arabic origin). Trk magaza was then borrowed in turn by various 
languages of SEE (e.g. Blg maza); but note Grk mayazí « Venetian *magazin 
(It magazzino). 


2.8. The Ottoman invasion 


The Turks invaded Anatolia beginning in the 11th century. Adrianopolis (Edirne) 
fell in 1362, 1389 marked the battle of Kosovo, Ioannina and Salonika fell in 1430, 
the Albanian lands in 1457 and Crete was taken in 1669. Contacts between the 
Ottomans and the subject peoples are reflected in the numerous “Turkisms” or 
Orientalisms in the languages of SEE. For a summary of the semantic fields in 
which Turkisms occur in Bulgarian cf. Schaller (1973). Contemporary standard 
Greek has a few hundred Turkish lexemes relating to oriental cuisine (dolmdöes 
‘stuffed vine leaves’, baklavás ‘baklava’), oriental culture (kilimi ‘kelim’, midéri 
*sofa"), and a few others such as eryénis ‘bachelor’ and some that are favoured 
over their authentic Greek synonyms because of their stylistic flavour (racháti 
‘leisure, relaxation’, dalyds ‘great love’ — Goebl et al. 1530-1531). Some Tur- 
kisms occur in all the languages of SEE, e.g. dolap ‘cupboard’ (Hinrichs 1999: 
475) or in most of them, e.g. çak “precisely, even, that much, only’. Apart from the 
dialects, which naturally escaped the normativizing efforts of language planners, 
Turkish expressions are still common in the colloquial language today. However, 
many Turkisms remain as neutral elements of the lexicon, including also active 
suffixes, such as the agentive -cija/dZija in Macedonian and Bulgarian or -dzís 
in Greek (paliadzís ‘dealer in old wares’). There are also many calques, such as Blg 
dogodina ‘next year’ « seneye or Greek vyázo sto meidáni = meydana ¢ikarmak ‘to 
publicize'. 

Bosniaks being Muslims make greater use of orientalisms: avaz "voice, sound, 
clamour', today ‘newspaper’ (Dnevni avaz ‘daily newspaper’), dZehenem ‘hell’, 
dženaza '(Muslim) funeral’, ahiret ‘heaven’ and greetings such as selam alejkum 
(on meeting) and alahemanet (on taking one’s leave) rather than the traditional Srb 
Cro dobar dan! ‘Good morning’. 


2.9. The róle of the towns 


In the past the towns in SEE were polyethnic and multilingual and therefore foci of 
language contact. Contemporary nationalists assume that the towns in their new 
nation states were always dominated by their nationality — Skopje was always 
Macedonian, Plovdiv always Bulgarian, Salonika always Greek — but this is far 
from being the case (apart from the obvious distortion caused by projecting con- 
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temporary national categories into the past). When the Serbian king Milutin seized 
Skopje in the 1290s the town had a more Greek character than any of his other pos- 
sessions (which may have been why he made it his capital). Under Ottoman rule, 
there was a close connexion between urban life and the spread of the Islamic faith. 
Apart from the spahis (feudal lords) on their estates, most of the askerf (so-called 
"soldier class") — which was overwhelmingly Muslim - lived in the towns; and 
many of the institutions of the towns were bound up with those of Islam. While 
there was language contact in the villages between colonists and the indigenous 
population (e.g. in Bulgaria and Macedonia) as well as with itinerant traders and 
also between peasants and lords in different parts of SEE, the most multilateral lan- 
guage contact took place in the towns, even though the different ethnic groups 
lived in separate quarters (on Ottoman towns cf. Hinrichs 1999: 859—862). In the 
Ottoman period, Turkish dominated, especially the market place and adminis- 
tration. There existed also a Greek-speaking cultural and commercial élite, some 
members of which were of non-Greek origin and probably spoke their own lan- 
guage at home. Peasants came into the towns to sell their wares to members of this 
élite at the markets. Vlach (Arumanian) was also an important language of com- 
merce. In Croatia, the burghers of whatever ethnic origin spoke German or Hun- 
garian. In the 19th century some members of this élite were wealthy enough to send 
their sons to study in Austria, Germany or France. In Albania in the course of the 
19th century the towns were progressively Albanianized with the in-migration of 
Albanians from the villages (Goebl et al. 1997: 1455). 

Salonika, the largest commercial centre, remained a place of intensive lan- 
guage contacts (Greek, Turkish, Judaeo-Spanish, Bulgarian /Macedonian, Aru- 
manian, Albanian, Armenian) until well after World War I. One of the languages 
spoken was that of the majority Jewish population, Judaeo-Spanish (Sephardic, Ju- 
dezmo, Balkan Spanish). When Jews were expelled from Spain in 1492 most of 
them took advantage of the relative religious tolerance granted by the Sultans and 
settled in different parts of the Ottoman Empire, where they were to form compact 
communities in the major urban centres, including Sarajevo, Salonika and Istan- 
bul, absorbing the older, Greek- or Slavonic-speaking Jewish communities. It is 
estimated that by 1553 there were 20,000 Jews in Salonika (not all of them from 
Spain). At the beginning of the 20th century there were some 80,000 Jews in 
Salonika and the Ottoman authorities issued edicts in Judaeo-Spanish, as well as 
Turkish, Greek and Bulgarian. Because the Jews were the dominant element in the 
population, some non-Jews (Turks, Greeks, Bulgarians) acquired Judaeo-Spanish. 

Although the Jews lived in Salonika in their own quarters, many of them were 
merchants and Judaeo-Spanish adopted many Turkish expressions that were com- 
mon currency in the Ottoman Empire such as kolay ‘easy’ and yug < Trk güç (later 
supplanted by the French borrowings fasil and difisil), yene (Trk yine, gene) “once 
more’, mahsus ‘on purpose’ and interjections such as ayde (Trk haydi) ‘go on!’ and 
vah ‘what a shame!’ (Goebl et al. 1997: 1436; 1566; 1570). 
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2.10. Transhumance 


The Arumanians practised nomadic animal husbandry (transhumance), which 
brought them into contact with most of the other languages of SEE (Goebl et al. 
1997: 1533). Transhumance could take different forms, varying from group mi- 
grations, whereby the herdsman had no permanent settlements, moving in the 
winter and summer from the plains into the mountains over considerable distances 
(as far as Pannonia and the North Carpathians — Solta 1980: 71) along traditional 
routes (nomadic form of life), to alpine animal husbandry, whereby the livestock 
was driven up into the mountains in the summer but kept in enclosures in the 
winter. The Arumanians are thus originally mountain dwellers (Desnickaja, 1990: 
193-194). The closure of the borders with the establishment of the nation states put 
an end to this way of life (Goebl et al. 1997: 1454). Today more or less sedentary 
Arumanians may be found in many parts of SEE. Their number is estimated at be- 
tween 200,000 and 300,000. Transhumance was practised also by the Sarakatsani, 
a Greek-speaking group (Goebl et al. 1997: 1454; 1493; 1533). 


3. The Establishment of the Nation-States 


With the establishment of the nation states on the French model in the 19th and 
20th centuries language contact between national languages declined (Hinrichs 
1999: 811). The “national” language was promoted to the detriment of all others. 
With the codification of national standard languages, certain varieties come to be 
classified as “dialects” (French patois). They are stigmatized and discouraged or 
actively combated by schooling. On the other hand, there is now a new form of lan- 
guage contact — that between the standard language and the dialects. In as far as 
they survive at all, the dialects are spoken in a milieu where the standard language 
is the prestige variety and the “dialects” are subject to massive interference from 
the standard variety. Dunja Jutronié-Tihomirovié (1992) has studied the influence 
of the standard language at the syntactic level on the Cakavian dialect as it is 
spoken today in the city of Split. But this language contact can be a two-way street: 
while the dialects absorb many words from the standard language, especially due 
to the ubiquitous mass-media, to a certain extent the standard language is enriched 
by accepting words, constructions, proverbs and turns of phrase from the dialects. 
In some cases, e.g. standard Macedonian or contemporary standard Turkish, lan- 
guage planners consciously turned to the dialects to replace foreign borrowings, 
e.g. in standard Macedonian the Serbian word dogagaj and the Bulgarian sobitie 
were used for ‘event’ until the dialectal expression nastan was adopted for the 
standard language. In the case of Turkish, a dictionary of old words (Türk Tarama 
Sözlügü) was published as a source of Öztürkge “authentically Turkish" lexemes to 
replace Arabic and Persian borrowings. 
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Standard Croatian is based on the Stokavian dialect group, but it has been in- 
fluenced by the Kajkavian dialects and, to a lesser extent, by the Cakavian ones 
(see also the chapter by Voss, this volume). Since the Croatian capital, Zagreb 
(Agram), is in the Kajkavian dialect area it is not surprising that colloquial speech 
in the capital is strongly influenced by the Kajkavian substratum, in the lexicon, in 
syntax and in accentuation. Similarly, the colloquial language in Split and Rijeka is 
influenced by the Cakavian subststratum. In some rural areas educated Croats are 
bilingual in the sense that they may speak a rural dialect in informal domains. 

After World War II Josip Broz Tito re-established Yugoslavia as the only ex- 
plicitly multinational state in SEE. Each republic of the federation had its own of- 
ficial language but — as in pre-war Yugoslavia — Serbian functioned as a "lingua 
communis" and as the command language in the Yugoslavian National Army. All 
Yugoslavian citizens learned Serbian or Croatian at school and these languages in- 
fluenced the other languages in Yugoslavia such as Slovenian, Macedonian and 
Kosovo Albanian via the mass media. Most Macedonians were happy to speak Ser- 
bian (or “pseudo-Serbian” — Usikova, i.e. a mixture of Serbian and Macedonian) 
with Serbs that visited Macedonia, or when they visited federal institutions, while 
Slovenes, by contrast, tended to insist on their constitutional right to speak Slove- 
nian. Standard Macedonian has been heavily influenced by Serbian in phonology, 
syntax and lexicon, while colloquial Skopje Macedonian could be described as a 
creolized form of Serbian. On the other hand, language contact between Macedo- 
nian and minority languages in the Republic of Macedonia led to those minority 
languages adopting Macedonianisms, especially in relation to politics (e.g. rezol- 
ucija ‘resolution’, anketa ‘questionnaire’; ‘opinion poll’); administration (e.g. 
licna karta ‘identity card’); modern lifestyle (e.g. dnevna ‘sitting room’); and mod- 
ern technology (e.g. frižider 'refrigerator"). 

In modern times there is the phenomenon of gurbet (kurbet) or pecalba, which 
was typical especially for Albania, Serbia, Macedonia and Bulgaria: migrant 
workers would go abroad for some months or even years to earn money before 
returning home (Desnickaja, 1990: 50). Initially they went to other areas in the 
Ottoman Empire, even as far as Egypt, but in the 20th century they set out for the 
USA and later for Australia, New Zealand, South America or South Africa. When 
they returned they inevitably brought words with them from the new world. In 
some Macedonian villages one can hear words from English that were brought 
back by such migrant workers, e.g. bajsik ‘bicycle’ or k’e te ringam ‘TIl ring you’ 
from Australian English (Hill 1982: 54). In the 1960s such migrant workers went 
to western Europe and again they brought back some German or French ex- 
pressions that they had learned while working there. 

Romani is spoken in different parts of SEE. The Roma arrived in SEE during 
the Byzantine era and their presence is attested e.g. in Bosnia-Hercegovina since 
15th century (Goebl et al. 1997: 1436). In constant contact with the languages of 
the populations among which they live, their language has developed “national” 
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variants in different countries (cf. Kostov 1973). The Roma came into contact with 
Greek when they arrived in Anatolia and adopted old Anatolian-Greek words: 
kopal ‘wooden hammer’ « kopálin, kohinel ‘to cough’ « kochízo, kraj « kránin 
*dogwood (Cornus mas)' and others (Hinrichs and Büttner 2000: 233). 

There are Arumanian settlements in Greece and Macedonia. While the former 
are more numerous (some 60,000), they are in the process of language shift to Greek, 
while the latter group, though only some 6,000 strong, have been encouraged to 
maintain their language, which is taught in schools. The settlement area of the Meg- 
leno-Rumanians was cut by the border between Yugoslavia and Greece a century ago 
and they appear doomed to succumb to language shift to the dominant national lan- 
guage of the state in which they live (Desnickaja 1990: 224). Istrorumanian exists in 
a Croatian environment. Speakers of Istrorumanian — numbering at the most 1500 — 
are bilingual and Istrorumanian shows many interferences from the official language. 
In the past Istrorumanian was spoken over a larger area and was influenced by Italian 
and the Cakavian dialects of Croatian (Desnickaja 1990: 231—247). 

When codifying their national languages, Bulgarians and Serbs often borrowed 
from Russian (e.g. Srb činovnik ‘civil servant’, čaj ‘tea’, gord ‘proud’, iskren 'sin- 
cere’). After World War II, ‘Sovietisms’ became part of everyday reality in the 
countries of SEE: Cro Srb aktivist(a), fiskultura, kolektivizacija, Alb fizkulturé, 
plastmasé (SamardZija 1998: 137; Hinrichs and Büttner 2000: 163; Goebl et al. 
1997: 1468), Hng kolhoz ‘collective farm’, kulák ‘wealthy peasant, kulak’ (Hin- 
richs and Biittner 2000: 241). 

When Salonika was incorporated into the Kingdom of Greece in 1913, Greek 
gradually became the dominant language here so that, by World War II, the 
younger Jewish generation were fluent in Greek and there were now many Greek 
expressions in Judaeo-Spanish. Through its schools, the Alliance Israélite Univer- 
selle promoted French, and Judaeo-Spanish was significantly influenced by that 
language, in the lexicon, in syntax and even in phonology (Desnickaja 1990: 267). 
French became the second and in some cases even the first language for the edu- 
cated members of the Jewish communities (Goebl et al. 1997: 1570, 1567), while 
local communists stood up for Judaeo-Spanish, the vernacular of the workers. The 
“Worker’s Paper” appeared in Salonika in 1909 in Turkish, Greek, Bulgarian and 
Judaeo-Spanish (Amele gazetesi/ Efimerís tou ergátou/ Rabotniceski vestnik /Jour- 
nal do laborador) (Mazower 2005: 403-407). 
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4. Contemporary developments 


4.1 Minorities in the nation-states 


In Albania and Greece Arumanian is classified by UNESCO (1999) as “endan- 
gered", while Istro- and Megleno-Rumanian are classified as "seriously endan- 
gered". The Arumanians and especially the Megleno-Rumanians have always had 
particularly close contacts with Greek speakers and also with Slavonic speakers 
and have adopted many lexemes from these languages, over and above those that 
they share with Rumanian. Arumanian in Greece has adopted the Greek phonemes 
/8 0 y/, which were borrowed via lexemes such as Óáskal ‘teacher’, Oimó ‘anger’, 
yiftu ‘gipsy; blacksmith’, but are now found even in words of Latin origin such as 
Oeámin 'feminine'. The use of verb tenses and some other constructions have also 
been influenced by Greek and there are instances of semantic calques; for example, 
the verb fudzí « Lat fugere has acquired the meaning ‘to leave’ after Grk févyo 
(Goebl et al. 1997: 1537—1538), cf. Mcd bega. Megleno-Rumanian morphology 
has also been influenced by Macedonian, thus in verbs we get the 1st and 2nd per- 
son sg present forms aflum, afli$ under the influence of Macedonian čitam, čitaš. 
Megleno-Rumanian has adopted the suffix -ta from Macedonian in the adverbs 
zuu(a)ta *during the day' and searta *in the evening' (Mcd denta, vecerta) as well 
as various verb prefixes, e.g. zadurmiri ‘to fall asleep’ (Mcd zaspie). Apart from 
working in animal husbandry and commerce, the Arumanians were famous as 
caravaneers, and so it is not surprising that other Balkan languages should have 
adopted the Arumanian word sdmdr ‘pack saddle’ « Lat sagmarium ‘baggage 
carried on a pack saddle’: Grk samdri, Alb Blg Mcd samar, Trk semer. As the Aru- 
manians became sedentary, they quickly turned away from their culture and lan- 
guage. Direct contact situations — in comparison with indirect linguistic contacts — 
have had a strong impact due to the intensity of everyday encounters. Nowhere, ex- 
cept in the Republic of Macedonia, does Arumanian receive state support. Today, 
the education system in the official language of the host country, increasing urban- 
ization, industrialization, migration to the cities and the influence of the electronic 
media are leading to rapid linguistic change. Speakers of the minority languages 
borrow an increasing number of elements from the dominant language. As the mi- 
nority language is economically irrelevant it is also increasingly losing ground 
even within the family circle and is no longer indispensable for its speakers. And 
yet — like Macedonian in Greece, whose death has been “imminent” for several 
generations — Arumanian still hangs on (Sikimić and Ašić 2008: 123-149). 

The Lipovans — Eastern Orthodox “Old Believers” — have preserved their 
language, Russian, in Rumania. While children have only kindergarten tuition in 
Russian, the language is still vital in their villages. These villages are bilingual, 
but, unusually in such a situation, native speakers of the official state language, 
Rumanian, also speak Russian. (There is a similar situation in Northern Greek vil- 
lages, where Greek-speaking refugees from Asia Minor have been forced by the 


Language contact in South-Eastern Europe 423 


exigencies of everyday communication to learn the indigenous Macedonian dia- 
lect.) Because of this bilingualism, the Russian speech of the Lipovans displays 
both lexical borrowings and calques from Rumanian (Sikimić and Ašić 2008: 
151-159) 

There has long been a linguistic symbiosis in Transylvania, with Rumanian 
borrowing from German (chelner ‘waiter’, spalt ‘column’ and svaitui ‘to weld’), 
while German and Hungarian dialects have borrowed words from Rumanian, such 
as Hng zsudec, Grm Schudetz ‘municipality’< judet, Hng grenicser, Grm Granit- 
scher < gränicer (Desnickaja 1990: 163-167; Goebl et al. 1997: 1464-1472). The 
German dialects in Rumania show considerable interferences due to being spoken 
in a Rumanian environment. Take, for instance, the use of the negative particle be- 
fore the verb in the imperative: Nicht fürchte dich! vs. its postverbal position in 
Standard German Fürchte dich nicht!, also in Rumanian-Hungarian: ne fely!; and 
calques such as Hundsholz from Rum lemnul ciinelor or Sie hat ein Kind gemacht 
[‘made a child’ instead of ‘gave birth to a child’]. 

There are still Turkish-speaking minorities in some SEE countries, e.g. in Bul- 
garia, including Gagauz (Turkish-speaking Christians), and so the language con- 
tact continues, though today it is mostly one-sided, i.e. the Turkish dialects have 
changed through contact with the Slavonic-language environment. The Turkish 
dialects in Bulgaria are developing a distinction of gender due to contact with 
Bulgarian, borrowing the Bulgarian feminizing suffix -ka, e.g., in Vidin koy-qa 
‘villager, f.’, Gagauz padisax-ka ‘queen’. In the Rhodope Mountains the Turkish 
necessitative with the suffix -mali-/-meli- has been replaced with the Bulgarian 
construction, using the Turkish adverb l'azdm (standard Trk lâzım) + optative/sub- 
junctive, e.g. (ben) l’azdm alajdm ‘I have to take’ (standard Trk almalıyım). The 
sentence structure in Bulgarian Turkish and Gagauz often follows the models of 
the contact language, e.g. söylerlar Cobana gütürsin ... = Blg kazaxa na ovčarja da 
donese ... ‘they told the shepherd that he should bring ...' (Goebl et al. 1997: 
1507-1509). 

There is an Albanian minority in Attica and Biotia (Boeotia) in Greece. Their 
language — Arvanitika - is still spoken by old people, but is still changing under the 
influence of Greek, and appears headed for extinction. Some old people preserve 
forms that have otherwise been lost, showing that this is a recent development. 
Of the verb tenses, the imperfect and the conditional are threatened and will soon 
be lost altogether, while the optative mood has been replaced by periphrastic forms 
based on Greek (Trudgill 1984). 

In South-Eastern Europe today there are very few speakers of Judaeo-Spanish 
(Sephardic) left, the largest number being in Istanbul, but many Sephardic-speak- 
ing Jews embraced wholeheartedly the appeals by the Turkish government to speak 
Turkish (Vatandaş, Türkçe konus!), some going even so far as to speak only Tur- 
kish with their children. The Jewish weekly Salom has been published in Turkish 
since 1984 and contains only one page in Judaeo-Spanish. The last Jewish paper in 


424 Peter M. Hill 


French, Le journal d'Orient (founded in 1917), ceased its publication in 1971. The 
Istanbul variety of Judaeo-Spanish — especially the language used in the Judaeo- 
Spanish press, in novel writing etc. — was termed judéo-fragnol by Séphiha 
because of its Gallicized style. The more Hispanic variety of Judaeo-Spanish re- 
mained the language spoken by the lower classes. However, today Judaeo-Spanish 
may be heavily influenced by Turkish, as evidenced by utterances such as no kale 
karisterearse [< Tk karıştırmak] en meseles [< Tk mesele] del hukumat [< Tk hü- 
kümat], which, in a “purer” form of Judaeo-Spanish, would be no kale mesklarse 
en kestiones [« Fr question] del governo "One should not interfere in government 
affairs" (Goebl et al. 1997: 1558; 1568-1570). 

Of course, apart from the influence of the official standard language, minority 
languages may be influenced by dialects of the standard language and by other mi- 
nority languages: the Slavonic dialects in Albania and Greece show influences 
from Arumanian, as in gu vidu na stefo ‘I saw Stefo', where the accusative is indi- 
cated by a preposition (Sikimić and Ašić 2008: 107-121). 

In Bulgaria there are still both Daco-Rumanian and Arumanian dialects spoken 
by immigrants that arrived some 100-150 years ago. They include Rumanian- 
speaking Roma called Rudari. As well as borrowing lexical items from Bulgarian, 
they have introduced Bulgarian syntactic constructions, e.g. a construction with 
the preposition /a/a to express possession: rdcovoditel la ansambäl ‘director of an 
ensemble'. Code-mixing, switching and borrowing suggest that these dialects are 
endangered (Sikimić and Asié 2008: 161-172). 

In Croatia and Serbia the Rumanian-speaking Roma — some 10,000-13,000 in 
Croatia — are referred to as Ba(n)yash. Recent fieldwork suggests that group cohe- 
sion is still strong and their language still vital, despite extensive borrowing from 
the contact dialects (Sikimić and ASié 2008: 173—225). The Karavlachs in several 
scattered enclaves in the north of Republika Srpska in Bosnia and Hercegovina are 
a similar group. Their vernaculars have never been supported by the school system 
or the media and are today restricted to the domestic domain. Here too, code-mix- 
ing, switching and borrowing suggest that these dialects are endangered (Sikimić 
and Ašić 2008: 227-246). 


4.2 Anglomania 


While the minority languages have been under pressure, i.a. from the official lan- 
guages of the host states, the latter are now under pressure from English. Since 
1989 there has been a reaction against everything that went before that iconic date, 
especially amongst the young. There is now ubiquitous contact between all the 
languages of SEE and English. While pop music was a source of Anglicisms even 
before 1989, it is now impossible especially for young people in SEE to avoid con- 
tact with English, for instance when using computers. But the English loanwords 
in Macedonian cited in Hinrichs and Büttner (2000: 149) reveal other sources: 
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big-mek (US fast-food outlets), bodibilder (fitness centres), market, vokmen 
(US-dominated consumer culture), besides hardver. For Albanian, Lloshi men- 
tions cupa-cups, kesh, minimarket, parking, piceri, rep, repist, top model, xhel, 
akomodim, axhendé, implementim, input, killer, lidership, sponsor, and from the IT 
realm: sejvoj ‘to save’, klikoj ‘to click on’, kompjuterizoj ‘to computerize’, mouse, 
printer, fajl, disketé, sidi ‘CD’, bekap ‘backup’. Because journalists are constantly 
translating English-language sources in great haste there are frequent loan trans- 
lations, e.g. Alb filluan armigesite instead of veprime luftarake or pérleshje ushta- 
rake ‘hostilities’ (Hinrichs and Büttner 2000: 168-169). Some new universities 
that teach partly or exclusively in English have been established in SEE since 
1989 — the American University in Bulgaria or the State University of Tetovo — 
so that many people in SEE have acquired English as the language of scholarship 
and may use a scholarly register in their native language that reflects this English 
"substratum". Moreover, English has become the lingua franca for educated people 
from different parts of SEE, replacing earlier lingue franche such as Turkish, 
Greek, German or French. 


4.3 Other influences 


Albanians traditionally watch Italian TV (tolerated even during the Hoxha era). 
Recent loanwords from Italian are ambientalist, asikuracion, disbursoj, gomone, 
handikapat, kantautor, karizmatik, penitenciar, populizém, programator, skaf, ske- 
diné, tanxhente, traget. There are also hybrid Italian-French borrowings such as 
kondicioner ‘air-conditioner’ « Italian condizionatore + French conditionneur 
(Hinrichs and Büttner 2000: 169). 


4.4 The resurgence of the Turkisms 


When the languages of SEE were codified in the 19th and 20th centuries, most Tur- 
kisms, e.g. in Bulgarian, were deliberately replaced with borrowings from Russian 
or western languages. These Turkisms were then relegated to the substandard 
sphere but have now resurfaced as an extremely effective stylistic device (Hinrichs 
1999: 824). When the Bulgarian PM Kostov presented his economic plans to the 
IMF, the headline in one Bulgarian newspaper read: Askolsun (Tk Askolsun!) 
‘Good on you!’ Or in a leading article deriding the Bulgarian attitude to tall pop- 
pies: Kato sa tolkova akdlii, da se opravjat sami! ‘If they're so clever, let them 
take care of themselves!’ (Tk akıllı). 
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5. Conclusion 


With the creation of the nation-states in the 19th century and the decline of the 
traditional way of life in SEE in the 20th, multilateral language contact has been 
replaced by one-sided language contact and “intermediary borrowing": many 
ethnic groups have become “national minorities" and their languages experience 
one-sided influence by the official state language, while all languages in SEE have 
adopted many English expressions via the mass media. On the other hand, in line 
with the "return of history", there has been a resurgence of the Turkisms. 


See also the following chapters in this volume: 5 by Hansen, 9 by Kallulli, 11 by Tsan- 
galidis, 16 by Tomić, 23 by Breu, 27 by Woolhiser, 41 by Johanson and 43 by Voss. 
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1. Introduction 


Linguistic minorities are typical for many parts of Europe. In fact, European coun- 
tries without any traditional minority language are rare exceptions, with the at- 
tribute "traditional" referring to autochthonous populations or to groups of people 
having lived in their actual territories for centuries and excluding recent immi- 
grations of individual persons or small groups having come to their actual places of 
residence in searching a job or for other political or social reasons since the 19th 
century. We differentiate between borderline minorities and linguistic enclaves (is- 
lands). In the first case, the (rather heterogeneous socio-political) status of “minor- 
ity" language is exclusively due to a political border passing across the language 
area in question, with the same language being possibly a majority language on the 
other side of the border. Examples are Italian and French in Switzerland, (Slavo-) 
Macedonian in Greece, or the German varieties in the Alsace (France) and in South 
Tyrol (Italy). In all these cases, linguistic isolation is only relative and clearly dif- 
ferent from the situation of linguistic enclaves, with their often centuries-long iso- 
lation from genetically closer varieties and complete domination by the majority 
language. Instances of such linguistic enclaves are Aromanian or Arvanitika in 
Greece, Catalan in Sardinia, Greek in Corsica or Polish in Romania, to name only a 
few. The individual sociolinguistic situations of Buropean minority languages are 
often much more complicated than this simple bipartition implies. Even linguistic 
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enclaves without any genetically related majority language exist, as in the case of 
Basque in Spain and France, incidentally also a special case of a borderline minor- 
ity, as it is a minority language on both sides of the border. 

This chapter will solely be concerned with Slavic and Albanian in linguistic en- 
claves in Germany, Austria, and Italy, i.e. enclaves with majority languages of gen- 
etically different families.! In the case of the Slavic minorities considered, we are 
dealing with genetically related languages of the same language family as replica 
languages in contact with different model languages. This provides us with a ter- 
tium comparationis when trying to explain different results of possibly contact-in- 
duced developments, allowing us, in principle, to attribute them to the structure of 
the individual majority languages. Vice versa, in the case of Albanian and Slavic 
minorities in Italy, it will be possible to attribute differences in the contact-induced 
developments to the structures of the minority languages themselves, as they share 
the dominant model of the same majority language. In all cases it must, however, 
be borne in mind that we are never concerned with an absolute historical identity of 
the minority languages involved in the different contact situations, with the same 
holding true for the majority languages. On the one hand, the Slavic languages in 
question have a different West or South Slavic origin; on the other hand, not only 
Standard Italian and German, but also different local varieties must be considered 
as model languages. As for the contact phenomena, the focus of this chapter will be 
on the grammatical level, leaving aside the more frequently investigated lexical 
and phonetic influences, and we will deal not only with contact-induced change but 
also with the question of "linguistic pertinacity" (persistence), i.e. why in some 
cases an expected change did not happen. 

There are two main types of language contact, based on the two logical oppo- 
site directions of influence; see Breu (1994: 45—48) and, with a different terminol- 
ogy, Thomason and Kaufmann (1991: 35-64). The first type is what we tradition- 
ally call the “substrate type", with the speakers of a primary language L, (= their 
mother tongue) learning a secondary language L, and changing the latter into L,’ as 
a consequence of the inertia of L,. This is the normal case of language learning, 
which turns into a real case of language change when a whole language community 
starts using L,” as one of their means of communication and passes it on to the next 
generation. The structural effects of the substrate type on L, are often very strong, 
as for instance in the well-known cases of French (Gallic as L,, Latin as L,, Old 
French as L,') or Bulgarian (Balkan Romance as L,, South Slavic as L,, Bulgarian 
as L,’). Regarding Bulgarian, the substrate type of contact-induced change led, for 
example, to the complete loss of the rich Slavic declension system, to the develop- 
ment of a definite article against a common Slavic constant of blocking the rise of 
article systems etc. On the other hand, the complex Protoslavic tense—aspect sys- 
tem has been conserved, contrary to the usual development in the Slavic languages 
outside the Balkans, which can be considered a case of pertinacity due to language 
contact; see Breu (1994: 58—62) for more details. 
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The second contact case is the “adstrate type", i.e. the influence of a foreign 
language L, on the primary or replica language L,, resulting in the change of L, to 
L,'. This is the normal case of borrowing foreign words, of copying structural mod- 
els in vocabulary and, to a lesser extent, in grammar, mainly by loan formations, 
loan constructions, and semantic calquing. This type is the main source for the con- 
tact-induced change in minority languages with the majority languages as their 
model. The so-called "superstrate" does not constitute a type of its own since here 
the same processes are at work as in the case of the adstrate type, the only differ- 
ence being the additional extralinguistic criterion of political or social hegemony 
of at least part of the speakers of the dominant language L,. 

In the first centuries of newly established contact situations, most individual 
speakers of the minority languages in Europe remained monolingual. Since lan- 
guage contact depends on bilingualism, it is, however, reasonable to suppose that 
at least part of the population — normally people with external contacts and a 
relatively high prestige among the minority population — knew the majority lan- 
guage, too. It was their language to show the first contact phenomena, which were 
copied and eventually put to frequent, possibly new (extended or restricted) use in 
the language of the monolingual members of the minority speech community. 
Later on, many more minority people became bilingual, and nowadays, at least in 
the contact areas which will be described below, knowing and using more than one 
language is the normal case. This means that the minority languages are in a situ- 
ation of “total (or absolute) language contact", in which everybody can freely in- 
troduce elements of the majority language, functioning as an umbrella language, 
into the local minority language. This is why in minority languages the adstrate 
type of contact-induced change can have such far-reaching consequences in gram- 
mar as are normally characteristic only for the substrate type. 

Moreover, we are confronted with situations of diglossia, as the use of the 
languages in contact is to a high degree functionally determined. The typical usage 
domains of the majority languages are the mass media, school and higher edu- 
cation, law court, church services, verbal interaction in contact situations with 
"foreigners", including the communication with authorities from outside, and gen- 
erally written texts. The realm of the minority language, by contrast, is that of a 
predominantly or exclusively orally used vernacular in everyday life. 


2. The general situations of linguistic minorities 
in Germany, Austria, and Italy 


Before entering into the discussion of contact-induced grammatical change in Slavic 
and Albanian minority languages with genetically different majority languages, we 
will briefly characterize their extra-linguistic situation. The geographical sketch in 
Map 1 is intended to show the approximate distribution of their territories: 
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GERMANY: 
Colloquial Upper Sorbian 


AUSTRIA: 
Burgenland Croatian 


ITALY: 
Resian 


Molise Slavic 
Italo-Albanian (Molise) 


Italo-Albanian (Calabria) 


Map 1: Geographical position of the minority territories 


2:1: Colloquial Upper Sorbian (Slavic-German language contact I) 


The Sorbian language group belongs to the West-Slavic languages and is tradition- 
ally represented by two literary languages, Standard Upper Sorbian in Upper 
Lusatia in the East-German state (Bundesland) of Saxony and Standard Lower Sor- 
bian in Lower Lusatia in the state of Brandenburg. Sorbs have lived in their modern 
territories and beyond even since before the arrival of the Germans. The story of 
Sorbian is the story of a continuous loss of territory and speakers, with Lower Sor- 
bian now having retreated to a few enclaves in the district of Cottbus with only few 
fluent speakers left, mostly of the older generations; see Norberg (1996: 62-68) for 
a short overview of the general situation of the Sorbs, and Spieß’ appendix (1996: 
69—75) for a statistical attempt at language acquisition and the number of speakers 
in the Sorbian territory. The situation of Upper Sorbian in most of its former terri- 
tories resembles that of Lower Sorbian, except for its western part with the rural 
communes of Crostwitz and Ralbitz/Rosenthal (former administrative district of 
Kamenz) as its centre. In this so-called “core region” of modern Sorbian, an every- 
day vernacular is still used by roughly ten thousand speakers, compared with a 
much higher number of people (up to seventy thousand) considering themselves as 
"Sorbs" but speaking mostly German. We call this variety, which differs from 
Standard Sorbian in many respects, “Colloquial Upper Sorbian” (CUSrb). These 
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differences concern all linguistic levels, including a differing phonology with dis- 
tinctive vowel length in CUSrb contrary to the Standard, and a morphology with its 
own aspect and article system (see below), a special treatment of the dual, etc.; for 
general information on Colloquial Upper Sorbian and its role among the Sorbian 
varieties, see Breu (2000: 51-54) and Scholze (2008: 31-42). Religion plays an 
important role in the CUSrb territory, which is populated by Catholics contrary to 
the almost exclusively Protestant Sorbs in the other regions. Undoubtedly, the 
identification of religion and language has played a decisive role for the conser- 
vation of the minority language in this area. 

Due to its functions as the variety used in school and as a written language, 
Standard Upper Sorbian is a prestige variety in the CUSrb area. Yet it has only a 
limited role in everyday life. Its purist tendencies, going back to the 19th century, 
have had only little effect on CUSrb, and it has not been able to impede even greater 
assimilations to German and its local varieties in more recent times, for example in 
the case of the relatively recent development of an indefinite article. The function 
of an umbrella language for Colloquial Upper Sorbian has been taken over to a 
great extent by Standard German. It should be noted, however, that CUSrb speakers 
are used to writing in Sorbian, which contrasts with the situation in most other lin- 
guistic enclaves discussed below. Sorbian (serbski) is the indigenous ethnonym of 
the Sorbs, whereas Wenden, Wendisch is still used by older people, especially in 
Lower Lusatia, when referring to their people or language in German. 


2.2. Burgenland Croatian (Slavic-German language contact II) 


Burgenland Croatian (BCr), situated in Eastern Austria, is a minority language 
of the Serbo-Croatian (Central South Slavic) group. Unlike the situation of Collo- 
quial Upper Sorbian, the territory of this minority language does not form a single 
continuum but consists of separate enclaves, subdivided into seven groups 
(Neweklowsky 1978: 57—193). These include, apart from the federal state of Bur- 
genland proper, bordering enclaves in Lower Austria, Slovakia and Hungary (Ne- 
weklowsky 1978: 19-21, 377). In total, it has been estimated that this minority has 
about twenty thousand members, with the number of active speakers being much 
lower though (Szucsich 2000: 874). The ancestors of the Burgenland Croatians im- 
migrated mainly in the 15th and 16th century from different parts of historical 
Croatia (Neweklowsky 1978: 264—266), with the individual enclaves belonging to 
all three different Serbo-Croatian dialect groups, i.e. Cakavian (mostly), Stokavian 
(less so) and Kajkavian (only the two villages in Hungary). These terms refer to the 
equivalents of Protoslavic Zëtteg ‘what’, with “Stokavian” meaning that ‘what’ ap- 
pears as sto in the dialects in question, like in the central, southern and eastern part 
of the Serbo-Croatian continuum and also in Standard Bosnian, Serbian and Croa- 
tian, contrary to “Kajkavian” with kaj ‘what’ in the north-west and “Cakavian” 
with ca ‘what’ on the Dalmatian islands. 
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Contrary to many other linguistic islands of Central South Slavic provenience, 
including those in central Hungary, Romania, and Italy, the Burgenland Croats 
have always used Croat as their ethnonym, probably due to their origins in Croatia 
proper, but also because of their having maintained close contacts with Croatia, at 
least as far as the Church and intellectual circles are concerned. 

Burgenland Croatians have their own literary tradition, going back to the 16th 
century. Today's standard is based mainly on the Cakavian varieties but it has been 
adapted in many respects to Stokavian-based Standard Croatian by language re- 
formers and purists, though not without local resistance (Szucsich 2000: 861—869). 
While it is indeed used — and more or less voluntary — for school classes, for church 
services, and by the mass media, its role in everyday life is very restricted (Szu- 
csich 2000: 869—874), in contrast to German in its regional variants which is the 
only real umbrella language. A certain preserving or retarding influence of the BCr 
Standard on contact-induced changes of the minority dialects can, however, not be 
excluded. 


2.3. Resian (Slavic-Romance language contact I) 


Minorities with a language variety genetically based on the Slovene dialect con- 
tinuum (North-Western South Slavic) exist in Hungary, Austria, and Italy. In Italy, 
there are by and large two types of Slovene-based varieties. The first type is con- 
stituted by the borderline minorities in the provinces of Gorizia and Trieste, form- 
ing a continuum with the dialects in Slovenia. The second type consists of the lin- 
guistic enclaves of the Slavia veneta in the Province of Udine of the Region of 
Friuli-Venezia Giulia. Of these it is only the isolated minority language in the 
Resia Valley (Val Resia) which will be considered in this chapter. The far-reaching 
structural differences between Resian and Slovenian result from its long-lasting 
situation of total language contact Resian had with Romance varieties (initially 
only with Friulian, later on — at the latest in the 15th century — with Venetian, and 
from the second half of the 19th century with Standard Italian). Since World War 
IL, Standard Italian has been the only umbrella language for Resian (Benacchio 
2005: 93). The members of the Resian speech community do not use an ethnonym 
based on the term Slovenian but rather derive the denomination for themselves (te 
Rozajanske ‘the Resians’) and for their language (te rozajanski jazek ‘the Resia 
language’) from the name of their home valley. An alternative term they use for 
Resian is po nds ‘in our way’.? 

The ancestors of the modern minority probably came to the Resia Valley from 
Carinthia (Austria) about a thousand years ago. Today, this territory is inhabited by 
roughly 1,300 people in five major villages and several hamlets, mostly bilingual 
or trilingual native speakers of Resian, but with exceptions predominantly among 
younger people. In recent years, efforts have been made to introduce Resian as a 
school language by creating grammars and textbooks. Standard Slovene has never 
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been taught in Val Resia, thus remaining without any influence on the development 
of Resian. 


2.4. Molise Slavic (Slavic-Romance language contact II) 


Molise Slavic (MSlav) is spoken in the Province of Campobasso in the Southern 
Italian Region of Molise. From a genetic point of view, it can be assigned to the 
Central South Slavic language continuum but different from Burgenland Croatian 
exclusively to the Stokavian-Ikavian dialect group, with “Ikavian” referring to the 
pronunciation of Protoslavic é as [i] as in some Dalmatian dialects, contrary to 
Standard Croatian [je], [ije] and Standard Serbian [e]. There are also other criteria 
allowing for a more precise location of the origin of the Molise Slavs, for instance 
the development of syllable-final -/ to -a, referring to the west of the central part of 
the Neretva valley, cf. *vidil ‘see (masculine /-participle)’ > vidija, but vidio in 
Standard Bosnian, Croatian, or Serbian. The ancestors of today's Molise Slavs 
emigrated from Hercegovina (Dalmatian hinterland) to Italy probably in the 16th 
century; see ReSetar (1911: 50-139) and Breu (1997) for an overview of the history 
and the modern situation of the Molise Slavs and their language. 

Molise Slavs do not have an ethnonym of their own and are traditionally accus- 
tomed to Italian-based denominations like Zlava or Skjavuna, i.e. ‘Slavs’, and to 
calling their language simply nas jezik ‘our language’, adverbially na-nasu or na- 
naso ‘in our way’. Molise Slavic is traditionally spoken in three bordering villages, 
each showing a different degree of language preservation: Acquaviva Collecroce 
(the traditional centre), Montemitro (smaller and more conservative), and San Fe- 
lice del Molise (with only few people still speaking Slavic). Dialectal differences 
between these three village communities are considerable at the phonetic and pho- 
nological level, less so in morphosyntax and in the lexicon. Only half of the 
roughly 2,000 inhabitants of these villages are still speakers of Molise Slavic, 
among them only few children. We need to add, however, a relatively large number 
of emigrants, especially in Australia (Perth). 

After the immigration, Molise Slavic came first under the influence of the 
Molisian dialect of Italian, to which Standard Italian influence was added after the 
unification of the South with the Italian kingdom in the second half of the 19th cen- 
tury. Today, Standard Italian (in its regional variety), being the only umbrella lan- 
guage for Molise Slavic, is the main source for foreign influences, while Standard 
Croatian — despite some educational experiments — has never played any role at all. 


2.5. Italo- Albanian (Albanian-Romance language contact) 


Italo- Albanian (ItAlb), or Arb(é)risht, Alb(E)risht, Arb(&)resh, shares similar con- 
tact conditions with Molise Slavic, as it is also spoken by minorities in Southern 
Italy. Nevertheless, there are considerable differences with respect to the numbers 
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of speakers and to the territories inhabited. While Molise Slavs live in a small com- 
pact area, there are almost fifty Italo- Albanian villages, spread out across several 
regions from Molise via Campania, Basilicata, Apulia, and Calabria down to 
Sicily, with a concentration, however, in the North Calabrian province of Cosenza; 
for a map see Altimari (1991: 64). Moreover, their immigration was a process con- 
sisting of several waves, taking place at least from the 15th up to the 18th century. 
Linguistic evidence like the shift of intervocalic n to r (Tosk rhotacism) and the 
lack of an infinitive point to Southern Albania and Greece with their Tosk dialect as 
the original homelands of the Italo-Albanians. As in the case of the other minor- 
ities, the number of speakers can only be estimated, with numbers running from 
some tens of thousands up to one hundred thousand. Due to the extension of the 
Italo-Albanian territories, many Italian dialects have played a role in the contact- 
induced development of the Italo- Albanian dialects but just as in the case of Molise 
Slavic, Standard Italian is nowadays the main source of foreign influences. 

The Italo-Albanians have not only kept their traditional ethnonym, contrary to 
Standard Albanian, where it has been replaced by shqiptar ‘Albanian’ (shgip when 
referring to the language), but many communities have also kept their own ecclesi- 
astic characteristics, i.e. a liturgy of the Greek Orthodox type, which for a long 
time inhibited mixed marriages with Italians. The process of immigration over 
three centuries and the territorial distribution, however, have led to much greater 
differences between the individual Italo-Albanian dialects than in the case of Mo- 
lise Slavic; see Savoia (1991) for a classification. In this chapter, we will refer only 
to the Italo- Albanian varieties of Calabria and Molise. 


3. Pertinacity and change of grammatical categories 
in total language contact 


Two grammatical categories will serve as the basis for our comparison of the con- 
tact-induced changes in the minority languages introduced in section 2: definite- 
ness (article systems) and verbal aspect. For reasons of space the discussion of 
their characteristics must be restricted to a minimum. 


3.1. Contact-induced article systems 


The absence of an article system can be considered a diachronic constant of the 
Slavic language family. Bulgarian and Macedonian are the only Slavic Standard 
languages with a definite article (placed after the noun) and to a certain extent also 
an indefinite one. This, of course, is not an argument against the assumption of 
such a genetic diachronic constant, as the Balkans are an area of intense language 
contact. In fact, articles play an important role in the contact-induced changes of 
the Slavic minority languages. 
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In Molise Slavic we find a fully grammaticalized indefinite article JENA with 
referential (1), non-referential (2) and generic functions (3). This functional range 
is almost completely identical to that of the indefinite articles of Romance contact 
varieties and was certainly induced in older times by the Molisian dialect, as we 
find it already in the texts given by ReSetar (1911: 237—322). Here and in the fol- 
lowing, small capitals symbolize the whole paradigm of the article word in ques- 
tion. In the case of JENA this includes also the clitics na, nu besides the full forms 
jena, jenoga, jenu etc. (Breu 2003a: 34-35): 


Molise Slavic 


(1) Ovo je na Storija tra na medik as 
this:N is a:NOM.SG.F story between a:NOM.sG.M doctor and 
na vidzil. 


a:NOM.SG.M cop 
“This is a story of a doctor and a policeman.’ 


(2) Cini-mi nu vizitu pur men! 
make-me a:AKK.sG.F medical examination also me:DAT 
‘Examine me, too!’ 


(3) Tranj je na sekj s konopam zgora. 
water bucket is a:NoM.sG.M bucket with rope.INSTR.sG.M above 
"The water bucket is a bucket with a rope on it.’ 


We understand “generic” in the wider sense including predicates. While in English 
the equivalent of the generic subject tranj in (3) could have a generic indefinite 
article (a water bucket), too, this is excluded both in Molise Slavic and in Italian 
(where the definite article would be obligatory). 

The Molise Slavic indefinite article is one of the results of the most productive 
process in language contact, which is the “adaptation to the semantic structure of 
the dominant language” (Breu 2003b: 354—363). In the case of the indefinite article 
in Molise Slavic, this process of semantic calquing consists in the analogical re- 
production of a foreign polysemic model in the replica language. More precisely, 
the polysemy of uno in Romance, expressing the numeral ‘one’ and also the indefi- 
nite article, was transferred to Molise Slavic by means of a corresponding semantic 
extension of the numeral JENA ‘one’, precisely with the effect of creating an indefi- 
nite article that is homonymous with the numeral. 

Exactly the same process was responsible for the transfer of the polysemy of 
German EIN to Colloquial Upper Sorbian JEN, once again resulting in the develop- 
ment of an indefinite article in all its functions, homonymous with the numeral 
‘one’ (Breu 2003a: 65, Scholze 2008: 144-156); see the following examples with 
JEN in its referential (4), non-referential (5) and generic article function (6). As the 
latter example shows, CUSrb JEN appears not only in the predicate but also in the 
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subject of a generic phrase, just like EIN in German, but unlike Molise Slavic JENA, 
see example (3) above: 


Colloquial Upper Sorbian 


(4) Najemol sindze jen muz z jenym koferom. 
suddenly come.PRs.3sG a:NOM.sG.M man with a:INSTR.SG.M suitcase 
“Suddenly a man comes with a suitcase.’ 


(5) Ja cem jen mikser mé. 
I  wantPRs.lsG a:AKK.SG.M mixer  have.INF 
‘I would like (want) to have a mixer.’ 


(6) Jen tigor jo jene wulke zwérjo. 
a:NOM.SG.M tiger is a:NOM.SG.N big animal 
‘A tiger is a big animal.’ 


Contrary to Molise Slavic, this seems to be a relatively recent development in 
Colloquial Upper Sorbian. Only the younger generation below their fifties use the 
indefinite article consistently, whereas older people still nearer to dialect usage 
omit it under certain conditions. The presence of attributes seems to be the most 
important factor for not using the indefinite article (Breu 2003a: 56). Apart from 
this, deviations from the German usage of the indefinite article can be found in all 
speakers in generic predications and with prepositions; see Breu (2003a: 46-49) 
and Scholze (2008: 151-156). It should be noted, however, that the indefinite ar- 
ticle, suppressed in modern Standard Upper Sorbian, appeared already in the oldest 
Sorbian texts with some of its functions (Breu 2003a: 55-56). 

There is, however, another, much greater difference in the article systems of 
these two minority languages. This concerns the definite article, which is present 
in Colloquial Upper Sorbian, but absent in Molise Slavic, despite the fact that their 
respective model languages German and Italian have, in principle, the same article 
system.? The CUSrb definite article TON, originally a deictic pronoun, has devel- 
oped from copying the German model of DER, which is polysemic with respect 
to deixis and definiteness. In both Sorbian Standard languages, TÓN with article 
functions has been eliminated, due to the efforts of 19th century purists, just like 
the indefinite article, but it was present in the early literature and in bible trans- 
lations of the 15th and 16th century (Berger 1999) and continues to be used fre- 
quently in modern dialects, although apparently in an inconsistent way (Bayer 
2006: 108—122, 158), without the strict rules developed in Colloquial Upper Sor- 
bian. Here, TON is obligatory in many contexts (Breu 2004: 16-44, Scholze 2008: 
157-179) though it has not yet reached the full functional range of its German 
equivalent DER. We find it consistently used with deictic and phoric (pragmatic) 
functions, as in the anaphoric example in (7) and the associative anaphoric 
example in (8), while it is absent in the cases of semantic definiteness, e.g. with 
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uniques (9), or with typical accessories or constituent parts like the 'cellar' as part 
of a house in (10), but also in the case of generic subjects as in (11), here in vari- 
ation with the indefinite article as shows example (6) above. In (7), the definite 
article refers to the ‘cop’ as the rheme, introduced by jen pólicist in the preceding 
context, given in brackets. The omitted sentence, marked here by three dots, is 
example (4) above: 


Colloquial Upper Sorbian 


(7) [Jen pólicist tam stej ha napisa jen strafcetl ...] Tóm pólicistej 
[ sal the:DAT.SG.M COp:DAT.SG.M 
pa jo to Sojene. 
but is it indifferent 
‘[A cop is standing there and is writing a ticket ...] The cop, 
however, does not care.’ 


(8 Ta ce na jen film hic, dókej$ wona to rezisera 
She wants on a film go because she  the:AKK.sG.M director 
znaje. 


know:PRs.3sG 
‘She wants to go to a film, because she knows the director.’ 


(9 Ø  Zemja so wóko Ø  slónca werci. 
DEF earth RFL around DEF sun turn:PRS.3SG 
‘The Earth rotates around the Sun.’ 


(10) Sy ty Ø  pincu redzita? 
be.PRs.2sG you DEF cellar clean.PsT.PCPL.F 
*Did you clean the cellar?' 


(11) Ø  Kruwa jo jene zwérjo. 
DEF COW iS a:NOM.SG.N animal 
‘The cow is an animal.’ 


The absence of a definite article in Molise Slavic is a violation of the linguistic uni- 
versal 1163 (previously 1167) in the “Konstanz Universals Archive” (http://typo. 
uni-konstanz.de/archive/intro; see Breu 2003a: 62—65) saying that an indefinite ar- 
ticle develops only after a definite article, which cannot be explained by the inter- 
nal structure of the minority language. In fact, we are confronted with a linguistic 
"accident" going back to the contact situation. The reason for this "accident" is 
immediately connected to the structure of the Romance contact varieties, whose 
definite articles are not homonymous with any of their deictic pronouns. For 
example, Standard Italian 1L (definite article) differs both semantically and for- 
mally from the deictic pronouns QUESTO ‘this’, CODESTO ‘this, that’, and QUELLO 
‘that’, and hence did not provide a model for an analogical extension of the mean- 
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ing of the corresponding Molise Slavic pronouns ovi, TA, ONI. What is more, this 
semantic structure of the model languages even inhibited Molise Slavic from inde- 
pendently developing a definite article from, for example, TA with its relatively 
neutral deixis. 

The corresponding deictic pronoun, actually, was the source for the Bulgarian 
definite article, whereas Macedonian uses all three deictic differentiations for its 
article system. According to what was said above, the definite articles in the Bal- 
kan Slavic languages did not develop independently but are a consequence of lan- 
guage contact, going back to the Romance substrate varieties, whose definite ar- 
ticle had probably derived from the Latin pronoun ILLE, when it still had its original 
deictic characteristics. 

Molise Slavs feel a clear correspondence between the Italian article iL and the 
absence of an article in their primary language, which can be concluded, for 
example, from the fact that they always give correspondences with a definite article 
in Italian when asked for the translation of a Molise Slavic noun. In the opposition 
JEN: Ø, definiteness is clearly connected to the zero-morpheme, for example in 
theme-rheme constructions like (12), the more so as the cases of using no article at 
all are very rare in Italian, which requires the definite article in generic contexts. 


Molise Slavic 


(12) Ovo je na Storija tra na medik 
this:N is a:NoM.sG.F story between a:NoM.sG.M doctor 
as na vidzil. 
and a:NOM.SG.M cop 
Ø Medik je ferma Ø maginu. 


DEF doctor:NOM.SG.M be:PRs.3sG stop:PRs.3G DEF car 
“This is a story between a doctor and a cop. The doctor has stopped his car.’ 


In this way, O has become a “definite zero-article" in the “grammatical diasystem" 
of the bilingual speakers. The "grammatical diasystem" (or diagrammar) of bilin- 
guals is a theoretical model based on the hypothesis that multilingual speakers do 
not strictly separate the grammars of their two or more languages but combine 
them in the most economical way possible. It could be claimed that the grammars 
of the individual languages are derived from the common diagrammar (as a psy- 
chological reality) by specific rules: the fewer the rules, the more economic is 
the management of the languages. In situations of "absolute" language contact, in 
which all speakers are bilingual, the reduction of language-specific rules seems to 
be the most important reason for language change. This is especially true with re- 
spect to the minority languages, as the pressure of the language community to pre- 
serve their specific rules is relatively weak, given the high prestige of the dominant 
majority language; for the MSlav and CUSrb article diasystems see Breu (2003a: 
58—62). The indirect expression of definiteness could be an explanation for the 
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early development of an indefinite article in Molise Slavic, contrary to its relatively 
late development in Colloquial Upper Sorbian and in spite of this early develop- 
ment violating the linguistic universal mentioned above. 

A complication for this line of argument seems to come from the Slovene- 
based Resian minority language. Resian did develop a definite article TE besides 
the indefinite article DIN, at least for some of its functions; see Steenwijk (1992: 
125-126) for the forms of the indefinite and the definite article in the modern 
Resian dialect of San Giorgio, with indefinite examples like to ma din lípi giist 
‘this has a nice taste’, ko j bil ni slábi tamp ‘when there was (a) bad weather’. The 
definite article appears normally in combination with adjectives, e.g. za te myrtve 
‘for the dead’, te stare Zane ‘the old woman’. Other cases can be found in Benac- 
chio (2005: 105), with examples from the texts in Steenwijk (1999); see also Be- 
nacchio (2002: 47) for the same peculiarity in ancient texts. 

Despite having been in total language contact with Romance donor varieties, 
just like Molise Slavic, there is, however, a crucial difference between the contact 
histories of the two minority languages, which is the likely reason for the definite 
article development in Resian, namely the historical influence of German on the 
whole complex of Slovenian varieties. Article functions of the demonstrative 
pronoun, normatively banned from modern Standard Slovene, appear very early in 
Slovenian literature and are still present in dialects of the Slovenian “mainland” 
and in Austria. They probably go back to the polysemic model of German and its 
varieties, with which Slovenian came into contact a thousand years ago. Therefore, 
it might well be assumed that this polysemy had existed in older Resian even be- 
fore it came into total language contact with Friulian and Italian. Apart from these 
early influences, Steenwijk (1992: 1) notes some "influence of German, still pres- 
ent in Val Canale". German influence on the development of a Slovene “definite 
article" is, however, discussed controversially (Benacchio 2002: 44—48). 

Burgenland Croatian can be claimed to continue without an indefinite article, 
whereas first steps towards a definite article derived from a demonstrative can be 
noticed. This means that this minority language is still far from copying the Ger- 
man model; see Neweklowsky (1978: 231) for the expression of indefiniteness by 
the numeral ‘one’ with the function of an indefinite pronoun and Bayer (2006: 
135-142, 164—165) for a corpus-based analysis of the overall use of article forms 
in Burgenland Croatian. The reasons for this conservatism might be connected 
with the preservation of the old differentiation of definite vs. indefinite in the ad- 
jectives of most dialects of Burgenland Croatian (Neweklowsky 1978: 43, 207). 

Italo-Albanian traditionally had an article system of its own, just like all the 
other varieties of Albanian, consisting of a fully developed postpositive definite 
article and an indefinite article displaying some differences compared with the Ita- 
lian system. So the question here was not one of copying the system but rather of 
adapting article functions according to the model of the majority language. This is 
exactly what happened.^ 
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3.2. Contact-induced reduction, restructuring, 
and extension of aspect oppositions 


There are basically two formal oppositions of verbal aspect in the Slavic languages: 
an inflectional one of the imperfect-aorist type and the grammatically derivational 
type, expressed by stem oppositions and known as the Slavic "opposition of per- 
fectivity". As for the modern languages, the inflectional type is almost completely 
restricted to the South-Slavic languages (excluding Slovene). It is most common in 
Bulgarian and Macedonian and limited to formal literary style in the Serbo-Croa- 
tian Standards (Bosnian, Croatian, Serbian), in spite of its productivity in some 
mainly central and south-eastern dialects. In Slovene as well as in the West and 
East Slavic languages, imperfect and aorist have disappeared, with their functions 
being absorbed by the analytic (periphrastic) perfect of the type “ro BE + l-parti- 
ciple”. The only exception to this geographical distribution is the Sorbian language 
group, which, in principle, has preserved both formal aspect oppositions, notwith- 
standing its affiliation to West Slavic.5 

In Molise Slavic the inflectional type of aspectually differentiated synthetic 
past tenses turned into an opposition of synthetic imperfect vs. periphrastic perfect, 
due to the loss of the aorist and the perfect taking over its functions (Breu 2005: 
41-42). In the history of other Slavic languages where at least one of the members 
of the ancient opposition of imperfect vs. aorist had been replaced by the perfect, 
this process did not happen in one step either. But it is a diachronic constant of 
Slavic that the imperfect is always lost before the aorist, and not the other way 
round, as in Molise Slavic. This is true even for the genetically closely related 
Serbo-Croatian dialects, which have either kept the ancient opposition entirely or 
exhibit a reduced system with the perfect replacing only the imperfect (Ivić 1958: 
120). The opposite development of Molise Slavic is obviously a result of language 
contact, as northern and eastern dialects of Italian — including the Molisian dialects 
in the neighbourhood of the Molise Slavic territory — have reduced the older three- 
part system to exactly the same opposition of imperfect vs. perfect (passato 
prossimo) by losing the aorist (passato remoto). It can be claimed that the loss of 
the aorist preceding the loss of the imperfect is a diachronic constant of Romance, 
as all Romance languages with a tendency towards the reduction of the old three- 
part system have undergone exactly this development, e.g. Rumanian and French 
(colloquial and northern dialects). This example shows even more clearly than the 
development of article systems in section 3.1. that language contact can induce a 
replica language to replace a diachronic constant of its own genetic group by the 
corresponding diachronic constant of the model language and its genetic group. 

The opposition of perfectivity as expressed by aspectual pairs consisting of a 
perfective and an imperfective verb with the same lexical meaning has been com- 
pletely preserved in Molise Slavic. Therefore, it still has both a perfective and an 
imperfective imperfect and a perfective and an imperfective aorist, with traditional 
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special functions like "iteration of sequences of events" (perfective imperfect) and 
"delimitation of activities and states" (imperfective perfect); see Breu (2007: 
194—197). Language contact seems, however, to be responsible for the dominance 
of the inflectional opposition over the derivational one. This can be seen most 
clearly from the overall exclusion of the (imperfective) perfect from expressing 
states and ongoing processes, just as in Romance and unlike the developments in 
the other Slavic languages (Breu 2005: 79-80). Only the (imperfective) imperfect 
can be used: 


Molise Slavic 


(13) Mendr ka sa  vraécasa, su ngopal 
while that RFL return:IMPFV.IMPERF.3SG be:PRS.3sG arrive.PFV.PTCP.PL 
vuca. 


wolf.NOM.PL 
“When he was returning home, wolves appeared.’ 


The earlier mentioned conservation of the inflectional aspect opposition of imper- 
fect vs. aoristin Standard Upper Sorbian seems to contradict the complete absence 
of aspect oppositions in the German majority language. Nevertheless, this situation 
does not speak against contact influence. Actually, it is due to Standard German, 
with its opposition between a simple and a periphrastic past, that such inflectional 
aspect forms have been preserved in Standard Upper Sorbian contrary to all the 
other West and East Slavic languages (Breu 2005: 38-41). Evidence for this claim 
comes from the tight link between the inflectional and the derivational aspect cat- 
egories, without the traditional contradictory combinations (perfective imperfect, 
imperfective perfect) mentioned in the case of Molise Slavic. As a consequence, 
there is only one simple past (as in German), differentiated secondarily according 
to the opposition of perfectivity into a perfective and an imperfective simple past 
(perfective aorist: imperfective imperfect). In Upper Sorbian dialects the aspec- 
tually differentiated simple past continues to be used but it can always be replaced 
by the periphrastic perfect. Colloquial Upper Sorbian, however, has reduced this 
opposition to only a few verbs, mainly auxiliaries, modal verbs and some speech- 
act verbs (Scholze 2008: 213-214), i.e. exactly the same groups of verbs to which 
the simple past is restricted in modern colloquial German, at least in the southern 
part of Germany, including Upper Lusatia. This is clearly another influence of the 
majority language, in this case from one colloquial variety onto the other. 

While in normative Standard Upper Sorbian the “derivational” opposition of 
perfectivity has preserved its function of viewing the lexically given state of affairs 
as a whole (totality function), it has turned into an opposition of terminativity in 
Colloquial Upper Sorbian. This means that the (traditionally mostly prefixed) per- 
fective verbs have been reduced to expressing terminativity (telicity) just like Ger- 
man prefixed verbs, regardless of whether the state of affairs in question is pres- 
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ented as a whole, as iterating or as an ongoing process (Breu 2000: 54-66; Scholze 
2008: 231—237). That is why in Colloquial Upper Sorbian "perfective" verbs can 
be used, for example, in the actual present (14), which contrasts with all the other 
Slavic languages, including Standard Upper and Lower Sorbian and Molise Slavic: 


Colloquial Upper Sorbian 


(14) Sto tón tam čini? — Ton Setozi rune 
what he there do:IPFV.PRS.3sG he  translate:PFv.PRs.3sG just 
jen list. 

a letter 


“What is he doing there? — He is just translating a letter.’ 


Nevertheless, the derivational aspect opposition has preserved its grammatical 
character in Colloquial Upper Sorbian. Although it no longer marks the “totality” 
of a state of affairs, there are clearly fixed rules for presenting a state of affairs as 
terminative or aterminative (telic or atelic; Breu 2000: 56-66; 2005: 61-67). Such 
rules are completely unknown in German and apply even to loanwords in Collo- 
quial Upper Sorbian (Scholze 2008: 231). This corresponds to a restructuring of 
the traditional Slavic opposition of perfectivity to a new grammatical opposition 
based on terminativity as a feature of German word formation. 

Such a restructuring was impossible for Molise Slavic, as Italian does not pro- 
vide a model for the expression of terminativity by means of prefixes. In fact, there 
was no model at all for changing the opposition of perfectivity in Molise Slavic 
so that it remained untouched. It even shows a high degree of productivity in the in- 
tegration of loanwords. Actually, it is in this field that terminativity has a special 
role in the Molise Slavic system, too: Italian aterminative verbs are integrated 
as imperfectives, whereas terminative verbs appear as perfectives, forming an as- 
pectual pair together with an imperfective partner derived by suffixation from the 
loan verb, e.g. It amare ‘to love’ (aterminative) > MSlav amat (imperfective), but 
It partire ‘to leave’ (terminative) > MSlav partit (perfective) — parcivat (imper- 
fective); see Breu (2007: 198—200) for more details. Incidentally, a parallel pro- 
cedure exists in Resian, for example It molteplicare ‘to multiply’ — Resian molti- 
plikät — moltiplikawat (Benacchio 2005: 98-99). 

The fate of the aspect categories in Burgenland Croatian has not yet been 
investigated in detail. From what is known by now, we can conclude that it has fol- 
lowed the general path of development of north-western Serbo-Croatian dialects 
by completely losing the inflectional aspect opposition. As for the “derivational” 
opposition of perfectivity expressed by means of aspectual pairs, it is much more 
conservative than Colloquial Upper Sorbian by having kept the central function of 
totality of the perfective verbs. This is clearly demonstrated by the exclusion of the 
formal perfective present from the expression of ongoing processes in Burgenland 
Croatian, i.e. examples of the CUSrb type (14) would not be possible here. On the 
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other hand, the use of perfectives for habitual events in the present goes beyond 
their use in modern Serbo-Croatian varieties (Neweklowsky 1978: 244—245). In 
this case the criterion of lexical terminativity is particularly prominent, as termi- 
native (telic) meanings seem to be expressed consistently by perfective verbs 
(often differentiated from their imperfective partners by prefixes) and aterminative 
(atelic) meanings by imperfectives, with the opposition of totality (‘whole’ vs. 
*part/ongoing") being ignored. In the light of the Sorbian restructurings this could 
be considered another case of German influence due to the terminative character of 
German prefixes of verbs. But Burgenland Croatian is truly more conservative 
with respect to perfectivity. This may well be attributable to having maintained its 
relations with “mainland” Croatian through the centuries, at least via the Burgen- 
land Standard. Another reason may be a less intense influence of German varieties, 
given the centuries-long Hungarian rule over that territory. The conservative char- 
acteristics of Burgenland Croatian grammar are confirmed by the fact that the de- 
velopment of an article system is still in its infancy, as mentioned above. 

As can be expected from the reduction of the inflectional aspect opposition in 
Molise Slavic, Resian is another case of a Slavic minority language which, 
contrary to Standard Slovene, has lost its aorist but preserved the original imper- 
fect, thus following the respective diachronic constant of Romance. It is here, how- 
ever, that the parallelism with Molise Slavic ends. In fact, a more thorough inves- 
tigation of the aspectual behaviour of the Resian verb, for example in the texts 
given by Steenwijk (1992: 189—224), shows that here the (imperfective) perfect 
can indeed express states and ongoing processes. This is impossible in Molise 
Slavic, where this function is exclusively tied to the imperfect. Moreover, the Re- 
sian imperfect has practically lost its indicative functions and changed into a 
counterfactual mood (irrealis), e.g. méson délat ‘I could have done’, bésté vídel 
‘you would have seen’; see Steenwijk (1992: 182-183), noting with respect to the 
San Giorgio dialect: "Imperfect forms indicate actions in the past that were never 
realised". The hypothetical function of the imperfect is frequent in Molise Slavic, 
too, but here in addition to its primary indicative aspect functions. The reason for 
this polysemy comes from an analogical extension of the functions of the indica- 
tive imperfect by copying the corresponding model of colloquial Italian, where the 
indicative imperfect, e.g. veniva *he came', replaces both verbs in the protasis and 
apodosis of the unreal conditional clause, substituting in this way the past perfect 
of the subjunctive, here fosse venuto, and the past conditional of Standard Italian, 
here sarebbe venuto. The reduction of the Resian imperfect to its secondary func- 
tion could be explained in a similar way to the development of its definite article, 
i.e. by German influence. Southern German varieties have completely lost their in- 
dicative simple past, keeping, however, past tense stems in the paradigm of the 
subjunctives, which are mainly used to express hypothetical conditional clauses. 

The reduction of the old three-part opposition of imperfect, aorist and (peri- 
phrastic) perfect is relevant for Italo- Albanian, too. Whereas the old opposition 
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continues to exist in Standard Albanian, it has in fact been reduced to an opposition 
of imperfect vs. aorist in many Italo- Albanian dialects, with the aorist also serving 
the functions of a perfect (see Altimari 1991: 63—64 for the isoglosses). At first 
sight, this seems amazing, as it is the exact contrary to the Molise Slavic develop- 
ment. Nevertheless, a contact explanation might again be justified: The periphras- 
tic perfect has a very restricted use in the extreme south of Italy (Rohlfs 1969: 46), 
where even the resultative meaning tends to be expressed by the aorist. As a con- 
sequence, the analytic perfect of the type “Kam ‘have’ + passive participle" in the 
language of the Albanian immigrants may have followed the model of the majority 
language; the South Italian model may even have been overgeneralized by com- 
pletely getting rid of its indicative function to the benefit of the aorist. In addition, a 
general weakening of the perfect could already have happened before the immi- 
gration to Italy due the language contact with medieval and modern Greek with 
their restricted use of analytical perfects. 

The perfect periphrasis itself continued to exist in Italo-Albanian, acquiring, 
though, the modal function of a presumptive. Thus, ItAlb ka vdekur does not 
simply mean ‘s/he has died’ but ‘s/he has probably died, s/he must have died’ 
(Breu 1991). Once again, the development of this modal meaning can be explained 
by language contact, more exactly by means of a two-step process. The first step 
concerns the perfect auxiliary kam ‘have’, which extended its meaning. Since in 
Southern Italy the Romance verb AvÉ means both ‘have’ and ‘must’, this poly- 
semic model was transferred to Italo- Albanian, with kam developing the additional 
meaning "must The second step originates in the omissions of dependent modal 
and auxiliary verbs typical of South Italian dialects. This led to parallel omissions 
in Italo- Albanian. Thus, for instance, ka té keté vdekur ‘s/he must have died’ with 
its presumptive meaning was reduced to ka vdekur by omitting të ketë, the subjunc- 
tive (replacing the infinitive in Albanian) of kam ‘have’. As a consequence, the 
presumptive construction became homonymous with the (original) perfect and was 
ultimately grammaticalized as the only meaning of this form. 

Yet the problem of the presumptive is not completely resolved. It is still an 
open question why it also emerged in northern Italo-Albanian dialects, for 
example in Molise, where the local dialects (and with them Molise Slavic) have 
lost the aorist. A potential explanation for this might be migration from the South 
at a time when the presumptive had already developed there. Another problem, 
requiring further research, comes from the opposite fact that the Italo-Albanian 
dialects south of the Calabrian province of Cosenza and in Sicily have not com- 
pletely lost the perfect. There could be special language-internal reasons for this, 
like for instance the partial use of jam ‘to be’ as perfect auxiliary in the dialect 
of San Nicola dell’ Alto in the province of Crotone (where the perfect has even 
replaced the aorist). This means that the exclusive use of kam ‘to have’ as the 
perfect auxiliary could be a necessary condition for the development of a pre- 
sumptive. 
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In Italian varieties we find aspectual periphrases, e.g. for expressing ongoing 
processes like sto partendo ‘I am leaving’ in Standard Italian, or for imminent 
states of affairs like sto per partire ‘I am on the point of leaving’. Such construc- 
tions tend to be calqued by the minority languages, though with different results on 
the basis of the individual replica systems. Thus we have Molise Slavic sa parci- 
vam, literally “now I leave", and stojim za partit “I stand for leaving" respectively, 
whereas Italo-Albanian expresses, for instance, the ongoing process by jam e 
nisem, literally “I am and leave", in one dialect group and jam që nisem "I am that I 
leave" in others. The motivation for these different constructions is not clear and 
needs further investigation into the insufficiently described local Italian dialects. 
This is too, in order to shed more light on the geographical distribution of the pre- 
sumptive. 


4. Summary and further research 


The minority languages in linguistic enclaves in central and southern Europe dis- 
cussed in this chapter are examples of particularly strong and long-lasting lan- 
guage contacts with one or more varieties of genetically diverse majority lan- 
guages functioning as their umbrella languages and also as models for their lexical 
and grammatical restructuring by contact-induced changes of the adstrate type. 
The dramatic changes in the morphosyntactic domains of article systems and ver- 
bal aspect served as examples for the overall adaptation of the minority languages 
to foreign models. They are the result of centuries of linguistic isolation and wide- 
spread bilingualism in the minority territories with little or no connections to the 
standard varieties of their own language group (phylum). What has happened in 
these special situations of total language contact might well be considered the end 
point of less intense influences in other contact situations, too, only that in such 
situations changes happen much more slowly and are often interrupted by a change 
of the sociolinguistic situation before a prospective restructuring will be com- 
pleted. 

The principle of the “adaptation of the semantic structure", which is respon- 
sible for the semantic extension of lexical units (semantic calquing), has turned out 
to be productive with respect to contact-induced changes in grammar, too. In the 
examples discussed, it caused the merger of aorist and perfect, with the direction of 
the merger being determined by the structure of the dominant language. The same 
was shown to be true for the rise of indefinite and partly also definite articles. On 
the other hand, the lack of a corresponding polysemic structure in the model lan- 
guage can even block a well-known grammaticalization path, like that of the defi- 
nite articles deriving from demonstrative pronouns, for instance. This was demon- 
strated for Molise Slavic, where at the same time another typological universal was 
overridden, namely the one that indefinite articles develop only after definite ones. 
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In both cases, genetic diachronic constants of the minority languages were can- 
celled out by the constants of the majority languages due to language contact. One 
of the consequences of language contact overriding the genetic relations of the mi- 
nority languages is that these minority languages now perfectly fit the European 
language map, with its isoglosses for the distribution of article systems and the loss 
of simple past tenses (and aspect; see Breu 1994: 56—58). 

Another result of our comparative study was that grammatical forms losing 
their primary functions, as was the case with the aspect functions of the perfect in 
Italo-Albanian and the imperfect in Resian, do not necessarily become obsolete. 
They can perfectly well serve as starting points for the development of new inde- 
pendent grammemes from secondary meanings, in this case the presumptive and 
the hypothetical mood. Although the secondary non-indicative meanings in ques- 
tion go back to the characteristics of the model language, the final result sharply 
differs from the model. There are many similar cases, like the development of a 
grammatical opposition between two futures or of an impersonal neuter in Molise 
Slavic, which have not been discussed here but which could have offered addi- 
tional insight into the problem of novel “independent” developments in situations 
of total language contact. 

The absence of a category of the replica language in the model language does 
not automatically mean that it becomes obsolete. Quite to the contrary, the lack of a 
model for the change can result in its conservation, as in the case of the derivational 
aspect opposition of perfectivity in Molise Slavic. But whenever a model is found, 
even if it is only an indirect one like the German lexical opposition of terminativity 
(telicity) based on verbal prefixes, it can serve as a basis for restructuring the gram- 
matical system of the minority language. The case in point in this chapter was the 
grammaticalization of the opposition of terminativity in the aspect system of Col- 
loquial Upper Sorbian to the detriment of the opposition of perfectivity. New op- 
positions on the basis of periphrastic models may result in replica-specific formal 
realizations, as was shown by the aspect periphrases in Molise Slavic and Italo-AI- 
banian. Unfortunately, our knowledge of the local varieties of the majority lan- 
guages in Italy and elsewhere is still very restricted. In view of their historical role, 
further investigations of the contact-induced developments in the minority lan- 
guages must therefore include specific research on the structural variation found in 
these local dialects of the majority languages. 


See also the following chapters in this volume: 16 by Tomić, 17 by Wälchli, 19 by 
Óstman, 21 by Pusch and Kabatek, 22 by Hill, 30 by Gilles and 20 by Stern. 
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Notes 


1 See Goebl et al. (1997: 1318-1398, 1753-1790) and Hinderling and Eichinger (1996) for 
other contact situations in the mentioned countries, including “genetically” less differ- 
entiated contact situations, with minority languages like Frisian or Danish in Germany or 
Occitan and Franco-Provengal in Italy. Other minority languages, like German-based 
Walser, Mócheno, and Cimbro in Italy, would, however, have provided interesting argu- 
ments for our comparative analysis. Unfortunately, they had to be excluded from our dis- 
cussion for reasons of space. 

2 Iam grateful to Rosanna Benacchio for this information. Apart from the Resia Valley, the 
Slovene-based minorities of the Natisone and Torre Valley have also lived for centuries in 
intense contact with these Romance varieties. See Benacchio (2002: 17—24) and Steen- 
wijk (1992: 1—7) for brief overviews of the history, the actual situation, and the research 
done in the Slavia veneta. 

3 There are, of course, typical differences between Italian and other languages, for example 
the obligatory use of the definite article with mass nouns as subjects, optional in German 
and excluded in English, e.g. It. L'acqua é un liquido : (Das) Wasser ist eine Flüssigkeit: 
Ø Water is a liquid. Apart from the fact that this usage is clearly marginal, it should have 
even heightened the pressure on transferring the definite article to Molise Slavic by in- 
creasing its frequency in Italian. 

4 Judging from the standpoint of the differences between Standard and Italo-Albanian, 
possible results of such an adaptation could be found in the realm of non-referential use, 
but only restricted to hypothetical clauses and to questions of the type ‘is there a doctor in 
your village?', where Italo-Albanian informants used the indefinite article NJÉ, corre- 
sponding to un in Italian, while Albanians insisted on the indefinite pronoun ndonjé ‘any’. 

5 InStandard Lower Sorbian the conservation of the simple past (imperfect: aorist) is a pure 
claim of grammarians and some writers copying the Upper Sorbian and German model. In 
Lower Sorbian dialects, following the normal development of the West Slavic languages, 
it has been obsolete for centuries. See Michalk (1959) for the use of the simple past in all 
Sorbian varieties, from the historical and the modern (of his time) point of view. 
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4.2. Frisian (East Frisian, Saterlandic) 

5. Language politics 

6. The role of linguistics 

1. The term minority language and other terms 


A language minority is considered to consist of a small group of speakers of a spe- 
cific language used in a given, commonly political, territory in which one or more 
other, major language(s) are spoken. Thus, speakers of a language spoken by mil- 
lions can constitute a minority when their language is used outside the country of 
origin, and under conditions described above (see also chapter 25 in this volume by 
Extra). But there are also languages with a small number of speakers which do not 
have the status of a majority language anywhere. For these, the term regional lan- 
guage is often used. Small languages (translating the German term Kleinsprachen 
often employed in this context) can be of this regional type, but they can also 
simply be languages with a small number of speakers (in no minority setting), like 
Icelandic or Estonian. In the context of the European Union, the term lesser-used 
language is applied for the same type of language. The treaty for the protection and 
support of those languages is called ‘Charter for Regional and Minority Lan- 
guages’. Languages considered in the treaty are “those traditionally used by part of 
the population in a state, but which are not official state language dialects, migrant 
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languages or artificially created languages” (cf. http://ec.europa.eu, full link in the 
bibliography). This chapter will focus on the regional and small indigenous lan- 
guages. They are considered “old” minorities since they have existed for a longer 
time and are not the result of migrations in the last century. 

When speaking of minority languages, a common problem is the distinction be- 
tween dialect and language. Reasons for naming a variety a language can be politi- 
cal and economic, as well as on grounds of group identity. But, as will be shown 
below for Rhaeto-Romance, there are often enough even linguistic arguments both 
for and against the decision for assigning to a given variety the status of a lan- 
guage — with no definite conclusion to be drawn. 


EL Overview of languages 


Itis difficult to list all minority languages spoken in Europe since there are so many 
approaches to classify them. For example, one can sort by country and list the 
minority languages spoken in each of them. This is shown (unfortunately, in an in- 
complete way) in Poche (2000: 10-11), who includes new minorities and dialectal 
varieties as well as minorities belonging to language communities of adjacent 
countries. Haarmann (1972: 5-7) classifies by language family. He enumerates 
forty small languages with less than four and a half million speakers (Haarmann 
1972: 19). Obviously, not all of the languages listed by Haarmann are minority lan- 
guages as considered in the Charter, e.g. Icelandic, Slovak, Latvian or Lithuanian. 


1.2. Reasons for being a minority 


Every language community can become a minority, due to either emigration, or the 
fixing or shifting of political (and sometimes also ideological) borders, which more 
or less separate the community from their related language spoken in their home 
country. Following this scenario, the features of the language can change in such 
a way that a new language emerges. This happened in the Middle Ages with Irish 
Gaelic (Old Irish), when Gaelic people settled in Scotland and on the Isle of Man. 
Divided by the Irish Sea, the language developed differently in Ireland than it did 
in Scotland and on the Isle of Man. Now, all of them are minority languages, with 
Irish enjoying the status of first national language of Ireland, Scots Gaelic a re- 
gional language, and Manx a revived minority language. 

Other language communities have always been small communities as long as 
there has been a record of them, but they became a minority at the moment another 
language received official language status. Franco-Provencal, in the Aosta Valley 
in Italy, was part of a broader language group: however, when Italian became the 
first national language in Italy, Franco-Provencal became a minority language. 
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1.3. Small languages and minorities 


Not every small language is a minority language. For example, Luxembourgish is 
the mother tongue of about 300,000 people in the grand duchy of Luxembourg and 
is one of the official languages besides French and German. Another example is 
Faroese. A small language (with 60,000 to 100,000 native speakers) which apart 
from an emigrant group in Copenhagen, is only used on the Faroes. However, it is 
the main language on the islands and still very much alive. It has never been a mi- 
nority, even though Danish was the written language up to the 19th century. Irish, 
on the other hand, is the first national language of the Republic of Ireland, but the 
number of speakers has declined in most parts of the country. Even in the Irish- 
speaking areas, English (especially in the counties Donegal, Galway and Mayo) 
has become more and more important. Depending on the formulation of census 
questions, between 20,000 and 70,000 native speakers can be counted. Most Irish 
people have knowledge of the Irish language from school, but since it is taught in a 
similar way that Latin is taught, an active command of Irish is usually not acquired. 
Today, there are no monolingual speakers of Irish left. 

Almost all minority languages belong to a geographical area in which they are 
the only language spoken or, at least, a language spoken in the majority of contexts 
and situations, some even as a written language. However, with respect to the 
national linguistic context, they still qualify as a minority language. Many of the 
minority languages do not have any standard or written form, and it is more or less 
natural for the speakers to use another language for written communication. The 
development of standards and written languages for non-standardized languages is 
a complicated process and often not widely accepted. Possible reasons for this are 
the conflict between different traditions, (e.g., traditional spellings, spellings of fa- 
mous authors in the minority language, etc.), or the dialectal diversity of a lan- 
guage. For some languages, the development has only just started, as is the case 
with Ladin and the development of the Ladin Dolomitan. In other cases, like Bre- 
ton, competing standards are used. 


2. Scenarios 


Different scenarios concerning minority languages can be distinguished on the 
basis of the varying degrees of genetic relationship between minority and majority 
languages. Minority languages are always in a contact situation with one or more 
majority languages and thus the degree of structural relatedness can (but need not) 
be of importance to the degree of interference between the languages. Of course, 
there are other possible descriptions of minorities. For instance, different scenarios 
can be distinguished for old, indigenous minorities and new, immigrant minorities. 
Anderson (1989: 121—127) offers the following seven subdivisions of indigenous 
minority situations using a socio-political framework: 
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1) Language minority situated in its own ‘homeland’ territory (e.g. Occitan) 

2) Language minority representing the linguistic majority of a neighbouring 
country (e.g. Flemish in France) 

3) Minorities as in 2) on either side of a political border (e.g. Danish/German 
border varieties) 

4) Minorities in an indigenous region spreading across more than one country 
(e.g. Basque) 

5) Widely scattered indigenous minorities (e.g. Armenian, Jewish) 

6) "Revival"-Languages (e.g. Cornish) 

7) Minorities within a minority region (e.g. Ladin in the German speaking prov- 
ince in Italy) (Anderson 1989: 121—127) 


Similarly, Anderson subdivides the immigrant minority settings into seven sub- 
divisions, but these need not concern us here. 

Edwards (1989: 137—150) discusses other possible typologies of minority lan- 
guages. One typology, for instance, is based on the parameters unique, non-unique 
and /ocal-only. The latter two are minorities in one or more countries, but the lan- 
guage is a majority in some other country, as is the case for Greek or Albanian. Lan- 
guages belonging to the first type are minority languages spoken in one country 
only, and in the second type they are spoken in two or more countries. The distinc- 
tion adjoining and non-adjoining describes whether the linguistic communities of 
the minority language in different states are in contact with each other or not. Fin- 
ally, cohesive and non-cohesive describe the local connectedness of speakers with 
one state. Ten different settings can be established and used for typologies of both 
indigenous and immigrant minorities (Edwards 1989: 141—143). Edwards suggests 
the following ten extra-linguistic factors for classifying minorities: demography, 
sociology, psychology, history, politics, geography, education, religion, economics, 
and media. All of these are normally included when describing minority language 
settings. Below an alternative view will be presented which is based on the kind of 
linguistic relationship of the minority language and the contact languages. 


2.1: Minority languages closely related to the contact languages 


A typical setting is that a minority language and a majority language are closely re- 
lated to each other in that they are members of the same branch of a language 
family, for example, both are Romance or Germanic languages. In earlier times, 
they might have been two neighboring dialects or related languages, one of which 
later became a language with high prestige and the function of a national language. 
A famous example is the fate of Oscan and Umbrian, whose neighboring dialect 
from Latium became the language of a large empire, known to us as Latin. In sec- 
tion 4, we shall demonstrate this situation with the examples of Rhaeto-Romance 
and Frisian. Another case in point is the development of Middle English in the late 
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Middle Ages. The variety of English used in Scotland declined after a period of 
high prestige when the kingdoms of Scotland and England where united. In the 
Lowlands, the variety survived up to now as Scots, which is to be distinguished 
from Scottish English (cf. Bergs 2001). 


2.2. Minority languages related to the contact languages 


In this scenario, the minority and the majority language are members of the same 
language family, such as Indo-European, but not of the same branch. This is the 
case for all the Celtic languages, which are nowadays only in contact with Indo- 
European languages, i.e., English on the British Isles and French in Brittany. In 
times of greater expansion of Celtic, there might have been contacts with less or 
even unrelated languages, such as Basque. 

Upper and Lower Sorbian in Germany also fall into this category, as Slavic lan- 
guages in a Germanic-speaking country. The Upper and Lower Sorbian commu- 
nities are minorities in Germany (with 20,000 and 40,000 native speakers, respect- 
ively) and their languages are widely used in education and public contexts. For 
instance, the official websites of the cities of Bautzen and Cottbus are available in 
both Sorbian and German. The long established contact between German and Sor- 
bian has affected the supra-segmental properties of Sorbian. The intonation of 
many speakers sounds similar to German, even though on a segmental level the 
contact did hardly influenced the sounds of Sorbian (Faßke 1997: 1795). 


23: Contact languages belong to other language families 
than the minority language 


The third and last scenario occurs where the minority language and the contact lan- 
guage belong to completely different language families. This is the case for Basque, 
an isolated language, which is in contact with the Indo-European languages Span- 
ish, French, and Gascon. 


2.4. Non-areal minorities 


A special situation arises for languages of minorities without an indigenous area 
where the speakers live, such as Romani. When speakers of such a language move 
around a country, for instance Spain, it means they will get in contact not only with 
Spanish, but with Catalan, Basque and Galego, and all of these languages can have 
an influence on the minority language (see also chapter 13 by Matras, this volume). 
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3. The fate of minority languages 


3.1: Extinct minority languages 


For many old languages of Europe we cannot give a full account of whether they 
had minority status or not. Some old languages are only known to us as place 
names or sparse relicts. Those provide little information on the size of the lin- 
guistic community and the use of the particular languages. Thus, we cannot answer 
the question whether Lepontic, for instance, was a minority language in Northern 
Italy or not. 

Norn, a language derived from Old Nordic, was spoken in parts of Scotland, the 
Orkney and Shetland Islands. When, in the late 15th century, these islands became 
part of Scotland, the language was gradually displaced by Gaelic, Scots and later 
on by Scottish English. In the 18th century, Norn still existed as a language used in 
family communication, however, during the 20th century the last 10,000 Norn 
words still surviving were replaced by English words (Haugen 1976: 233). 


3:2: Nearly extinct minority languages 


Cimbrian is a relict of Germanic in two villages in the province of Veneto, and in 
one village in the province of Trentino. Only in the village Lusérn, in Trentino, is it 
used for communication by about 84 % of the population; in the other villages it is 
a language of cultural tradition, but not used in communication. These villages are 
situated at the end of a valley and on a plateau, i.e., in isolated areas. The exact area 
where Cimbrian was spoken in former times is not clearly defined; historical docu- 
ments mention a group of Germanic speaking people roughly in this area. The now 
existing enclaves are the last remains of these settlements. The origin of Cimbrian 
is not certain either. Currently, there are three competing theories on its origin. 
Some see Cimbrian as a distinct Germanic idiom. Others see it as a remnant from 
upper German settlements, and still others as a remnant from the Langobards. 
Further opinions do not agree with any of these theories (Bidese 2004: 14—32). 

In the Baltic area, we find the last remains of Livonian, a Finno-Ugric language 
spoken only by about a dozen elderly speakers in Latvia. Until the 18th century, Li- 
vonian used to have a wider distribution along the Baltic coast, but almost died out 
during the period in which Latvia was part of the USSR. The Livonian coast was a 
restricted area and the Livonians had to emigrate into the cities. The language is 
now acknowledged as an indigenous language of Latvia. 

In the south of Italy, there are settlements of people speaking Albanian. Of 
about a hundred formerly existing villages, forty-five remain. The communities go 
back to a period (15th to 18th century) when Albanians settled in the south of Italy. 
The speakers are bilingual but they show signs of a complete language shift to 
Italian. In the lexicon, almost all words are replaced by Calabrian idioms. Change 
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in morphology and assimilation of sounds to the Calabrian system are other signs 
of the abandonment of typical Albanian features. From a cultural perspective, 
it may be noted that the communities changed their confession from Orthodox to 
Catholic. 

The Ethnologue (www.ethnologue.org) lists six nearly extinct languages 
for Europe: Karaim in Lithuania, Livonian (Liv) in Latvia, Lower Silesian in Po- 
land, Germany and the Czech Republic, Romano-Greek in Greece, different dia- 
lects of Saami in Sweden, Norway and Russia, and Vodish (Vod) in Russia. A prob- 
lem with the subdivision in the Ethnologue is that dialects are sometimes listed as 
separate languages. 


3.3. Revived minority languages 


An interesting situation can be observed in Cornwall and on the Isle of Man. There, 
the native Celtic languages Cornish and Manx died out (the last native speaker of 
Manx died in 1974, the last native speakers of Cornish died around 1800). For a 
while, they existed only as literary languages. However, for different reasons, 
people revived these languages and tried to re-establish them. After long debates 
about which form of Cornish would be the correct one, a traditional spelling re- 
flecting the close relationship to Breton was finally settled on. Due to the discon- 
tinued oral tradition, the pronunciation of Neo-Cornish is heavily influenced by 
English (to prove that there is no problem with that, it was argued that West-Corn- 
wall English reflects the sounds of late Cornish anyway). Even though there are 
language courses and initiatives for teaching Cornish in school, the number of 
speakers is still only about 200. Yet one can find bilingual road signs in West Corn- 
wall. For Manx, there were tape recordings from the last speakers which made it 
easier to reconstruct the sounds of spoken Manx. As with Cornish, a rich literature 
exists for Manx, which serves as an important source for the revitalization of the 
language (Thompson 1992, Thomas 1992). 


3.4. Special scenario: Minorities within minorities 


A minority language which is spoken within another minority community consti- 
tutes a special case. A minority in Italy is Catalan, spoken on the island of Sardinia. 
The Catalan community, which goes back to the 14th century, consists of almost 
the complete population of the city of Alghero. Catalan was used merely as a 
spoken language, but now it is acknowledged as a minority language in the print 
media as well. It has undergone various changes which differentiate it from the 
Catalan of Catalonia. But as Sardic is another minority language in Italy, Catalan is 
a minority within another minority. This scenario also exists in mainland Italy, 
where most of the Ladin-speaking valleys in the north are surrounded by the south 
Tyroleans, which is a German-speaking minority. 
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The case of East Frisian described in 4.2 below is similar. A major contact, at 
least for the older generation, exists with Low German, which itself constitutes a 
minority language in the north of Germany. The same applies to the situation of 
North Frisian. 


4. Surviving minority languages: Two examples 


4.1. Rhaeto-Romance 


Rhaeto-Romance is divided into Swiss Rhaeto-Romance in the Swiss canton of 
Graubiinden, with about 40,000 to 50,000 speakers, Ladin in Trentino-Alto-Adige 
in Northern Italy, with about 10,000 to 30,000 speakers, and Friulian in the prov- 
ince of Udine in northeastern Italy, with 400,000 to 500,000 speakers. The question 
whether this is one language with three dialects or sub-languages, or rather a 
Northern Italian dialect continuum, was and is discussed as the questione ladina 
and will not be debated here (cf. Haiman and Beninca 1992: 25, Kaiser 2001). 
However, some unifying features can be noted: 


(1) Plural endings in -s: 
Lat stella ‘star’; Rmns staila, PL. stailas; Ladin stéra, PL. stéras; Frin stele, 
Stéles, 


(2) Conservation of -/- where in Italian it changed to -j- (Lat clavis ‘key’Rmns 
clay; Ladin tlé; Frln cläf), 


(3) Palatalization of g and c before a (Lat calida ‘warm’; Rmns chauda, Ladin 
Cálda; Fıln cdlde, Lat gattu ‘cat’; Rmns giat; Ladin iat(giat); Prin gjat), as to 
be found in Gallo-Romance as well. 


Swiss Rhaeto-Romance and Ladin are in contact with German, and all three of 
them are in contact with Italian and local dialects of Italian, i.e., Lombardic (for 
Swiss Rhaeto-Romance), the dialects of the Trentino (for Ladin), and the dialects 
of the Veneto (for Friulian). The Rhaeto-Romance languages share features with 
the adjacent dialects of Italian. Thus, most Swiss Rhaeto-Romance dialects (apart 
from Surselvian) have rounded front vowels, as do other northern Italian dialects. 

The languages are spoken in villages and small towns in the Alps and there 
exist many geographic separations of communities living in different valleys, so 
that there is no continuous linguistic community. All of the Rhaeto-Romance lan- 
guages are subdivided into dialects. There is no single standard for each of the 
three languages. In Switzerland, the suggested Rumantsch Grischun, a constructed 
standard for written language, is quite successful. For Ladin, a similar process is 
going on with the development of Ladin Dolomitan. Other writing traditions for 
dialects or smaller units are still in use. 
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Swiss Rhaeto-Romance has the status of an official language of Switzerland, 
whereas Ladin and Friulian are recognized as regional languages in Trentino-Alto- 
Adige (Ladin) and Friuli- Venezia-Giulia (Friulian). 

Rhaeto-Romance is represented in the print media and radio programmes, 
sometimes in television, but German and Italian media are omnipresent. Literature 
can be found in all of the Rhaeto-Romance languages. It has a long tradition and 
some regional written forms do exist. The languages are taught at school, but not 
throughout all the communities and not at all levels. In some communities, the lan- 
guages show great vitality, in others they are only used in private communication. 
In official communication, everyone should be allowed to use Rhaeto-Romance; 
however, the reality often looks different. 


4.2. Frisian (East Frisian, Saterlandic) 


In the region called Eastern Frisia (Ostfriesland), the only languages spoken today 
are Low and High German. The last speaker of East Frisian died in the 1950s. 
However, further to the south, there is still a Frisian speaking population. East Fri- 
sians emigrated in the Middle Ages, when severe floods drowned a large area of the 
coast, and settled in a boggy land surrounded by Westphalian peoples. Isolated by 
language, religion, and poor traffic conditions, the language survived. The area 
where the settlements were founded is called the Saterland and the East Frisian lan- 
guage there is called Saterlandic (Seeltersk). 

Today, some 1,000 to 2,000 speakers use it more or less frequently on a daily 
basis. In many families, intermarriage with non-Frisian partners from the sur- 
rounding villages led to a language shift to Low German. For some time after the 
Second World War, the tradition of the language was interrupted, resulting in the 
fact that almost a complete generation learned Saterlandic exclusively from their 
grandparents. Influence of Low and High German is seen in, for example, final de- 
voicing (Auslautverhdrtung), vocalization of postvocalic [r], and change to a uvu- 
lar realization when [r] is not vocalized. New allophones came into the Frisian sys- 
tem, such as /g/ for the /x/-phoneme after front vowels. In each of the three main 
villages of the community (Scharrel, Ramsloh, Strücklingen) there are some dia- 
lectal differences, especially in the phonology and lexicon. 

There is no standard for written Saterlandic, although some traditions deve- 
loped from local authors or from linguistic descriptions and references. Most 
speakers use it in written form only for private letters; otherwise the favoured 
written language is Standard German. There is a rare presence of Saterlandic in the 
print media, e.g., once in a while a short column in a local newspaper, competing 
for space with Low German. Cultural tradition is very vivid. Some contacts to the 
other Frisian languages in Northern Germany (North Frisian) and in the province 
Fryslan in the Netherlands (West Frisian) have been established. But, for the vast 
majority of the population, the distances for frequent meetings are too great. Ef- 
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forts to publish translated works in printed media, such as bilingual editions (North 
Frisian and Saterlandic), are rare since the Saterlandic readership is often too small 
for the publishers. In everyday use, Saterlandic competes with both Low German 
and High German. During the last 60 years, especially after the Second World War, 
the influx of people without knowledge of either Frisian or Low German was so 
significant that one can no longer expect to be understood when speaking Sater- 
landic in a shop or public office. 

Saterlandic was not taught in school for a long period. In the 1990s, it was only 
offered as an elective subject, mainly in the hours after the official end of regular 
school lessons. During the last few years, offers have been extended and efforts 
have been made to teach Saterlandic in normal classes. However, there is still a 
lack of fully educated teachers with appropriate knowledge of the language. Sater- 
landic is a recognized regional language of Germany (Tróster 1995, Wirrer 2000). 


5. Language politics 


Language politics plays an important role. The times of repression of minority lan- 
guages are over. This does not mean, however, that the use and tradition of them 
has become a major concern for official national institutions. In the European 
Union, for example, there are institutions such as the Bureau of Lesser Used Lan- 
guages, and within the ‘European Charter for Regional and Minority Languages’, 
there is a framework for protecting and supporting regional and minority lan- 
guages, also for non-European Union countries. But not all member states have 
signed the European Charter yet, and not all that signed it have put it into national 
law. Italy and France are two important members with a number of regional and 
minority languages that have signed, but not yet ratified, the Charter. 


6. The role of linguistics 


What is the role of linguistics for the situation of minority and majority languages? 
Linguists have the tools and competence for describing languages. Linguistic 
description is exact and comparable, thus on the basis of solid linguistic facts the 
relationship between minority and majority languages, as well as between two or 
more minority languages, can firmly be established. For some communities, it 
might be of economic importance to identify linguistic facts and provide scientifi- 
cally sound arguments helping to support their plea for having their language ac- 
knowledged as a minority language, in order to become eligible for receiving pub- 
lic or European money for language-cultural projects. Therefore, linguists can 
assist language communities in the documentation of their languages, as well as in 
the development of didactic material, which is very important for teaching both 
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young and old non-native speakers in order to preserve the survival of the lan- 
guages. As the case of Rumantsch Grischun in Switzerland has shown, suggestions 
by linguists concerning the development of a written language that is accepted by a 
wide variety of speakers of different dialects can be effective. Major parts of the 
Swiss Rhaeto-Romance language community accept it for written use, probably 
because it was designed to fit most dialects of Rhaeto-Romance, without giving 
any of them a clear preference. 


See also the following chapters in this volume: 1 by Holvoet, 2 by Ronan, 3 by As- 
kedal, 4 by Kabatek and Pusch, 5 by Hansen, 10 by Haase, 16 by Tomić, 18 by 
Sanso, 29 by Darquennes. 
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25  Theimmigrant minority languages of Europe 


Guus Extra 


1. Regional minority languages and immigrant minority languages 

2. Immigrant minority groups and immigrant minority languages across Europe 

3. The role of language in identifying population groups in multicultural nation-states 
4. Immigrant minority languages at school 

5: The European discourse on plurilingualism 

1. Regional minority languages and immigrant minority languages 


Linguistic diversity is conceived as a constituent characteristic of Europe’s identity 
(Arzoz 2008). However, some languages play a more important role in the Euro- 
pean public and political discourse on “celebrating linguistic diversity”, the motto 
of the first European Year of Languages (2001). The constellation of languages in 
Europe actually functions as a descending hierarchy (Extra and Gorter 2008; Nic 
Craith 2006) with the following ranking: 


English as lingua franca for transnational communication; 
national or “official state” languages of European nation-states; 
regional minority (RM henceforward) languages across Europe; 
immigrant minority (IM henceforward) languages across Europe. 


In the official EU discourse, RM languages are referred to as regional or minority 
languages and IM languages as migrant languages. Both concepts are problematic 
for a variety of reasons. Whereas the national languages of the EU with English in- 
creasingly on top are celebrated most at the EU level, RM languages are celebrated 
less and IM languages least. IM languages are only marginally covered by EU lan- 
guage promotion programmes and — so far — are mainly considered in the context 
of provisions for learning the national languages of the “migrants’ countries of 
residence". 

Europe's identity is to a great extent determined by cultural and linguistic 
diversity. At the EU level, there are large differences in population size amongst 
EU member-states. German, French, English, Italian, Spanish, and Polish belong, 
in this order, to the six most widely spoken official state languages in the present 
EU, whereas Turkish would come second to German in an enlarged EU. There is 
also a close connection between the reference to nation-states and the reference to 
official state language. In 28 out of 30 EU (candidate) countries, distinct languages 
are the clearest feature distinguishing one nation-state from its neighbours (Bar- 
bour 2000), the only exceptions being Belgium and Cyprus. This match between 
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nation-state references and official state language references obscures the exist- 
ence of different types of "other" languages that are actually spoken across Euro- 
pean nation-states. Many of these languages are indigenous minority languages 
with a regional base; many other languages stem from abroad without such a base. 
As mentioned before, we will refer to these languages as RM and IM languages, re- 
spectively (Extra and Gorter 2008). 

A number of things need to be kept in mind. First of all, within and across EU 
member-states, many RM and IM languages have larger numbers of speakers than 
many of the official EU state languages. Moreover, RM and IM languages in one 
EU nation-state may be official state languages in another nation-state. Examples 
of the former result from language border crossing in adjacent nation-states, such 
as Finnish in Sweden or Swedish in Finland. Examples of the latter result from pro- 
cesses of migration, in particular from Southern to Northern Europe, such as Por- 
tuguese, Spanish, Italian or Greek. It should also be kept in mind that many, if not 
most, IM languages in particular European nation-states originate from countries 
outside Europe. It is the context of migration and minorisation in particular that 
makes our proposed distinction between RM and IM languages ambiguous. We 
see, however, no better alternative. In our opinion, the proposed distinction leads at 
least to awareness raising and may ultimately lead to an inclusive approach in the 
European conceptualisation of minority languages (Extra and Gorter 2008). 

There have always been speakers of IM languages in Europe, but these lan- 
guages have only recently emerged as community languages spoken on a wide 
scale in urban Europe, due to intensified processes of international migration and 
intergenerational minorisation. Turkish and Arabic are good examples of so-called 
“non-European” languages that are spoken and learned by millions of inhabitants 
of the EU. Although IM languages are often conceived of and transmitted as core 
values by IM language groups, they are less protected than RM languages by af- 
firmative action and legal measures in, for example, education. In fact, the learning 
and certainly the teaching of IM languages are often seen by majority language 
speakers and by policy makers as obstacles to integration and as a threat to the 
national identity. 

At the European level, guidelines and directives regarding IM languages are 
scant and outdated. Despite the possibilities and challenges of comparing the status 
of RM and IM languages, amazingly few connections have been made in the so- 
ciolinguistic, educational, and political domains (Extra and Gorter 2008). As yet, 
we lack a common referential framework for the languages under discussion. 
There are no standardised designations for these languages across nation-states. 
Commonly, a distinction is made between so-called national/historical/regional/in- 
digenous/old minority languages versus non-national/non-historical/non-regional/ 
non-indigenous/new minority languages. These distinctions show the inclusive 
versus exclusive jargon for reference to RM and IM languages, respectively. The 
concept of "lesser used languages" has been adopted at the EU level; the European 
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Bureau for Lesser Used Languages (EBLUL), established in Dublin, speaks and 
acts on behalf of “the autochthonous regional and minority languages of the EU”. 

In this chapter, the focus will be on IM groups and IM languages across Europe 
(section 2), on the role of language in identifying population groups in multicultu- 
ral nation-states (section 3), on IM languages at school (section 4), and on the 
European discourse on plurilingualism (section 5). Both multidisciplinary and 
crossnational perspectives will be offered on two major domains in which lan- 
guage transmission occurs, i.e. the domestic domain and the public domain. The 
home and the school are typical of these domains. At home, language transmission 
occurs between (parents and) children; at school this occurs between (teachers 
and) pupils. Viewed from the perspectives of majority language versus minority 
language speakers, language transmission becomes a very different issue. In the 
case of majority language speakers, language transmission at home and at school 
are commonly taken for granted: At home, parents usually speak an informal var- 
iety of this language with their children, and at school the formal variety of this 
language is usually the only or major subject and medium of instruction. In the 
case of minority language speakers, there is usually a much stronger mismatch be- 
tween the language of the home and that of the school. Whether parents in such a 
context continue to transmit their language to their children is strongly dependent 
on the degree to which these parents, or the minority group to which they belong, 
conceive of this language as a core value of cultural identity. 


2. Immigrant minority groups and immigrant minority languages 
across Europe 


As a consequence of socio-economically or politically determined processes of mi- 
gration, the traditional patterns of language variation across Western Burope have 
changed considerably over the past few decades. A new pattern of migration 
started in the 1960s and the early 1970s, and it was mainly economically moti- 
vated. In the case of Mediterranean groups, migration initially involved contract 
workers who expected — and were expected — to stay for a limited period of time. 
As the period of their stay gradually became longer, this pattern of economic mi- 
gration was followed by a second pattern of social migration as their families 
joined them. Subsequently, a second generation was born in the immigrant coun- 
tries, while their parents often remained uncertain or ambivalent about whether to 
stay or to return to their country of origin. These demographic shifts over time have 
also been accompanied by shifts of designation for the groups under consider- 
ation — “migrant workers", “immigrant families", and “immigrant minorities", re- 
spectively (Barni and Extra 2008). 

As a result, many industrialised Western European countries have a growing 
number of IM populations which differ widely, both from a cultural and from a lin- 


470 Guus Extra 


guistic point of view, from the indigenous population. In spite of more stringent 
immigration policies in most EU countries, the prognosis is that IM populations 
will continue to grow as a consequence of the increasing number of political refu- 
gees, the opening of the internal European borders, and political and economic de- 
velopments in Central and Eastern Europe and in other regions of the world. In the 
year 2000, about one third of the population under the age of 35 in urbanised West- 
ern Europe had an immigration background. Within the various EU countries, four 
major IM groups can be distinguished: people from Mediterranean EU countries 
and Mediterranean non-EU countries, people from former colonial countries, and 
political refugees. Although free movement of migrants between EU member- 
states is legally permitted and promoted, most immigrants in EU countries orig- 
inate from non-EU countries. 

At the EU level, comparable crossnational data are available only on so-called 
“non-national” population groups in EU member-states (Poulain 2008; EuroStat 
2006). In 2004, their number was around 25 million, just under 5.5 % of the total 
EU population. The largest numbers of foreign citizens reside in Germany, France, 
Spain, the United Kingdom, and Italy. The citizenship structures of foreign popu- 
lations in the EU member-states vary greatly. As well as geographical proximity, 
the composition of the non-national population in each country, examined against 
the proportion of the five largest groups of non-nationals, strongly reflects their 
history, in particular labour migration, recent political developments and historical 
links. For example, the largest non-national groups include Turkish citizens in Ger- 
many, Denmark, and the Netherlands; citizens of former colonies in Portugal 
(citizens of Cape Verde, Brazil, and Angola) and in Spain (Ecuadorians and Mo- 
roccans); migrants from Albania in Greece; citizens from other parts of the former 
Yugoslavia in Slovenia; Czech citizens in Slovakia; and citizens from former So- 
viet Union countries in Estonia, Latvia, and Lithuania. 

Given the overwhelming focus on second language acquisition by IM groups, 
there is much less evidence on the status and use of IM languages across Europe as 
a result of processes of international migration and intergenerational minorisation. 
In contrast to RM languages, IM languages have no established status in terms of 
period and area of residence, nor do they have influential pressure groups at the EU 
level. Also in contrast to RM languages, hardly any systematically collected data 
are available on IM languages across European nation-states. Typological differ- 
ences between IM languages across EU nation-states do exist, e.g. in terms of the 
status of IM languages as EU or non-EU languages or as languages of former col- 
onies. Taken from the latter perspective, e.g. Indian languages are prominent in the 
United Kingdom, Maghreb languages in France, Congolese languages in Belgium, 
and Surinamese languages in the Netherlands. 

Most studies on IM languages in Europe have focused on a spectrum of IM lan- 
guages at the level of one particular multicultural and multilingual city (e.g. Baker 
and Eversley 2000), one particular nation-state (e.g. LMP 1985, Alladina and Ed- 
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wards 1991), or one particular IM language at the state or European level (e.g. Jor- 
gensen 2003). As yet, few studies have taken both a crossnational and crosslin- 
guistic perspective (e.g. Extra and Yağmur 2004). 


3. The role of language in identifying population groups 
in multicultural nation-states 


Collecting reliable information about the diversity of population groups in EU 
countries is no easy enterprise. What is, however, more interesting than numbers or 
estimates of the size of particular groups, are the criteria for determining such 
numbers or estimates. Throughout the EU it is common practice to present data on 
RM groups on the basis of (home) language use and/or ethnicity, and to present 
data on IM groups on the basis of nationality and/or country of birth. However, 
convergence between these criteria for the two groups emerges over time in terms 
of (home) language and/or ethnicity as a consequence of the increasing period of 
migration and minorisation of IM groups in EU countries. Due to their prolonged/ 
permanent stay, there is strong erosion in the utility of nationality or birth-country 
statistics (Barni and Extra 2008). 

Comparative information on population figures in EU member-states can be 
obtained from the Statistical Office of the EU in Luxembourg (EuroStat). An over- 
all decrease of the indigenous population has been observed in most EU countries 
over the last decades; at the same time, there has been an increase in the IM figures. 
For a variety of reasons, however, reliable and comparable demographic in- 
formation on IM groups in EU countries is difficult to obtain. Seemingly simple 
questions like How many Turkish residents live in Germany compared to France? 
cannot easily be answered. For some groups or countries, no updated information 
1s available or no such data have ever been collected. Moreover, official statistics 
only reflect IM groups with legal resident status. Another source of disparity is the 
different data collection systems being used, ranging from nation-wide census data 
to administrative (municipal) registers or to more or less representative statistical 
sample surveys (Poulain 2008). In addition, most residents from former colonies 
already have the nationality of their country of immigration. Most importantly, 
however, the most widely used criteria for IM status — nationality and/or country of 
birth — have become less valid over time because of an increasing trend towards 
naturalisation and births within the countries of residence. 

For a discussion of the role of censuses in identifying population groups in a 
variety of multicultural nation-states, we refer to Kertzer and Arel (2002). Alter- 
man (1969) offers a fascinating account of the history of counting people from the 
earliest known records on Babylonian clay tablets in 3800 BC to the USA census in 
1970. In addition to the methods of counting, Alterman discusses at length who 
were counted and how, and who were not counted and why. The issue of mapping 
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identities through nationwide periodical censuses by state institutions is commonly 
coupled with a vigorous debate between proponents and opponents about the fol- 
lowing "ethnic dilemma": How can you combat discrimination if you do not 
measure diversity? (Kertzer and Arel 2002: 23-25). Among minority groups and 
academic groups, both proponents and opponents of mapping diversity can be 
found: Proponents argue in terms of the social or scientific need for population data 
bases on diversity as prerequisites for affirmative action by the government in such 
domains as labor, housing, health care, education or media policies; opponents 
argue in terms of the social or scientific risks of public or political misuse of such 
data bases for stereotyping, stigmatization, discrimination or even removal of the 
"unwanted other". Kertzer and Arel (2002: 2) argue that the census does much 
more than simply reflect social reality; rather than merely reflecting it, it plays a 
key role in the construction of that reality and in the creation of collective iden- 
tities. At the same time, it should be acknowledged that the census is a crucial area 
for the politics of representation. Census data can make people aware of under-rep- 
resentation. Minority groups often make language rights one of their key demands 
on the basis of (home) language databases. 

Decennial censuses became a common practice in Europe and the New World 
colonised by Europeans in the first part of the 19th century. The USA became the 
first newly established nation-state with a decennial census since 1790. The first 
countries to include a language question in their census, however, were Belgium in 
1846 and Switzerland in the 1850s, both being European countries with more than 
one official state language. At present, in many EU countries, only population data 
on nationality and/or birth country (of person and/or parents) are available on IM 
groups. In 1982, the Australian Institute of Multicultural Affairs recognised the 
above-mentioned identification problems for inhabitants of Australia and proposed 
including questions in the Australian census on birth country (of person and par- 
ents), ethnic origin (based on self-categorisation in terms of which ethnic group a 
person considers him/herself to belong to), and home language use (see Fishman 
1989 and Edwards 1985 as key references on the linkage between language and 
ethnicity). 

In Table 1, the four criteria mentioned are discussed in terms of their major ad- 
vantages and disadvantages. First of all, Table 1 reveals that there is no simple road 
to solving the identification problem. Moreover, inspection of the criteria for 
multicultural population groups is as important as the actual figures themselves. 
Seen from a European perspective, there is a top-down development over time in 
the utility and utilisation of different types of criteria, inevitably going from 
nationality and birth-country criteria in present statistics to self-categorisation and 
home language in the future. The latter two criteria are generally conceived as 
complementary criteria. Self-categorisation and home language references need 
not coincide since languages may be conceived to variable degrees as core values 
of ethnocultural identity in contexts of migration and minorisation. 
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Table 1l. Criteria for the definition and identification of population groups in a multicul- 
tural society (Extra and Gorter 2008: 17) 

Criterion Advantages Disadvantages 

Nationality © objective e (intergenerational) erosion 

(NAT) * relatively easy to establish through naturalisation or 

(P/F/M z person/ double NAT 

father/mother) * NAT not always indicative of 


ethnicity/identity 

some (e.g. ex-colonial) groups 
have NAT of immigration 
country 


Birth country 
(BC) 
(P/F/M) 


objective 
relatively easy to establish 


intergenerational erosion 
through births in immigration 
country 

BC not always indicative of 
ethnicity/identity 
invariable/deterministic: does 
not take into account dynamics 
in society (in contrast to all 
other criteria) 


Self-categorisation/ 
Ethnicity 
(SC) 


touches the heart of the 
matter 

emancipatory: SC takes 
into account person’s own 
conception of ethnicity/ 
identity 


subjective by definition: also 
determined by the language/ 
ethnicity of interviewer and by 
the spirit of the times 


e multiple SC possible 
e historically charged, especially 


by World War II experiences 


Home language 
(HL) 


HL is significant criterion 
of ethnicity in communi- 
cation processes 

HL data are prerequisite 
for government policy 

in areas such as public 
information or education 


complex criterion: who speaks 
what language to whom and 
when? 

language is not always a core 
value of ethnicity/identity 
useless in one-person house- 
holds 


In 23 out of 27 EU countries, nationwide censuses are held at variable intervals. 
Scandinavian countries and the Netherlands rely on yearly updated administrative 
(municipal) registers in combination with periodical sample surveys. Questions on 
ethnicity and language feature in the censuses of 13 and 17 EU countries, respect- 
ively (Extra and Gorter 2008: 18-21). There is, however, wide variation in the 
operationalisation of language questions, which makes crossnational comparison 
of the outcomes difficult. The three most commonly asked questions on language 
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use relate to mother tongue (11 countries), (other) language(s) spoken (frequently) 
(6 countries), and language(s) (most frequently) spoken at home (5 countries). 
While the mother tongue question is most widely used in Europe, this question has 
been criticised for its lack of validity in English-dominant immigration countries 
outside Europe, in particular in Australia, Canada, and the USA (Extra and Gorter 
2008: 21—24). In these countries, the most widely used language question in the 
census refers to language(s) used at home — next to or instead of English. Whereas 
the focus in these non-European English-dominant countries is on immigrant lan- 
guages, the focus in EU countries — if non-national languages are considered at all — 
is on RM languages. The United Kingdom 2011 Census has for the first time a 
LOTE question (Languages Other Than English) with an inclusive perspective on 
both RM and IM languages. Unfortunately, in spite of the UK's leading role 
in such data gathering within the context of EU countries, the UK question is 
focused on the informant's main language rather than on the main language(s) used 
at home. A question about the informant's main language is very ambiguous be- 
cause of its lack of domain specification and it will make crosscontinental compari- 
sons of LOTE outcomes difficult. 

At this stage, far less empirical data on IM languages are available across Euro- 
pean nation-states than in English-dominant immigration countries outside Europe. 
As a result, language policies in this domain in Europe are commonly developed in 
the absence of even the most basic language facts (Extra and Gorter 2008). 


4. Immigrant minority languages at school 


Across Europe, vast contrasts can be observed in the status of IM languages at 
school, depending on particular nation-states, or even particular federal states 
within nation-states (as in Germany), and depending on particular IM languages, 
being official state languages in other EU countries or not. Most commonly, IM lan- 
guages are not part of mainstream education. In the United Kingdom, for example, 
IM languages are not part of the so-called "national" curriculum, and they are dealt 
with in various types of so-called “complementary” education at out-of-school 
hours (e.g. Martin et al. 2004). Being aware of crossnational differences in deno- 
tation, we will use the concept community language teaching (henceforward CLT) 
when referring to this type of education. Our rationale for the CLT concept rather 
than the concepts mother tongue teaching or home language instruction is the in- 
clusion of a broad spectrum of potential target groups. First of all, the status of an IM 
language as "native" or *home" language is subject to change through intergener- 
ational processes of language shift. Moreover, in secondary education, both minor- 
ity and majority pupils are often de iure (although seldom de facto) admitted to CLT. 

From a historical point of view, most EU countries show a similar chronological 
development in their argumentation in this domain. CLT was generally introduced 


The immigrant minority languages of Europe 475 


into primary education with a view to family remigration. This objective was also 
clearly expressed in Directive 77/486 of the European Community, on 25 July 1977. 
The Directive focused on the education of the children of *migrant workers" with 
the aim "principally to facilitate their possible reintegration into the Member-State 
of origin". As is clear from this formulation, the Directive excluded all IM children 
originating from non-EU countries, although these children formed and still form 
the large part of IM children in European primary schools. During the 1980s, the 
above argumentation for CLT was increasingly abandoned. Demographic develop- 
ments showed no substantial signs of families remigrating to their source countries; 
instead, processes of family reunion and family formation took place in the target 
countries. This development resulted in a conceptual shift and CLT became pri- 
marily aimed at combating disadvantages. It had to bridge the gap between the home 
and school environment, and to encourage school achievement in “regular” subjects. 
Since such an approach tended to under-appreciate ethno-cultural dimensions, some 
countries began to emphasize the intrinsic importance of CLT in various respects: 


e incultural respects, CLT contributes to maintaining and advancing a pluralist 
society, in correspondence with the fact that many IM groups consider their 
own language of core value to their cultural identity; 

e inlegal respects, CLT meets the internationally recognised right to language 
development and language maintenance; 

e ineconomic respects, CLT leads to an important pool of profitable knowledge 
in societies which are increasingly internationally oriented. 


Table 2 gives a crossnational summary of the outcomes of a comparative study of 
nine parameters of CLT in primary and secondary education in six EU countries 
(Extra and Yağmur 2004). A comparison of all nine parameters makes clear that 
CLT has gained a higher status in secondary schools than in primary schools. In 
primary education, CLT is generally not part of the “regular” or “national” curricu- 
lum and, consequently, it tends to become a negotiable entity in a complex and 
often opaque interplay of forces by a variety of factors, in contrast with other cur- 
ricular subjects. Another remarkable fact is that, in some countries (particularly 
France, Belgium, Spain, and some German federal states), CLT is funded by the 
consulates or embassies of the countries of origin. In these cases, the national gov- 
ernment does not interfere in the organisation of CLT, or in the requirements for 
and the selection and employment of teachers. A paradoxical consequence of this 
phenomenon is that the earmarking of CLT budgets is often safeguarded by the 
above-mentioned consulates or embassies. National, regional or local governments 
often fail to earmark budgets so that funds meant for CLT may be appropriated for 
other educational purposes. It should be mentioned that CLT for primary school 
children in the Netherlands has been completely abolished in the school year 
2004/2005, resulting in Dutch-only education in multicultural and multilingual 
primary schools (Extra and Yağmur 2006). 
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Table 2: 


Guus Extra 


Status of CLT in European primary and secondary education, according to 


9 parameters in 6 countries (Sw/G/N/B/F/Sp = Sweden/Germany/Netherlands 
until 2004/Belgium/France/Spain) (Extra and Yağmur 2004: 385) 


CLT parameters 


Primary education 


Secondary education 


1 Target IM children in a narrow vs. broad ® de iure: mostly IM pupils; 
groups definition in terms of sometimes all pupils (in 
© the spectrum of languages taught particular N) 
(Sp vs. N B F vs. G Sw) e de facto: IM pupils in a narrow 
* active language use and basic vs. broad sense (see left) 
language proficiency (limited participation, in 
(Sw F vs. GN B Sp) particular B Sp) 

2 Arguments mostly in terms of astruggle against mostly in terms of multicultural 
deficits, rarely in terms policies, rarely in terms of deficits 
of multicultural policies (all countries) 

(N B vs. other countries) 

3 Objectives rarely specified in terms of specified in terms of oral and 
(meta)linguistic and (inter)cultural written skills to be reached at 
skills interim and final stages 
(Sw G Sp vs. N B F) (all countries) 

4 Evaluation mostly informal/subjective through  formal/objective assessment plus 
teacher, rarely formal/objective school report figures 
through measurement and school (SwGNvs. BF Sp) 
report figures 
(Sw G F vs. B N Sp) 

5 Minimal specified at the level of classes, specified at the level of classes, 

enrolment schools, or municipalities schools, or municipalities 
(Sw vs. G B F vs. N Sp) (Sw N vs. other countries) 
6 Curricular e voluntary and optional e voluntary and optional 
status e within vs. outside regular school e within regular school hours 
hours * one/more lessons per week (all 
(G N Sp vs. S B F) countries) 
e 1-5 hours per week 
7 Funding e by national, regional or local e by national, regional or local 
educational authorities educational authorities 
* by consulates/embassies of * by consulates/embassies of 
countries of origin countries of origin 
(Sw N vs. B Sp, mixed G F) (Sw N F vs. B Sp, mixed G) 

8 Teaching e from countries of residence e from countries of residence 

materials e from countries of origin e from countries of origin 


(Sw GN vs. BF Sp) 


(Sw GN F vs. B Sp) 
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9 Teacher e from countries of residence e from countries of residence 
qualifications ¢ from countries of origin e from countries of origin 
(Sw GN vs. B F Sp) (Sw G NF vs. B Sp) 


The higher status of CLT in secondary education is largely due to the fact that in- 
struction in one or more languages other than the national standard language is 
a traditional and regular component of the (optional) school curriculum, whereas 
primary education is highly determined by a monolingual mindset or habitus 
(Gogolin 1994). Within secondary education, however, CLT must compete with 
“foreign” languages that have a higher status and a longer tradition. 

There is a great need for educational policies in Europe that take new realities 
of multilingualism into account. Processes of internationalisation and global- 
isation have brought European nation-states to the world but they have also 
brought the world to European nation-states. This bipolar pattern of change has led 
to both convergence and divergence of multilingualism across Europe. On the one 
hand, English is on the rise as the lingua franca for international communication 
across the borders of European nation-states at the cost of all the other national 
languages of Europe, including French. In spite of many objections against the 
hegemony of English (Phillipson 2003), this process of convergence will be 
enhanced by the extension of the EU to Eastern Europe. Within the borders of 
European nation-states, however, there is an increasing divergence of home lan- 
guages due to large-scale processes of global migration and intergenerational mi- 
norisation. The call for differentiation of the monolingual mindset of primary 
schools across Europe originates not only bottom-up from IM parents or organi- 
sations, but also top-down from supranational institutions, which emphasize the in- 
creasing need for European citizens with a transnational and multicultural affinity 
and identity. Plurilingual competencies are considered prerequisites for such an af- 
finity and identity. 


5. The European discourse on plurilingualism 


Both the European Commission and the Council of Europe have published many 
policy documents in which language diversity is cherished as a key element of the 
multicultural identity of Europe — now and in the future. Language diversity is con- 
sidered to be a prerequisite rather than an obstacle for a united European space in 
which all citizens are equal (but not the same) and enjoy equal rights (Council of 
Europe 2000). The maintenance of language diversity and the promotion of lan- 
guage learning and plurilingualism are seen as essential elements for the improve- 
ment of communication and for the reduction of intercultural misunderstanding. 

In a so-called Whitebook, the European Commission (1995) opted for triling- 
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ualism as a policy goal for all European citizens. Apart from the “mother tongue", 
each citizen should learn at least two “community languages". In fact, the concept 
of *mother tongue" implicitly referred to the national languages of nation-states 
and ignored the fact that for many inhabitants of Europe mother tongue and 
national language do not coincide. At the same time, the concept of “community 
languages" referred to the national languages of two other EU member-states. In 
later European Commission documents, reference was made to one foreign lan- 
guage with high international prestige (English was deliberately not referred to) 
and one so-called “neighbouring language". The latter concept related to neigh- 
bouring countries, never to next-door neighbours. UNESCO adopted the term 
"multilingual education" in 1999 (General Conference Resolution 12) for refer- 
ence to the use of at least three languages, i.e. the mother tongue, a regional or 
national language, and an international language in education. 

The heads of state and government of all EU member-states called upon the 
European Commission to take further action to promote plurilingualism across Eu- 
rope, in particular by the learning and teaching of two additional languages from a 
very young age (Nikolov and Curtain 2000). Further promotional activities of the 
European Commission may ultimately lead to an inclusive approach, in which IM 
languages are no longer denied access to Europe's celebration of language diver- 
sity. In particular, the plea for the learning of three languages by all EU citizens, 
the plea for an early start to such learning experiences, and the plea for offering 
a wide range of languages to choose from open the door to such an inclusive 
approach. Although this may sound paradoxical, such an approach can also be 
advanced by accepting the role of English as lingua franca for intercultural com- 
munication across Europe. Against this background, the following principles 
are suggested for the enhancement of plurilingualism at the primary school level 
(Extra and Gorter 2008: 45): 


1 In the primary school curriculum, three languages are introduced for all 
children: 

e the standard language of the particular nation-state (or in some cases a re- 
gion) as a major school subject and the major language of communication 
for the teaching of other school subjects; 

English as lingua franca for international communication; 

an additional third language opted from a variable and varied set of priority 
languages at the national, regional, and/or local level of the multicultural 
society. 


2 Theteaching of all these languages is part of the regular school curriculum and 
subject to educational inspection. 


3 Regular primary school reports contain information on the children's profi- 
ciency in each of these languages. 
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4 National working programmes are established for the priority languages re- 
ferred to under (1) in order to develop curricula, teaching methods, and teacher 
training programmes. 


5 Some of these priority languages may be taught at specialised language schools. 


This set of principles is aimed at reconciling bottom-up and top-down pleas in Eu- 
rope for plurilingualism, and is inspired by large-scale and enduring experiences 
with the learning and teaching of English (as L1 or L2) and one Language Other 
Than English (LOTE) program for all children in the State of Victoria, Australia 
(Extra and Yagmur 2004: 99-105). The Victorian School of Languages in Mel- 
bourne has led to an internationally recognised break-through in the conceptual- 
isation of plurilingualism in terms of making provisions feasible and mandatory 
for all children (including L1 English speaking children), in terms of offering 
a broad spectrum of LOTE provisions (in 2008, more than 40 languages were 
taught), and in terms of governmental support for these provisions derived from 
multicultural policy perspectives. 

When in the European context each of the earlier mentioned languages should 
be introduced in the curriculum and whether or when they should be subject 
or medium of instruction, has to be spelled out according to particular national, 
regional, and/or local demands. Derived from an overarching conceptual frame- 
work, priority languages could be specified in terms of both RM and IM minority 
languages for the development of curricula, teaching methods, and teacher training 
programmes. Moreover, the increasing internationalisation of pupil populations 
in European schools requires that a language policy be introduced for all school 
children, in which the traditional dichotomy between foreign language instruction 
for indigenous majority pupils and home language instruction for IM pupils is put 
aside. Given the experiences abroad (e.g. the Victorian School of Languages in 
Australia), language schools can become centres of expertise where a variety of 
languages are taught if the students' demand is low and/or spread over many 
schools. In line with the proposed principles for primary schooling, similar ideas 
could be worked out for secondary schools where learning more than one language 
is already an established curricular practice. The above-mentioned principles 
would recognise plurilingualism in an increasingly multicultural environment as 
an asset for all youngsters and for society at large. The EU, the Council of Europe, 
and UNESCO could function as leading transnational agencies in promoting such 
concepts. The UNESCO Universal Declaration of Cultural Diversity (updated in 
2002) is very much in line with the views expressed here, in particular in its plea to 
encourage linguistic diversity, to respect the mother tongue at all levels of edu- 
cation, and to foster the learning of more than one language from a very early age. 


See also the following chapters in this volume: 24 by Troester-Mutz, 29 by Dar- 
quennes and 44 by Wright. 
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II.4. Non-standard varieties 


26 Dialect vs. standard: 
a typology of scenarios in Europe 


Peter Auer 


Introduction 

Terminological preliminaries 

The typology 

A historical perspective: the emergence of endoglossic standard varieties 
Where are we heading? 


Qu darc 


1. Introduction 


The European sociolinguistic landscape looks scattered when taking into account 
the manifold ways in which standard languages and dialects relate to each other. 
Although the tendency for fine-grained dialect differences to erode in favour of 
more standard-oriented regional varieties, and even in favour of one or more stan- 
dard varieties, is dominant almost everywhere, the status of the standard varieties 
(in terms of prestige, situational usage and adequacy, and diffusion) varies as much 
as the status of the traditional and modern dialect varieties. Areas in which the use 
of the standard variety is almost restricted to writing and in which traditional dia- 
lects with a very limited regional reach are the dominant spoken varieties of every- 
day life, stand next to those in which the traditional dialects have completely dis- 
appeared and everybody uses the standard in oral communication. The strength 
(vitality) of the newly emerging regional varieties is sometimes superior to that of 
the standard varieties, and sometimes it is not. National standard varieties partly 
develop divergent internal tendencies (regional standards) and become “destandar- 
dised". In spite of this heterogeneity, this chapter intends to give evidence for a 
high amount of orderliness. After some terminological remarks on "standard" and 
"dialect", six (proto)typical scenarios of language-dialect contact will be presented 
which are able to cover almost all language situations in Europe. 

In the second part of the paper, I will add a historical perspective to this typol- 
ogy. Contrary to the situation in other parts of the world which European languages 
have been exported to, the European dialects are the predecessors! of the standard 
languages. I will argue that the emergence of endoglossic standard varieties out of 
the older vernaculars follows a general line of development with five consecutive 
stages (types of repertoires). Variation between the European language repertoires 
is due to the fact that they may have reached different stages, or may have moved 
through these stages at different times in history (and therefore, under different his- 
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torical circumstances). In any case, the emergence of endoglossic standard var- 
ieties has had a massive impact on the structure, use, and status of the old dialects. 

Due to limitations of space, no attempt will be made to cover the Buropean situ- 
ation exhaustively. Examples will mostly be drawn from the northern and central 
European area (but see Auer 2005a for a more detailed account). 


2. Terminological preliminaries 


It is obvious that the idea of a standard language is not reflected in lay metalin- 
guistic ideologies across Europe. The term itself is a technical term used by lin- 
guists, while non-linguistic, private and public, talk about language is dominated 
by more evaluative terms (such as Hochsprache or bon usage), by terms that fore- 
ground one specific aspect of the standard language (such as in Dan. rigsmäl), or 
by terms that highlight its function (as in Russ. literaturnyj jazyk). Almost the same 
holds true for the dialectal end of the repertoire, where the term dialect is used as a 
lay term but competes with others (such as patois, Platt, Mundart, accent, etc). For 
the purposes of this typology, I have chosen a relatively narrow definition of a stan- 
dard variety (see Ammon 2004 for a general discussion). It is based on three fea- 
tures: (a) a standard variety is a common language, i.e. one which (ideally) shows 
no geographical variation in the territory in which it is used; (b) a standard variety 
is an H variety, i.e. it has overt prestige and is used in situations which require a for- 
mal way of speaking (if a spoken standard exists at all), as well as in writing; and 
(c) a standard variety is codified, i.e. ‘right’ or ‘wrong’ plays an important role in 
the way in which speakers orient towards it. All three dimensions are gradual, i.e. 
the process of standardisation may be more or less advanced depending on how 
precisely the above criteria are met. In this sense, Standard Swiss German is less of 
a standard language than Standard German in Germany since it has less prestige, 
English is less standardized than French, since it is less codified, and German is 
less standardised than Danish since it shows more geographical variation within 
speech regarded as standard. My definition of a standard variety therefore differs 
both from purely normative approaches (which take a standard to be identical with 
a set of norms which may or may not be followed) and from broader definitions 
which do not require a standard to have prestige and/or show codification. If one 
were to follow the latter view, the typology presented below would not be valid any 
longer since common languages existed long before they were codified in some 
parts of Europe. The second criterion refers to the overt prestige of the standard 
while the non-standard varieties may show a certain degree of covert prestige (see 
Kristiansen and Jgrgensen 2005). This overt prestige is enough to make the stan- 
dard an H variety in the sense of Ferguson's original use of the term (Ferguson 
1959).? The overt prestige of the standard varieties is due to the fact that they are 
symbols of national unity; the language ideology which lies at the heart of their his- 
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torical origin and present-day definition is part and parcel of the European national 
ideology. Although this ideology is typically linked to a nation state, statehood is 
no prerequisite for it: nations that do not form states (such as the Basque nation, or 
the Catalan nation) are nonetheless committed to the idea of a national standard 
language, which thereby assumes the prestige of an H variety. There are older stan- 
dard languages such as Latin, Old Church Slavonic, or Arabic, which fall outside 
the scope of this paper (apart from their status as exoglossic standard varieties); un- 
like the national standard languages, they gained their overt prestige from their em- 
beddedness into different ideologies (mainly religious ones). 

At the opposite end, it is useful to reserve the term “(traditional) dialects” for 
the varieties under the roof (Kloss 1952) of a standard variety which preceded the 
standard languages and provided the linguistic material out of which the endoglos- 
sic standard varieties developed. Intermediate varieties between standard and dia- 
lect will be called "regional dialects (regiolects)" and "regional standards". 


3. The typology 


The typology distinguishes between five dialect/standard constellations which can 
be summarised as follows: 


Type Zero: exoglossic diaglossia (no endoglossic standard) 

Type A: (endoglossic) medial diglossia (endoglossic standard only used for 
writing) 

Type B: (endoglossic) diglossia (endoglossic standard also used in for 
speaking) 

Type C: diaglossia (intermediate forms between standard 
and dialect, such as regiolects) 

Type D: dialect loss (gradual dialect levelling or 


non-transmission) 


The first type of repertoire to be considered is somewhat special and will therefore 
be called a Zero repertoire. In a Type Zero repertoire, there is no endoglossic? stan- 
dard variety at all; rather, the standard variety is imported and not considered by its 
users to be a variety which is structurally related to the vernaculars. This was the 
case for of the most important pre-modern exoglossic European standard lan- 
guage — Latin — and of colonial exoglossic standards (such as English in Malta, Cy- 
prus, Gibraltar at different stages of their history); it was also the case for minority 
languages within the European nation states before they introduced endoglossic 
standards which came to be used in addition to the exoglossic ones. Type Zero rep- 
ertoires may be combined with endoglossic ones, i.e. transition from Type Zero to 
Type A (and from there to any further repertoire type) does not imply the loss of 
Type Zero but can lead to the addition of an endoglossic standard to the repertoire 
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exoglossic standard e 
(spoken/written) 


vernacular varieties 
(spoken/written) 


Figure 1. Type Zero Repertoire 


which coexists with the exoglossic one. Endoglossic standard/dialect repertoires 
may also turn into such dual standard repertoires later in their history, i.e. a new 
exoglossic standard may be added after the endoglossic standard has already been 
established. 

In a somewhat simplistic but still useful graphical representation,? the exoglos- 
sic repertoire type can be represented by a dot hovering above a territory in which 
non-related vernaculars are spoken, here represented by a circle (see Figure 1). 
When representing a repertoire schematically as in Figure 1 — and all the following 
figures — it is important to keep in mind what the schema ignores. In particular, the 
graphical distance between the standard and the vernacular varieties can be under- 
stood in two ways: as a structural distance and as an attitudinal distance. In the 
following, the two will be collapsed for the sake of simplicity but a more realistic 
approach would need to measure and represent them independently. Another sim- 
plification is the representation of the vernacular area as a contiguous language 
space. In some cases, the area in which the standard language is valid is not con- 
tiguous; this holds for endoglossic standards but even more for exoglossic ones. A 
more realistic representation could therefore look as in Figure 2. 


(exoglossic standard) e 
X 
GE 
vernacular varieties queres 
(spoken/written) D RE <> 


a 


Figure 2. Type Zero Repertoire over a non-contiguous area. The distance between the dot 
and the circles represents the structural distance between standard language and 
vernacular varieties, the X the attitudinal distance. In this case, the exoglossic 
standard variety is structurally quite distinct from the vernaculars but its prestige 
is relatively low. 
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endoglossic standard (mainly written) 


---------------> € 


dialects (mainly spoken) 


Figure 3. Type A repertoire — written diglossia 


Whether a standard variety is endoglossic or exoglossic is largely an attitudinal 
(and ideological), not a structural question. While it may be difficult to define a 
completely unrelated language as the endogenous standard of a certain group of 
vernacular varieties (even under conditions of heavy borrowing from the exoglos- 
sic standard into the vernaculars, i.e. some kind of structural convergence), there 
are many cases in which the structural distance between the vernaculars and the 
standard is small but the latter is still considered to be exoglossic. Just take German 
as an exoglossic standard in Luxembourg, or Russian as an exoglossic standard in 
Belorussia; for further examples, see Trudgill (2004). At the same time, there are 
some examples in which the structural distance between the standard and the ver- 
naculars is large enough to make them mutually incomprehensible but they are still 
considered to stand in an endoglossic relationship (for instance, Standard German 
is an endoglossic standard in the Low German and in the Bavarian dialect area). 
The tendency in present-day Europe seems to be to reduce the reach of the endo- 
glossic standard languages, i.e. to regard them as exoglossic in areas where they 
used to be considered endoglossic. (As a consequence, the now exoglossic stan- 
dard is often given up and an endoglossic one takes its place; cf. the situations in 
Belgium and Austria with regard to the Netherlands Dutch standard (Grondelaers 
and van Hout to appear) and the Germany-German standard respectively.) 

The first endoglossic standard/dialect repertoire type (Type A) is shown in Fig- 
ure 3. It differs from Type Zero in two ways. First, the standard variety is perceived 
to be related to the vernaculars for which it provides a roof, and which we can now 
call dialects. Second, although the relationship between standard and dialect re- 
mains diglossic, this diglossia assigns the varieties to different media: the endog- 
lossic standard is used in the written medium, the dialects in the spoken medium. 
(For some qualifications of this statement, see below, section 4.) 

Repertoires of this type are rare but existent in Europe today. Examples are 
German-speaking Switzerland and those parts of Norway in which Nynorsk is the 
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(regional) standards (spoken) 


standard (written) ——— — —» "o 


dialects (only spoken) 


Figure 4. Type B Repertoires: spoken diglossia 


overarching standard variety. Other examples come from language minority con- 
texts in which standardisation has only set in a short while ago and in which the 
standard is mainly a written variety which is taught at school but not used much 
outside. (This applies, for instance, to Standard Irish (cf. ÓhIfearnáin 2008) while 
Standard Welsh seems to be closer to Type B already.) 

A second type of endoglossic standard/dialect repertoire is shown in Figure 4. 
The important difference is that here the standard variety is also spoken. We are 
still dealing with diglossia though, and the structural difference between standard 
and dialect is perceived as too large to be bridged by intermediate forms. The two 
varieties are kept apart in speaking, usually because they are subject to different 
usage norms. Code-switching between standard and dialect is possible but gradual 
transitions (code-gliding) are not. Dialect and standard are compartmentalised. 
Their features co-vary in two-sided co-occurrence restrictions (cf. Auer 1997), i.e. 
the realisation of a variable as dialectal within a word (or even clause) implies the 
dialectal realisation of all the other variables as well (and vice versa). 

Like Type A, this type of repertoire is not particularly frequent in present-day 
Europe, although it has played an important role in language history. We still find it 
in recently standardised minority languages (such as Standard Welsh, Standard 
Basque) and in certain dialect areas of large nation states (such as the Low German 
area in Germany and the southern Italian dialects such as Sicilian or Neapolitan) 
which are characterised by a substantial structural distance between dialect and 
standard but in which the standard is nonetheless used as an oral variety in every- 
day life. The tendency here is to look upon the standard as an exoglossic standard, 
however, and to regard the dialect as a separate minority or regional language. 

An important difference between Type A and Type B repertoires is that in the 
latter there is inevitably more variation within the standard. Type B standards 
usually show regional variation — to the degree that regional standards can be 
identified at least on the phonological and phonetic level. 
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standard (spoken/written) O 


regional standards 


regiolects 


traditional dialects NE NN 


Figure 5. A diaglossic repertoire (Type C) 


The most widespread standard/dialect repertoire type in Burope today is not di- 
glossic, however, but diaglossic. This means that there are intermediate forms 
(layers of speech) between standard and dialect which ‘fill up’ the structural space 
between the two. Various terminologies have been used to refer to these intermedi- 
ate structures, or subsets of them, among them Umgangssprache, regional dialect, 
colloquial language, italiano popolare, tussentaal, etc. A general representation of 
the diaglossic repertoire type is given in Figure 5. 

Diaglossic repertoires typically show convergence between the traditional dia- 
lects, which is a by-product of dialect-to-standard advergence: the most dialectal 
ways of speaking (i.e. those with the smallest geographical reach) are given up by 
some speakers and in some situations in which the traditional dialects used to be 
spoken before, in favour of more widespread, ‘regiolectal’ ones. On the whole, the 
regiolects are always closer to the standard, both in structural terms and in terms of 
official (overt) prestige.5 However, it is important to note that regional dialects can- 
not be explained fully as a product of dialect-to-standard advergence. New struc- 
tures emerge, which are sometimes compromise forms conforming neither to the 
standard nor to the dialect and sometimes innovations. In the schematic represen- 
tation in Figure 5 these dynamics are symbolised by the horizontal arrows which 
expand the regional dialects beyond the limits of the lines which link the standard 
to the traditional dialects. 

Terms such as regional dialect or regiolect suggest that the intermediate layer 
of speech between dialect and standard has the status of a variety. This, however, is 
not necessarily the case, and careful empirical studies are necessary to establish 
variety boundaries (cf. Lenz 2003). Often, one-sided co-occurrence restrictions 
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Figure 6. The structural distance between the standard variety and the dialectal varieties is 
not the same across the language territory. 


hold (Auer 1997), which leave space for gradual shifts from a more dialectal to a 
more standard-like way of speaking, and vice versa. 

Another point needs to be brought home here, which could also be made with 
respect to Figure 3 and 4, i.e. A and B repertoires. The structural distance between 
the dialects and the standard variety often is not the same across the dialectal ter- 
ritory. Some dialects (often those of or around the capital) are closer to the standard 
than others since the standard language was formed on the basis of these dialects 
(see below). Once more, the schematic representation hides this important detail: It 
is therefore more adequate to use a representation such as in Figure 6. 

In the extreme form, the variety spoken by the elites of the capital can be ident- 
ical with the standard (as it is claimed for older, upper middle class Copenhagen 
Speech, cf. Kristiansen 1998, and for Paris). In this case, the top of the pyramid is 
identical with one point in the area which forms its base. In other cases, it is closer to 
certain dialects than to others (e.g. Bokmál — one of the Norwegian standards - to the 
Eastern Norwegian dialects; Standard Swedish to the dialects of the Stockholm area; 
Standard British English to those of the Southeast of England; Standard Finnish to 
the western Finnish dialects; Standard Dutch to the Randstad dialects; Standard Irish 


Peter Auer 493 


standard (written and spoken) 


2 ^« former dialects 
H 


Figure 7. Type D/2 (dialect loss) repertoires 


to the Connemara dialect, etc.). The base of the ‘pyramid’ is then not a flat but a 
curved plane, some of parts of which approach the standard (top) more than others. 

The last type of repertoire is one in which there are no dialects (Type D). This 
is typically the case after the traditional dialects have been lost. Only in the case 
of Iceland there is a standard language with no dialectal predecessors, or at least 
we have no evidence for the existence and levelling out of the primordial dialects 
(Arnason 2003). Type D repertoires have a variable standard language, whose 
variability may be a consequence of regional accents or regional standards, usually 
on the level of phonetics and/or vocabulary, or of destandardisation and the devel- 
opments of substandards which are not geographically located. There are two path- 
ways towards this stage: from Type B repertoires (diglossia) and from Type C rep- 
ertoires (diaglossia). In the first case (D/2, cf. Figure 7), the dialects are no longer 
transmitted from one generation to the next. Since there are no intermediate forms 
of speech, this means that the entire L-section of the repertoire is lost, often in a 
short period of time. Typical examples are the Low German dialects, which died 
out in such a way in most parts of the former Low German dialect area in the course 
of just a few decades in the middle of the last century. 

Another scenario which may eventually lead to the same result but through a 
prolonged stage of dialect levelling by dialect-to-standard advergence takes as its 
starting point a diaglossic repertoire. This repertoire gradually shrinks since the 
most dialectal forms are gradually replaced by less regional ones (cf. Figure 8, Type 
D/1). The process may stop at a point in which the traditional dialects have disap- 
peared but strong regional dialects have established themselves (such as, in Ger- 
many, in the Ruhrgebiet or Lower Saxony). However, it may also proceed beyond 
this stage towards the loss of these regional dialects (as presumably was the case in 
Denmark). Typically, dialect loss in a diaglossic repertoire type proceeds at a dif- 
ferent pace in various parts of a language area; it shows age grading and the usual 
distribution of gender patterns (with young women leading the change). It is also 
linked to urbanisation and its concomitant migration movements. 
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Figure 8. From Type C (diaglossia) to Type D/1 (dialect loss) 


4. A historical perspective: 
the emergence of endoglossic standard varieties 


The typology sketched out in section 3 can be used in order to classify the Euro- 
pean dialect/standard repertoires. However, it can also be used to describe the 
diachrony of these repertoires, i.e. in order to answer questions such as: How did 
the European endoglossic standard varieties emerge? How have they affected the 
traditional dialects over their history? And how can endo- and exoglossic standard 
varieties be combined? The typology suggests the following development: 


Type D/2 
A 


l 
l 
TypeA > TypeB > TypeC > Type!D/1 
Type 0 an A A A 


M M M M 
Type 0-A > Type0-B > Type0-C > Type:0-D/1 
l 


l 
Lu 
Type 0-D/2 


Figure 9. European standard/dialect repertoire developments 
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The European language development from Type 0 to Type D as shown in Figure 9 
is an empirical generalisation, not a sociolinguistic universal. It is driven by non- 
linguistic forces which are intimately tied to the developments of the European 
nation states. The transitions from one repertoire to the next are partly reversible. 
For instance, it is not rare that an endoglossic standard is lost and replaced by an 
exoglossic one when a group of speakers migrates out of its nation state territory 
(cf. some of the German language enclaves in eastern Europe and worldwide) or 
when state borders are redrawn (cf. standard German in the Alsace). Sometimes, 
the transition from B to C is reversed when a standard variety is spoken less and 
less (as happened in the case of the German standard variety in German-speaking 
Switzerland during the last century). Whether dialect loss is reversible by dialect 
revitalisation is an open question. These reversals are, however, marked develop- 
ments, which occur rarely and/or under special circumstances. 

Apart from the usual processes of standardisation (cf. Haugen 1966) and dia- 
lect attrition, the pathway from Type 0 to Type D is essentially intertwined with 
two major linguistic issues: bilingualism and medium allocation (spoken/written). 

The emergence of endoglossic standard varieties is always preceded by a phase 
in which an exoglossic standard is used. This situation implies bilingualism in the 
highest strata of society. This bilingualism may be replaced by monolingualism 
once the endoglossic standard is established but usually this is a long process 
(cf. the gradual decline of Latin as a European exoglossic standard language, of 
Swedish as a standard language in Finland after the establishment of the Finnish 
standard in the 19th century, or of German after the standardisation of Czech in the 
19th century, etc.); often, bilingualism persists (cf. the use of English after the es- 
tablishment of modern Standard Irish in the mid-20th century, the use of Standard 
French and Standard German after the establishment of the Létzebuergish after it 
became a national language in the same period, or the use of the majority language 
in all minority language contexts). Exoglossic standards may also be introduced 
into a repertoire after the endoglossic standard has been established (cf. the use of 
French in Germany, Russia, etc. in the 17th-18th century). 

The first transgression of the medial boundary between spoken and written 
language occurs when an endoglossic standard is established as a written variety 
(Type A repertoires). This written standard is formed in some way or other out of 
the vernacular varieties which it comes to roof but if these are purely spoken, stan- 
dardisation involves more than the selection and codification of single features 
from the vernacular forms; a wholesale reorganisation of the language is involved. 
Often, the relevant structural changes and Ausbau processes borrow from the 
exoglossic standard or (particularly in more recent, ‘constructed’, i.e. synthetically 
coined standards) from older written documents of the vernacular varieties, if they 
exist (as in the case of Finnish, Irish, or Czech). 

There are two exceptions from the generalisation that endoglossic standard 
varieties always appear in writing before they are spoken. The first exception are 
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repertoire splits from Type B or C repertoires, i.e. one part of a territory leaves the 
roof of a written and spoken standard language and establishes a slightly different 
one on the basis of the vernaculars spoken in this territory but also on the basis 
of the already existent standard variety; examples are the emerging Belgian Dutch 
standard (Grondelaers and van Hout 2009), the emerging Austrian German stan- 
dard variety, or the ‘new’ (or better, re-established9) standard languages of Croatia, 
Serbia, Bosnia. In these cases, the split applies to the written as well as to the 
spoken standard varieties (even though the norms of the new standard are usually 
reinforced in writing before they are reinforced in speaking). The second exception 
are standard varieties which are identical with the variety spoken in the capital (or 
at the court); candidates are French, Danish, and perhaps Swedish (Kort Svenska); 
for the (usually quite small) group of people who lived in these areas and were 
part of the relevant social groups, the transition was merely that from a spoken to a 
written variety. 

This transition from Type A to Type B is the second case in which the medium 
(speaking/writing) is reallocated but this time in the opposite direction: A spoken 
version of the standard evolves from the written standard and gradually spreads 
to all layers of society — a process which has only recently come to completion in 
some European nation states and is far from being completed in others. Mattheier 
(1997) uses the term “demotisation” to refer to this appropriation of a variety 
the norms of which were defined by a small sector of the well-educated classes by 
the whole society; he points out how such an appropriation can challenge these 
norms.’ The spoken standard, which in the beginning is only an articulated version 
of the written standard, is now made suitable for spoken, face-to-face interaction, 
also by less educated speakers. This weakens the norms and inevitably leads to 
a multi-stylistic standard and hence to variability (one form of destandardisation), 
and sometimes also to standard language change (another form of destandardis- 
ation). 

At the same time, the transition into B (and even more, C repertoires) weakens 
the traditional dialects: They are now no longer markers of the geographical origin 
of the speaker alone but take on a social value, i.e. that of the lower classes. If the 
gap between this standard and the old dialects is perceived as too wide to be 
bridged by its speakers, the consequence is that the acquisition of the standard by 
those who are in a social position to acquire it results in a diglossic repertoire, 
which almost inevitably leads to dialect loss in the long run (repertoire D/2). The 
best protection of the traditional dialects is therefore a weak or non-existent spoken 
standard variety (as is indeed the case in German-speaking Switzerland and Ny- 
norsk Norway). In most cases, however, the result is different: Speakers develop 
intermediate forms, which results in the emergence of new ways of speaking that 
avoid the negative social prestige now attached to the dialects but nonetheless dis- 
play regional identity (i.e. a diaglossic repertoire emerges). The regional dialects 
which now come into being have their own (second order) norm centres, usually 
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large cities. They may be more numerous in decentralised, large nation states and 
less numerous in centralised, smaller nation states but little is known about these 
correlations. 

The details of the emergence ofregional forms of speech seem to vary from one 
language area to the next. Variation includes their historical depth, their co-exten- 
sion with the old dialect groups, their location on the dialect/standard continuum, 
and their status as structurally defined varieties. Compared to the traditional dia- 
lects, they are the most dynamic but also the least researched part of the socioling- 
uistic history of Europe over the last 100 years. 


5. Where are we heading? 


The present-day situation in Europe seems to be characterised by the following 
tendencies. First, the idea of the European nation state which is symbolically de- 
fined by its ‘language’ (by which its protagonists usually understood the standard 
language) is not dead. On the contrary, cases in which standard languages and 
nation states have not coincided tend to be more and more resolved (cf. Auer 
2005b). The standard languages do not expand regionally across state borders; 
rather, former common standards dissolve by repertoire splits (Belgium/Nether- 
lands, Austria/Germany, Croatia/Serbia, etc.). This tendency, which strengthens 
the nation state, is counteracted by the fact that today almost all European minority 
languages have their own standard, which is tolerated to a greater or lesser degree 
by the state to which the minority group belongs. This weakens the idea of ‘one 
language = one nation state’. One should note, however, that very few of these 
standardised minority languages reach out across two or more states (an example 
are the Sami standard varieties). The number of officially accepted (autochthon- 
ous) minority and regional languages has been increasing steadily. Indirectly, how- 
ever, some of these minority languages even strengthen the nation state since they 
avoid the roof of a neighbouring standard language to be extended onto their terri- 
tory (cf. the case of Galician whose recognition as a regional language in Spain 
keeps it from being considered a dialect of Portuguese). 

Second, internal variability within the European standard varieties has been in- 
creasing over the last decades. This includes a tolerance for regional accents co- 
occurring with standard morphology and syntax but also for grammatical devi- 
ations from traditional standard norms. Only few of them are due to dialectal sub- 
strate influence, though. As a rule, these deviations introduce new forms, some- 
times leading to language change or at least a relaxation of the codified norms. In 
addition to regional accents, the standard varieties have also come to be stratified 
in social, stylistic, and ethnic’ terms. The emerging new substandards are not dif- 
ferentiated on the geographical level any longer. They also incorporate elements 
which arise out of language contact and multilingualism due to immigration. The 
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standard is fully demoticised and many speakers no longer have a dialectal way of 
speaking at their disposal. Therefore, the standard has to be able to provide the full 
range of expressive resources the speakers need. 

Third, there is a strong tendency to turn endoglossic repertoires into bilingual 
repertoires in which an exoglossic standard language (English) complements the 
endoglossic one. For ethnic groups, additional exoglossic standards (such as Stan- 
dard Turkish) may play a role. 

Fourth, the traditional dialects are dying out but regional dialects have often 
taken over their function as a marker of regional belonging and identity. 


See also the following chapters in this volume: 29 by Darquennes, 33 by Pahta/ 
Taavitsainen and 44 by Wright. 


Notes 


1 This is not to say that the developments of endoglossic standards only set in when the 
local vernaculars (dialects) had already assumed their modern shape. Particularly in the 
East Slavic part of Europe, dialect formation by divergence was still taking place while 
the first endoglossic standard languages appeared in addition to exoglossic Old Church 
Slavonic (Old Bulgarian). 

2 This does not mean, however, that Ferguson assumes that H varieties are not acquired as 
first languages. Today, the standard variety has increasingly come to be the first language 
of many speakers. 

3 The terms endoglossic and exoglossic correspond with Kloss’ Außendiglossie vs Binnen- 
diglossie (Kloss 1976). 

4 To my knowledge, this type of graphic representation of a repertoire was first used by 
Moser (1950). It has gained some currency due to its use in Trudgill (1986). 

5 Itis true that the prestige of the traditional dialects usually increases proportionally with 
their loss. Their social interpretation then changes, and they are seen less as markers of 
social position (lower class) but more as folkloristic markers of local identity. This pres- 
tige is different from that of the standard variety, of course. 

6 Serbian and Croatian had already begun to be standardized before the ‘invention’ of 
Serbo-Croatian as the Yugoslavian national standard language. 

7 The term was used before Mattheier by Maas (1985) to refer to the spread of literacy as a 
mass phenomenon. 

8 As Cheshire et al. (2008) show, speakers of an immigrant background do not usually ac- 
quire and use dialectal features but create their own, (poly-)ethnic ways of speaking. 
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27 Border effects in European dialect continua: 
dialect divergence and convergence 


Curt Woolhiser 
1 Introduction 
2 European borders and border regions: historical origins and contemporary status 
3. Border impact research in social dialectology: conceptual and methodological issues 
4 European borders as sites of dialect convergence and divergence: research on 
border effects in European dialect continua 
4.1. The Germanic languages 
4.2. The Romance languages 
4.3. The Slavic languages 
5. Conclusion 
1. Introduction 


Over the last two decades, the parallel processes of deepening European integration 
and EU expansion, accompanied by the profound changes associated with increas- 
ing global interdependence and cross-border migration, have contributed to a 
growing interest in the socio-cultural significance of borders among scholars in the 
social sciences and humanities. This heightened interest in borders has likewise 
made itself felt in recent research on dialect change in Europe, which as the birth- 
place of the ideologies and institutions of the modern territorial nation-state, pro- 
vides an exceptionally broad range of case studies for the study of the role of politi- 
cal borders in the formation and evolution of national languages, dialect landscapes 
and national and linguistic identities. 

While few linguists would contest the assertion that the existence of political 
borders is a powerful stimulus for the emergence of new standard languages, until 
recently the opinion has prevailed that in the context of Ausbau standardization 
(Kloss 1967), the actual territorial boundary separating the states where closely 
cognate standard varieties are employed has had little impact on pre-existing dia- 
lect continua, where historically speaking, isoglosses typically did not coincide 
with political borders. Recent research on contemporary European dialects, how- 
ever, has revealed that modern political borders have come to play an increasingly 
prominent role in processes of dialect convergence and divergence as the impact of 
national standard languages on local dialects has intensified over the course of the 
last century. 

In this chapter I will discuss a range of issues relating to processes of dialect 
convergence and divergence in European border regions, focusing specifically on 
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dialect continua that are divided by national political boundaries. In section 2, I will 
briefly discuss a number of basic issues regarding the origins and typology of 
European political borders, as well as more recent developments in their status and 
functions. In section 3, I provide a brief overview of key conceptual and methodo- 
logical issues relevant to the study of border effects in processes of dialect main- 
tenance and change, discussing the potential contributions to border effect research 
of social dialectology, linguistic geography, linguistic anthropology and the soci- 
ology and social psychology of language. Finally, in section 4, I examine a number 
of European border regions that are of particular relevance for the study of border 
effects, and provide a brief survey of existing research dealing with the impact of 
political borders on European dialect continua. 

My focus in this chapter will be on politically segmented dialect continua 
where structurally highly similar, closely-related standard varieties, or a single 
standard language (usually with national variants), function as state languages on 
both sides of the border. In such cases, the populations on the two sides of the 
border are frequently not regarded by their respective governments, and often even 
by local inhabitants themselves, as belonging to ethnolinguistic groups distinct 
from the majority population within the country’s borders. There are of course nu- 
merous regions in Europe where the alignment of state borders has resulted in the 
creation of regional ethnolinguistic minorities speaking languages that are struc- 
turally and genetically more distant from or unrelated to the dominant national 
(state) language, for example, Alsatian dialects in France, Basque in Spain and 
France, Saami dialects in northern Norway, Sweden, Finland and Russia, Hungar- 
ian in Romania, Slovakia, Ukraine and Serbia, Slovene dialects in Austria and Italy, 
Albanian dialects in Macedonia and Greece, and Macedonian dialects in Greece, 
among others. However, inasmuch as these cases, along with other regional minor- 
ity languages, will be dealt with elsewhere in this volume, I will for the sake 
of brevity limit my discussion to situations where dialects on both sides of the 
border have a closely-related standard language "roof," whether nominally the 
same language, for example, as in the case of the German-Austrian, German- 
Swiss, Austrian-South Tyrol, English-Scottish, French-Belgian, Netherlands-Bel- 
gian, French-Swiss, and Romanian-Moldovan border regions, or closely related, 
but distinct standard languages, such as the German-Dutch, Norwegian-Swedish, 
Castilian (Aragon)-Catalan, Galician-Portuguese, Czech-Slovak, Bulgarian- 
Macedonian, Slovene-Croatian, Belarusian-Russian (with some qualifications), 
and Belarusian-Ukrainian border regions. 


Border effects in European dialect continua — 503 


2. European borders and border regions: 
historical origins and contemporary status 


The political border as it has existed in Europe since the late 18th and 19th cen- 
turies is in many respects a fundamentally new phenomenon. Before this time, the 
norm in Europe, as in other parts of the world, appears to have been the presence 
of multiple and often overlapping political, legal and ecclesiastical jurisdictions, of 
frontiers that were highly permeable both culturally and linguistically, and in many 
cases, subject to frequent revision. As noted by Benedict Anderson (1991: 19) 
"[dynastic] states were defined by centers, borders were porous and indistinct, and 
sovereignties faded imperceptibly into one another." Indeed, Anderson argues, 
it was precisely this fluidity which made possible the rule of these pre-modern 
states over highly heterogeneous — and often territorially non-contiguous — popu- 
lations for many centuries. 

The transformation of political frontiers from zones of overlapping, ambiguous 
and incomplete sovereignties to clearly demarcated linear boundaries is an import- 
ant change in the territorial organization of Europe, associated with the rise of 
the nation-state from the 18th to the 20th centuries. As Malcolm Anderson notes, 
in contrast to earlier models of political and administrative organization "[t]he 
nation-state introduced a precise sense of territorial identity and of territorial con- 
trol" (Anderson 1996: 25). 

At the same time, increasingly centralized nation-states began to actively pro- 
mote the spread of standardized national languages among the populations within 
their borders. In the discourses of European state nationalism, these standard 
varieties were represented not only as the sole legitimate forms of the national lan- 
guages, but often as key prerequisites for membership in the nation. Eventually, 
the “one (standard) language = one nation = one territory = one state" model (Auer 
2005: 11) came to dominate public discourse on issues of language and nationality 
in Europe, despite numerous problematic cases and obvious counterexamples, and 
had a profound impact on popular language attitudes in many European speech 
communities. Cases of a lack of congruence between linguistic and state borders, 
particularly in the central and eastern parts of the continent, came to play an im- 
portant role in justifying political expansionism on the one hand, and on the other, 
in the emergence of nationalist movements challenging the geopolitical status quo.! 

The linguistic impact of the European ideology of the national standard lan- 
guage in the 19th century came to be amplified over the course of the 20th century 
with the spread of modern communications, improved transportation networks, 
greater geographical and social mobility of populations within state boundaries, 
and universal compulsory education (usually in a single national standard language), 
making international political borders a far more significant force in shaping the 
identities and linguistic repertoires of the inhabitants of border regions. Modern 
European political borders have thus influenced the linguistic behavior of border 
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dwellers both directly, by creating discontinuities in patterns of intra- or inter- 
dialectal communication and by creating or reinforcing territorially-bounded 
linguistic hierarchies, and indirectly, through the role they play in shaping the lan- 
guage attitudes and language ideologies that unite or divide dialect speakers on the 
two sides of the border. 

Yet it would be overly simplistic to claim that national borders automatically 
and exclusively orient populations along both sides of a border toward their re- 
spective dominant “national” identities and linguistic models. In reality, we often 
find that each community situated along a border, and indeed, different subgroups 
and individuals within each community, may show different responses to cultural 
and linguistic influences emanating from national centers as a consequence of pre- 
existing or emerging socio-cultural divisions and identities. 

In examining the role of European political borders in processes of dialect 
maintenance and change, it is also necessary to consider the impact of more recent 
developments in the political structure of Europe.? In the western and central part 
of the continent, as a consequence of deepening EU political and economic inte- 
gration and political decentralization (particularly in Spain and the UK), the formal 
distinction between "internal" administrative borders and international borders is 
becoming increasingly less clear, while the external borders of the EU have come 
to take on many of the characteristics and functions previously associated with 
traditional nation-state borders. At the same time, with the break-up of multi- 
national (quasi-) federations such as the Soviet Union, Yugoslavia and Czechoslo- 
vakia in the last two decades, previously internal borders (themselves largely based 
on ethno-linguistic criteria) have gained the status and full range of functions of 
international state borders. Both trends in the evolution of Europe's political 
geography offer researchers significant new opportunities to study the impact of 
changes in the location and status of political borders on processes of dialect main- 
tenance and change. 


3. Border impact research in social dialectology: 
conceptual and methodological issues 


As Britain (2007) notes, the field of sociolinguistics until quite recently tended to 
ignore or downplay the spatial dimension of social variation in language, despite 
important antecedents in dialectological cartography and cultural geography: 
"almost without exception, and rather than having been critically explored as a 
potential social variable, space has been treated as a blank canvas onto which 
sociolinguistic processes are painted. It has been unexamined, untheorised and its 
role in shaping and being shaped by language untested" (2007: 34). Drawing from 
theoretical innovations in the field of human geography, Britain argues that, like 
other forms of human behavior, linguistic behavior must be considered from the 
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standpoint of three fundamental spatial dimensions: 1) Euclidian space, the objec- 
tive, socially neutral space determined by mathematical and physical laws; 2) So- 
cial space, the space that is directly and indirectly shaped by human agency and 
various forms of cultural organization, including the way the state organizes, struc- 
tures and controls various forms of social interaction at the institutional, local, 
regional and national level; and 3) Perceived space, that is, how members of a so- 
ciety perceive their immediate and more distant environments (Britain 2007: 34). 
Clearly, in studying the sociolinguistic impact of political borders that do not cor- 
respond to any significant physical geographical obstacles, the role of the latter two 
dimensions, social space and perceived space, is of paramount importance, requi- 
ring the use sociological, anthropological and socio-psychological methods of data 
collection and analysis. 

The socially constructed nature of international political borders, and the con- 
struction and situational negotiation of national identities by the inhabitants of 
border regions have over the last two decades become a central focus of research 
in such fields as human geography, sociology and anthropology, including many 
studies of European border regions.? While the field of sociolinguistics has lagged 
behind somewhat in adopting the theoretical and methodological innovations as- 
sociated with the rise of “border studies” in the social sciences, in recent years a 
growing number of researchers have begun investigating border effects in pro- 
cesses of dialect maintenance and change in Europe. 

Auer and Hinskens (1996) present a model for the study of border effects in 
dialect continua, noting that borders (or more precisely, the socio-cultural discon- 
tinuities that they give rise to) can affect processes of dialect convergence and 
divergence with respect to two axes: the vertical, or dialect-standard axis, and the 
horizontal, or dialect-dialect axis. They predict that in situations where different 
standard languages dominate on the two sides of a politically divided dialect area, 
vertical convergence toward different standard varieties is likely to be the primary 
source of horizontal cross-border divergence (Auer and Hinskens 1996: 17). How- 
ever, they note that further empirical research on border effects is necessary to de- 
termine the relationship between horizontal divergence and vertical convergence, 
and the role played by autonomous processes of inter-dialectal convergence on the 
two sides of the border. 

In addition to taking into account the linguistic and functional relationship be- 
tween dialects and their standard “roofs,” the study of border effects must also 
be based on a thorough investigation of the socio-geographical structure of border 
regions, that is, the characteristics of the border region as social space. Among the 
institutional and functional aspects of borders that are of relevance to the study of 
border effects in social dialectology are: 

1. The political status of the border: international vs. internal borders and the 
border regimes associated with them, i.e. externally (state) enforced restrictions on 
the cross-border movement of people, information and goods. 
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2. The degree of cross-border political, economic and socio-cultural inte- 
gration and interaction, both at the macro-level (in terms of flows of goods and ser- 
vices, the existence of political, economic and cultural organizations and institu- 
tions in which actors from both sides of the border participate), as well as at the 
micro-level (frequency of cross-border travel, existence of kinship ties on the other 
side of the border, density and multiplexity of transborder social networks, expo- 
sure to mass media from the other side of the border, etc.). 

3. The degree of integration of the border region and its population into the 
national economies and national cultures on the two sides of the border, as re- 
flected in regional infrastructure development, settlement patterns, economic and 
social structure, patterns of in- and outmigration, and so on. 

As the field of human geography has moved away from its earlier positivistic 
biases, researchers dealing with border regions have come to stress the importance 
of subjective factors, including attitudes, ideologies and identities, that inform and 
structure the linguistic practices of communities, groups and individuals in border 
regions. How borderlands populations situate themselves ideologically in relation 
to national centers and the populations from whom they are separated by the border, 
as well as how they evaluate different language varieties, both standard and non- 
standard, in local verbal repertoires and those on the other side of the border, can 
play a key role in processes of cross-border dialect divergence and convergence. 
Many researchers would agree with Irvine and Gal (2000), Gal (2010), Busch and 
Kelly-Holmes (2004: 4) and Auer (2005) that linguistic boundaries themselves, and 
their relationship to notions of national identity and national territoriality, can be 
viewed as social constructs shaped by the discourses of political and cultural elites, 
the media and the general public, in which linguistically-defined groups and their 
posited territorial limits are constantly constructed and deconstructed. 

Dialect speakers’ perceptions and beliefs regarding the geographical distribu- 
tion of their own and other vernacular varieties can also have a significant impact 
on various aspects of language use, including processes of language or dialect 
maintenance and shift in border regions. Research on border effects must thus also 
take into account the subjective aspects of the socio-spatial dimension of language, 
that is, of the border as perceived space, which can be studied employing the 
methods associated with the new sociolinguistic subfield of perceptual dialectol- 
ogy (Preston 1999). 


4. European borders as sites of dialect convergence and divergence: 
research on border effects in European dialect continua 


In this section I will present an overview of research on border effects in dialect 
continua of the three major Indo-European language groups of Europe, Germanic, 
Romance and Slavic, highlighting important research questions, theoretical and 
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methodological issues, and directions for future research. As will be seen, the 
political segmentation of dialect continua within these three major linguistic 
groups raises a multitude of questions regarding processes of linguistic divergence 
and convergence in Europe, questions that have only relatively recently begun to 
be addressed in a principled and systematic way by sociolinguists. 


4.]. The Germanic Languages 


Research on border effects in Germanic language continua represents perhaps 
the largest and methodologically most sophisticated body of research on this issue 
within the broader study of processes of ongoing dialect change in Europe. Much 
of this research focuses on portions of the Continental West Germanic dialect con- 
tinuum, in particular the dialects of the German-Dutch border region, the Low 
Franconian dialects of the Dutch-Belgian and Dutch-German border regions, the 
Alemannic dialects of southwestern Germany and Switzerland, and the Middle 
Bavarian dialects of the German-Austrian border region. 

The impact of modern political borders on processes of dialect divergence 
along the Dutch-German border is the subject of a groundbreaking study by 
Kremer (1979). Focusing primarily on lexical features, Kremer compares the re- 
sults of a large-scale dialectological survey conducted in 121 sites in the central 
portion of the Dutch-German border region with mostly pre-WWI dialect atlas 
data from the same region. He finds evidence of increasing cross-border discon- 
tinuities in the distribution of the lexical items investigated, and notes significant 
cross-border differences in the use of the local dialects and standard languages ac- 
cording to informants' self-report data. 

In a 1984 study (reprinted in English translation as Kremer 1999), Kremer 
addresses the impact of the political border on subjective dialect boundaries in the 
same region on the basis of informants' responses concerning the dialects of 
nearby communities they consider to be most like their own (indicated by arrows 
linking the two points). Kremer finds that the distances between the locations on 
the Netherlands side of the border that were regarded by informants as having very 
similar or identical dialects are, as a rule, much greater than on the German side, 
which he argues is a reflection of the greater extent of regional dialect leveling and 
the influence of standard Dutch. Moreover, the political border is crossed in only 
five locations, a clear sign that it has become a salient subjective linguistic border 
for the majority of informants (Kremer 1999: 35). 

The role of the Dutch-German political border in the dissolution of the Conti- 
nental West Germanic dialect continuum was also investigated in a series of other 
subsequent studies, including a number of articles in Niebaum and Kremer (1990), 
Hinskens (1993) and others. 

Border effects in the Kleverland dialect area between Nijmegen and Venray 
along the contemporary Dutch-German border are the focus of a recent socioling- 
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uistic study by Giesbers (2008). Comparing historical and contemporary sociologi- 
cal and dialectological data from five communities on each side of the border 
(arranged into pairs by size and distance from the political frontier), Giesbers finds 
that the border has over time become an increasingly significant factor in cross- 
border social and dialectal divergence, particularly in the post- WWII era. Histori- 
cal and contemporary data on cross-border marriages, kinship networks and social 
contacts paint a picture of declining cross-border interaction, despite the increas- 
ingly “open” character of the border since WWII and its permeability to cross- 
border economic activity. Social divergence along the border is mirrored on the 
level of language attitudes and language use, as reflected in the results of a tele- 
phone survey and sociolinguistic questionnaires. Using dialectometric methods for 
determining linguistic distance, Giesbers finds that the lexical distance and pho- 
netic-phonological (Levenshtein) distance between the dialects within the Klever- 
land dialect continuum does not increase gradually as geographical distance in- 
creases, but shows a dramatic discontinuity along the Dutch-German border. 

Gerritsen (1999) presents the results of a three-way comparison of modern dia- 
lects in the Continental West Germanic continuum located in the “waist of the Ne- 
therlands" (Dutch Limburg) and neighboring regions of Belgium and Germany. 
This study is of interest in that it focuses on a single group of dialects that due to 
political separation have been subject to three different types of dialect-standard 
relationships in the recent past. Gerritsen finds a marked tendency toward dialect 
loss among younger informants on the German side of the border, as compared 
with their peers in the Netherlands and Belgian villages. This she attributes to the 
greater linguistic distance between the local dialects and standard German as op- 
posed to standard Dutch, as well as to differences in educational policy and atti- 
tudes toward dialect speech in the three countries. 

The importance of the function of the standard language(s) for processes of 
dialect change within national boundaries is highlighted by the case of dialects of 
Létzebuergesch in Luxembourg, formerly considered an integral part of the West 
Moselle Franconian dialect continuum. Gilles (1999) notes that as the result of the 
spread of central Létzebuergesch features associated with the dialect of Luxem- 
bourg city, the dialects in the eastern border regions are becoming less like those of 
neighboring parts of Belgium and Germany; i.e. this is an example of horizontal 
convergence within a country, without any appreciable vertical convergence, con- 
tributing to horizontal divergence across state borders. 

Scheuringer (1990) examines a case of cross-border divergence within the 
Middle Bavarian dialect continuum along the German-Austrian border, where “the 
same" standard language is employed on both sides of the border. While the major 
isoglosses in this continuum historically did not align with state borders, the border 
has come to represent an important linguistic discontinuity due to divergent pat- 
terns of dialectal convergence. Scheuringer notes that in the border towns of Sim- 
bach (Bavaria) and Braunau (Austria), the process of convergence of certain shared 
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dialectal features toward standard High German has proceeded unevenly on the 
two sides of the border, with the dialects on the German being more conservative 
than those on the Austrian side, while for other features divergence is also mani- 
fested in convergence toward different regiolectal colloquial target varieties (the 
Munich regiolect on the German side, and Vienna regiolect on the Austrian side). 

Even more striking is the ongoing process of dialect divergence along the Ger- 
man-Swiss border. As discussed in Auer (2005: 22-25) dialectological studies by 
Schifferle (1990) and other researchers have demonstrated that the originally very 
similar High Alemannic dialects on the two sides of the border are converging 
toward German regiolectal varieties on German territory, while on the Swiss side 
of the border the only type of convergence observed is toward the Zurich dialect of 
Swiss German. 

The linguistic effects of the former border between the FRG and the GDR have 
also attracted considerable attention among social dialectologists. Schlobinski 
(1997) in a summary of his own and other scholars’ research on the Berlin Urban 
Vernacular, notes important differences in the sociolinguistic evolution of the local 
Berlin vernacular in the FRG and the GDR. While in West Berlin the local ver- 
nacular came to be highly stigmatized among the upper and middle classes, in East 
Berlin it functioned more broadly as an urban colloquial variety, and had even 
begun spreading to neighboring regions, such as Brandenburg. 

Harnisch (2010) examines another segment of the former state border between 
the GDR and the FRG, along the Thuringia (former GDR)-Bavaria (FRG) border. 
Comparison of dialectal data from the 1930s and 1990s in the two regions reveals a 
remarkable pattern of dialectal divergence and convergence with regard to realiz- 
ations of syllable-initial /r/ along the border. When the state border was drawn in 
the late 1940s on the basis of the border of the Soviet zone of occupation, it cut in 
an arbitrary manner across a dialectal boundary between the East Franconian api- 
cal articulation [r] and Thuringian uvular [R]. Harnisch found that over the course 
of only two generations, the dialectal boundary had aligned itself with the state 
border, meaning that those communities on the GDR side that in the 1930s had api- 
cal [r] had shifted to the uvular articulation [R] (shared by East Middle German 
dialects and the German standard pronunciation norm), while those communities 
on the FRG side of the border that had originally had uvular [R] had adopted the 
Upper German apical [r]. Thus, employing the terminology introduced by Auer 
and Hinskens (1996; see discussion above), it may be said that the linguistic adap- 
tation of speakers on the GDR side of the border involved both horizontal conver- 
gence with the Thuringian (Middle German) dialect area and vertical convergence 
with the standard German norm, while speakers on the FRG side converged hor- 
izontally with the East Franconian (Upper German) dialects while diverging verti- 
cally from the standard pronunciation. 

It could be argued that such striking examples of linguistic polarization within 
a relatively short period of time are most common in the context of “hard” political 
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borders, where cross-border interaction is reduced to a minimum and relations be- 
tween the neighboring states are characterized by competition and conflict. At the 
same time, it is important to bear in mind that even in the case of border regimes 
such as that obtaining between the former GDR and FRG, patterns of dialect diver- 
gence and convergence in border regions are intimately bound up with attitudinal 
factors, i.e. political and regional identities. In this regard, Harnisch notes that after 
the emergence of the state border the apical articulation [r] came to serve as one of 
the linguistic markers of a regionally prestigious “Bavarian” identity on the FRG 
side of the border (being the dominant vernacular pronunciation throughout most 
of the Federal State of Bavaria), while uvular [R] came to be regarded by speakers 
on both sides of the border as a typical feature of the German spoken in the GDR. 
Inasmuch as speakers on the FRG side of the border wished to be identified as Bav- 
arians, rather than as being "from the East," uvular [R] rapidly receded among the 
younger generation in favor of the regiolectal variant [r]. On the GDR side of the 
border, the apical pronunciation, reinterpreted as a marker of being "from the 
West," came to be avoided in favor of the “GDR” variant, uvular [R] (2010: 279). 

Insular West Germanic (Anglic) is today represented by dialects of English and 
Scots. While the Scottish-English national border has been quite stable for cen- 
turies, and has been said to coincide with a number of important phonological and 
lexical isoglosses, there is evidence that processes of linguistic convergence, in 
particular, the spread of features from standard English and southern English urban 
varieties to the dialects of northern England, may potentially contribute to further 
cross-border linguistic divergence. 

The lexical aspects of linguistic divergence at the Scottish-English border have 
been investigated by Glauser (1974, 2000). Glauser (2000: 70) observes an inter- 
esting type of dialect divergence along political borders that does not fit within 
conventional wave or gravity models of geolinguistic diffusion, noting that in 
locations along the English side of the English-Scottish border, informants rejected 
older regional plural forms such as kye ‘cows’, shuin ‘shoes’, een ‘eyes’ as either 
non-existent or as “Scots,” while such forms were easy to elicit further south, as 
well as on the Scottish side of the border. This case presents a challenge both to the 
standard wave model as well as the gravity model, in that it implies that conver- 
gent, standardizing changes (at least in nominal morphology) may affect more pe- 
ripheral regions (the border zone) before those areas that are geographically closer 
to the main centers of the innovation. Here, the divergent change is motivated pre- 
cisely by residents in the immediate border region wishing to disassociate them- 
selves with the population on the other side of the border, a motivation that would 
be less strong the greater the distance from the border. 

An interesting case of phonological convergence and divergence as a function 
of identity “making and marking” along the Scottish-English border is presented 
by Llamas (2010). This study, part of a larger project on Accents and Identity on the 
Scottish-English Border, is based on sociolinguistic sampling of the speech of two 
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age groups (11-22 and 56-82) in four locations, two on each side of the Scottish- 
English border (Berwick-upon-Tweed (England) and Eyemouth (Scotland) in the 
east, and Carlisle (England) and Gretna (Scotland) in the west), along with cogni- 
tive and attitudinal questionnaire responses regarding informants’ claimed 
national and regional identities and evaluations of different accents and national 
identities. Focusing on realizations of coda /r/ among the two age groups in these 
communities, where rhotic variants are considered stereotypically “Scottish” and a 
zero realization is considered “English” or perhaps “British,” Llamas finds a sig- 
nificant correlation between the use of forms that may be said to serve as indices of 
a specific national identity and both claiming that identity and having positive at- 
titudes toward it. At the same time, she notes that this sociolinguistic divide is 
stronger and more stable among younger speakers in the eastern portion of the 
Scottish-English borderland (Eyemouth and Berwick-upon-Tweed), reflected both 
in linguistic behavior and in identities and language attitudes, while in the western 
region younger informants appear to be converging across the border both in terms 
of the spread of the zero realization of coda /r/ and in terms of their identities and 
attitudes, as populations on the two sides of the border appear to draw increasingly 
from a shared British identity. Llamas’s study highlights the importance of the 
local in border impact studies — how “the same" border, although a projection of 
the power and influence of specific political and cultural centers, may play differ- 
ing roles in the construction and transformation of the identities of different bor- 
derland communities, as a result of local socio-cultural, economic, geographical 
and other factors. 


4.2. The Romance languages 


As Goebl (2000) points out, Romance dialectology has historically been quite sen- 
sitive to the question of the role of political borders in processes of dialect diver- 
gence and convergence, not only because the modern Romance languages owe 
their very existence to a process of political fragmentation, but also due to the clear 
coincidence of many dialectal divisions and ecclesiastical and other borders 
throughout the Romance world. However, the impact of modern state borders on 
convergent and divergent developments in Romance dialect continua has only 
more recently become a focus of research. 

Focusing on the problem of border effects within the oil area of Gallo Ro- 
mance, Pohl (1978) discusses some aspects (primarily lexical) of linguistic diver- 
gence along the French-Belgian border. In his analysis he distinguishes two key 
concepts that are of relevance to the study of border effects in other European dia- 
lect continua as well: “linguistic field" (a continuum of closely-related varieties) 
and “communication field" (the communication network linking groups of speak- 
ers with each other). In Pohl’s view, the “communication field" is much broader 
than the notion of social network, in that it includes exposure to national and re- 
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gional media. Pohl argues that in the case of the French-Belgian border, lexical di- 
vergence is a consequence of a lack of congruence between the linguistic field link- 
ing the dialectal substratum and standard French superstratum on the two sides of 
the border and the communication field, which in this area is divided into Belgian 
vs. French zones of influence. 

Francard (2009: 107) notes that although standard French is rapidly displacing 
the indigenous Gallo Romance varieties — Walloon (which only crosses the 
French-Belgian border in a limited number of localities), Picard, Champenois and 
Lorraine — the influence of these local varieties on standard French in Wallonia, 
particularly on the lexical level, is considerably greater than in other parts of the oil 
area within the Gallo Romance continuum. At the same time, he notes, French 
speakers in Wallonia and Brussels tend to perceive the distance between “French 
French” and “Wallonian French” to be greater than actual linguistic data would 
warrant. Francard attributes this border effect in language attitudes to a feeling of 
linguistic insecurity among Walloons that is shared with other Francophones in pe- 
ripheral areas of the French speech territory. At the same time, Francard notes a re- 
cent change in attitudes toward local language varieties, in particular Walloon dia- 
lects as a symbol of Belgian Walloon identity, a factor that could potentially 
promote cross-border divergence from French French. 

Goebl (2000) presents the results of a macro-level study of dialect similarity 
within the western Romance dialect continuum employing the dialectometric 
methodology that has been pioneered in Romance dialectology. Focusing on the 
southeastern French, Francoprovengal, Occitan, and Lombard dialects, Goebl 
presents evidence that sharp differences between dialects in border regions are 
characteristic only of the Italian-French border (dialects of Occitan and Franco- 
provençal) and Swiss-Italian border (Lombard dialects of Italian), while the border 
effect is minimal in the case of the border between France and Francophone Swit- 
zerland. 

There are relatively few sociolinguistic studies devoted specifically to the dia- 
lects of Occitan along the French-Italian and French-Spanish borders, in particular 
the Occitan enclaves in western Piedmont in Italy and the Val d’ Aran in Catalonia. 
The latter region is the focus of a recent study by Carrera (2008), who argues that 
the main distinguishing feature of the local Aranese dialect, essentially a Gascon 
variety of Occitan, is that it has been influenced by Catalan and Castilian Spanish 
rather than French, unlike the majority of Gascon dialects, i.e. cross-border diver- 
gence between Aranese and other Gascon Occitan varieties is primarily the result 
of vertical convergence toward different Romance standard languages, rather than 
intra-Occitan dialectal differentiation per se. 

The problem of border effects in the Iberian Romance dialectal continuum has 
within the last twenty years attracted increasing interest on the part of social dia- 
lectologists, a development directly related to issues of Catalan political autonomy 
and language policy within Spain and the EU as a whole. Linguists classify the 
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dialects of the eastern part of Aragon (known as the Franja) as unambiguously 
Catalan, while to their west are found transitional Catalan-Castilian and Catalan- 
Aragonese dialects. The speakers of the dialects of the Franja themselves, how- 
ever, rarely identify their speech as Catalan, using instead various traditional des- 
ignations such as tamarita, fragati, nonaspí, maellá, chapurreao, etc. Despite the 
fact that Catalan language instruction has recently become available in local 
schools in the Franja, as Fort-Cafiellas (2007) notes, most speakers of Catalan dia- 
lects in the Franja still consider themselves Aragonese rather than Catalan, which 
gives rise to ideological challenges from both sides: from the Aragonese/ Castilian, 
the notion that one must speak Castilian to be considered Aragonese, and from 
the Catalan side, the notion that speakers of varieties of Catalan are by definition 
Catalans. 

The dialects of northern Portugual and Galicia, considered by many linguists 
to belong to a single diasystem, are a particularly fruitful area for the study of re- 
cent border effects in a European dialect continuum divided by an “old” political 
border. Recent studies of the Portuguese-Spanish border region by human and 
political geographers (for example Pinheiro 2009), have shown how despite the 
diminished importance of the state border as an institution and physical barrier in 
the context of EU integration, it continues to function as a salient symbolic bound- 
ary. While linguistic issues are peripheral to the concerns of Pinheiro's study, his 
findings would nonetheless serve as a useful framework for the investigation of so- 
ciolinguistic border effects along the Spanish-Portuguese border in Galicia. 

The eastern Romance language Romanian, including its eastern variant “Mol- 
dovan," sometimes still regarded (for example, in the separatist Transdniester re- 
gion) as an independent language, provides one of the most striking recent 
examples of “planned” linguistic divergence in language standardization. Since the 
collapse of the Soviet Union, however, the success of the Moldovan project has 
been cast in doubt, as Moldova officially abandoned the Cyrillic alphabet in favor 
of the Romanian Latin script, eliminating one of the few features distinguishing 
written “Moldovan” from “Romanian.” 

It should be noted that the territory of the historical Principality of Moldavia is 
today in fact divided between modern-day Romania and Moldova; the Prut river, 
forming the border between the two states, did not constitute in the past a linguistic 
border. Recent studies by Ciscel (2007) and other researchers, however, have docu- 
mented a number of Russian-influenced changes in the spoken Romanian/Moldo- 
van language of urban areas, including palatalization of dental stops and velariz- 
ation of fricatives, morphosyntactic calques, and heavy lexical borrowing (many 
such contact-induced innovations are not, however, considered normative). Never- 
theless, further empirical research along the border is required to determine 
whether rural dialects on the two sides of the Romanian-Moldovan border have di- 
verged in a similar fashion, and to what extent basilectal dialect varieties affect 
spoken standard Romanian/Moldovan on the two sides of the border. 
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4.3. The Slavic languages 


The Slavic languages are of particular interest for investigating the role of political 
borders in the dissolution and maintenance of dialect continua in Europe, in that 
the Slavic-speaking countries have experienced perhaps the greatest number of 
significant changes in border locations and statuses over the course of the last 
hundred years. Given that many ofthese borders have only been established within 
the last few generations, it becomes possible in many parts of Slavic Europe to in- 
vestigate border effects in apparent time, comparing processes of linguistic diver- 
gence and convergence in the speech of older and younger residents of border com- 
munities. 

One of the most complex regions in the Slavic-speaking parts of Europe with 
respect to the impact of political borders on dialect maintenance and change is the 
South Slavic dialect continuum, which begins in the northwest with the internally 
highly differentiated dialects of Slovene, transitioning to varieties of Croatian and 
Serbian (also referred to as “Central South Slavic,” as this portion of the continuum 
includes varieties spoken in Bosnia-Hercegovina and Montenegro as well), and 
ending in Macedonian and Bulgarian in the southeast. As in the case of Continental 
West Germanic, dialectologists (or at any rate, those without a political agenda) 
have generally noted a common lack of congruence between political borders and 
dialectal isoglosses in much of the South Slavic dialect continuum. 

While linguists have long noted the existence of significant shared features be- 
tween the Slovene and Croatian Kajkavian dialects of the Slovenian-Croatian 
border region, the impact of the modern border on local attitudes and identities has 
only recently become a focus for research, primarily by Slovenian anthropologists 
and sociologists. Recent research by Nikocevié (2005) on the development of local 
attitudes and practices in a number of communities along the Slovenian-Croatian 
in Istria, a peninsula on the Adriatic coast divided between Croatia, Slovenia and 
Italy, shows how even a recently introduced border regime can affect local iden- 
tities and perceptions. While the dialects on the two sides of the border are very 
similar, the border is construed to mark the territorial extent of two distinct (albeit 
to some extent mutually comprehensible) superposed standard languages, Slove- 
nian and Croatian. Populations on the two sides of the contemporary border have 
traditionally had close kinship, cultural and economic ties, and a local identity 
characterized either by ambiguity or a refusal to accept either of the national labels 
“Croatian” and “Slovenian,” opting instead for “Istrian.” Since Slovenia joined the 
EU in 2004, this border has become an even more significant obstacle to move- 
ment. However, Nikočević notes that the impact of the border on identities is 
uneven; while people on the Slovenian side increasingly tend to think of them- 
selves as Slovenes and refer to the population across the border as “Croatians,” the 
local Istrian identity remains particularly strong on the Croatian side, which is 
economically less developed than the area on the Slovene side of the border. 
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The problem of divergent language planning in the former Yugoslavia, in par- 
ticular the fate of the language formerly known as “Serbo-Croatian,” has for ob- 
vious reasons attracted particular interest over the last decade. Greenberg (2004) 
documents the disintegration of “Serbo-Croatian,” once generally considered a 
single language with two distinct regional standard variants, into several increas- 
ingly divergent *Ausbau" standard languages. Three of these four "successor lan- 
guages," Croatian, Bosnian and (potentially) Montenegran, are based on the same 
basic phonological type (ijekavian), while standard Serbian is based on the eka- 
vian dialect (for example, standard Croatian and Bosnian lijepa rijeka vs. stan- 
dard Serbian Jeng reka ‘beautiful river’). Divergent planning is thus expressed 
primarily in lexical differences (reflecting both different dialectal sources as well 
as different attitudes toward lexical borrowing and the sources of such borrow- 
ings), although some divergent morphological and syntactic features are observed 
as well. It remains to be seen, however, how divergent language planning will be 
reflected in the dialect landscape of border regions within the Central South 
Slavic area, and indeed even speakers from non-contiguous areas of Central South 
Slavic still appear to have no difficulty communicating in their native variety with 
speakers of the nominally “other” language. Moreover, it appears that there are 
significant attitudinal obstacles to the acceptance of “national” standards; it seems 
unlikely, for example, that Bosnian Serbs will abandon their ijekavian pronunci- 
ation in favor of the ekavian Serbian standard, despite attempts by the Bosnian 
Serb leadership in the 1990s to introduce this variant as an assertion of Serbian 
identity. 

Another portion of the South Slavic dialect continuum in which political 
borders have influenced the development of local dialects and the identities of their 
speakers are the dialects of the Prizren-Timok type spoken in the Serbian-Macedo- 
nian-Bulgarian border region.* The speakers of these dialects, which share many 
important structural features with Bulgarian and Macedonian (including a number 
of key phonological features, as well as the loss of nominal inflection), self-ident- 
ify primarily as Serbs on the Serbian side of the border (with a small area of self- 
identified Bulgarians along the southwestern part of the border), as Bulgarians on 
the Bulgarian side, and as Macedonians on the Macedonian side of the border. 
Some linguists studying this portion of the South Slavic dialect continuum have ar- 
gued that political borders have had a minimal impact on the original structural 
unity of the dialect areas they divide; for example, Miseska-Tomié (1992) claims 
that there are no isoglosses that coincide with the contemporary Macedonian-Bul- 
garian state border. However, Angelov (2000) suggests otherwise, at least with 
regard to the Bulgarian-Macedonian border, noting that the influence of the Mace- 
donian and Bulgarian standard languages on the contemporary dialects has led to 
cross-border horizontal divergence. The potential for significant dialectal diver- 
gence along the contemporary Bulgarian-Macedonian border is also indicated by 
the results of a study by Herson-Finn (1996), who found that Macedonian inform- 


516 Curt Woolhiser 


ants typically regard Serbian to be closer to their own language than Bulgarian, 
contrary to the majority opinion among linguists. 

As far as the West Slavic subgroup is concerned, similar instances of politically 
segmented dialect continua can be cited. The Czech-Slovak or southern West 
Slavic dialect continuum, despite the obvious role that nationalism and national 
borders have played in the construction of a Slovak standard distinct from Czech, 
has only very recently become the focus of border impact research. A recent study 
of the linguistic identity of residents of the Slovak side of the border (Bosnitovä 
2006, cited in Sloboda 2006: 228) finds that local residents in the western Slovak 
village of Skalica categorize their local dialect as Slovak and are converging to- 
ward standard Slovak, despite the fact that the local dialect, which shares many 
features with Moravian dialects of Czech across the border, is regarded as “Czech” 
by their neighbors to the east. 

The impact of political borders and divergent national identities on the dialectal 
continuum linking Moravian varieties of Czech with Silesian Polish dialects is the 
focus of Hannan's (1996) study of the contemporary Zaolzie dialects in the vicinity 
of Těšín in the Czech Republic. Hannan notes that speakers of the Zaolzie dialect, 
which is typologically closer to the Silesian dialects of Polish, who identify them- 
selves as Czechs lead the way both in the adoption of lexical and syntactic in- 
fluences from standard Czech and in the shift toward the regional variety of Czech. 
Those speakers of the Zaolzie dialects who identify as Poles, on the other hand, are 
more conservative in their linguistic behavior, and also continue in some cases to 
use standard Polish alongside Czech in education and other domains (Hannan 
1996: 169). Thus, as political borders force border populations to choose political 
(and often, linguistic) allegiances, groups within a state's boundaries who choose 
ethno-cultural allegiance to a neighboring nation may show horizontally divergent 
behavior with respect to their immediate neighbors on the same side of the border, 
either through innovation or through maintenance of traditional dialect features as 
an ethnic marker. 

The East Slavic dialect continuum is characterized by a gradual transition from 
dialects of Ukrainian in the SW to dialects of Russian in the NE, with the core 
Belarusian dialect area representing in many respects a transitional zone between 
the two. Thus, on both the Belarusian-Ukrainian and Belarusian-Russian borders, 
where there are relatively few areas where older dialectal isoglosses coincide with 
modern political divisions, we might expect to find significant cross-border diver- 
gence. However, given the fact that standard Russian served as the dominant offi- 
cial language in the late Tsarist period and much of the Soviet period in Belarus 
(and, to a lesser extent, in northwestern Ukraine), the dialects on both sides of these 
borders have in many cases been exposed to the same standardizing pressures. 

The contemporary Belarusian-Russian border, which coincides to a large ex- 
tent, particularly along its southern portion, with the late 17th-18th century border 
between the Grand Duchy of Lithuania (the eastern member of the Commonwealth 
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of the Two Nations, or Polish-Lithuanian Commonwealth) and Muscovy/the Rus- 
sian Empire, is considered a classic example of an arbitrary border imposed upon a 
relatively undifferentiated section of a larger dialectal continuum. The current 
borders were laid out during the 1920s as the administrative border between the 
Belarusian Soviet Socialist Republic and Russian Soviet Federative Socialist Re- 
public within the USSR's quasi-federative system, being based to varying degrees 
on ethnographic and linguistic, economic and historical considerations, but in 
many cases without consideration for the identities and preferences of the local 
population. Over the last decade, a number of studies carried out on both sides of 
the contemporary Belarusian-Russian border have found that both at the level of 
language structure and at the level of language attitudes, the dialects on both sides 
of the contemporary Belarusian-Russian border remain fairly ambiguous, with 
local residents typically describing their speech as “mixed” (implying a mixture of 
Belarusian and Russian), while younger speakers are generally converging to vary- 
ing degrees toward standard Russian. In this region, rather like the western part of 
the Scottish-English border investigated by Llamas (2010), or much of the popu- 
lation along the Slovenian-Croatian border in Istria discussed by Nikočević 
(2005), local residents, particularly on the Belarusian side, appear to adopt a strat- 
egy of neutrality, in this case often appealing to a shared Eastern Orthodox Slavic 
identity. 

According to Sloboda (2006) a number of his informants in the Belarusian- 
Russian border region claimed that while old people on both sides of the border 
speak the same “mixed language,” the younger generation on the Russian side sup- 
posedly speak a “purer” Russian than their peers on the Belarusian side. Also note- 
worthy is that on the Belarusian side, nearly all of the informants tended to indicate 
Belarusian as their “native language" (Russian rodnoj jazyk, Belarusian rodnaja 
mova), although it should be noted that in Soviet and post-Soviet practice, the 
“native language” as reflected in censuses, surveys and official discourse was 
essentially defined as the standard language associated with an individual’s ethnic 
heritage, regardless of whether or not he or she actively uses it. On the Russian 
side, given the absence of official recognition of a territorially-based Belarusian 
minority and the absence of Belarusian-language schooling, it is unlikely that any 
natives of the region self-identify as Belarusian. While there is a high degree of so- 
ciolinguistic variability on both sides of the border, representing a continuum of 
varieties from local dialects (spoken now only by the oldest members of rural com- 
munities) to mixed varieties and near-standard Russian (and, among a relatively 
small percentage of the population, near-standard Belarusian as well on the Bela- 
rusian side), the trend toward vertical convergence toward standard Russian ap- 
pears overall to be stronger on the Russian side. 

While there are some archaic shared North Slavic phonological features link- 
ing the Western and East Slavic languages, as well as some later more localized 
convergent developments, it is generally agreed by linguists that there are no true 
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transitional dialects in the zone of contact between East Slavic (Belarusian and 
Ukrainian) and West Slavic (Polish and Slovak). In the wake of WWII the Polish- 
Ukrainian linguistic border was along much of its extent brought into accordance 
with state borders as a result of government-sponsored ethnic cleansing in Poland’s 
eastern provinces, while the westernmost portion of the Belarusian and transitional 
Belarusian-Ukrainian dialects has since WWII been divided between Poland and 
the Republic of Belarus. In Woolhiser (2005) I examine the impact of the state 
border on the evolution of the Belarusian dialects in the northern part of Poland's 
Podlasie province and the neighboring Hrodna (Grodno) region of Belarus on the 
basis of dialect atlas data, sociolinguistic interviews and surveys conducted in vil- 
lages on both sides of the contemporary border. While dialect attrition due to 
purely demographic factors (outmigration of younger people to the cities, the death 
of older dialect speakers) is in evidence on both sides of the border, fundamental 
structural change in the dialects is observed primarily on the Belarusian side. This 
striking asymmetry is evidently connected above all with the closer genetic and 
typological relationship between the Belarusian dialects and Russian, which has 
facilitated the emergence of a continuum of compromise varieties incorporating 
elements of local Belarusian dialect and standard Russian (and to a lesser degree, 
standard Belarusian). On the Polish side of border, on the other hand, among those 
social groups where dialect attrition has occurred (including a sizable percentage 
of village residents under age 40), it is a more abrupt process involving language 
shift from the local Belarusian dialect in favor of the regional Polish interdialect, 
heavily influenced by the standard language but showing some residual Belarusian 
substratum features. Border effects are also observed in the domain of language 
attitudes, which were investigated on the basis of interview and survey questions. 
Striking cross-border divergences in attitudes toward Polish, Russian, standard Be- 
larusian and local Belarusian dialects were observed, reflecting the sociolinguistic 
integration of the population on the Polish side of the border into the Polish 
national speech community and of the population on the Belarusian side into a hy- 
brid Russian-Belarusian speech community. 

Unlike the Belarusian-Russian and Belarusian-Ukrainian border regions, most 
of the contemporary Russian-Ukrainian border does not cut across a dialectal con- 
tinuum; rather, it represents a fairly arbitrary division of an area of linguistically 
mixed settlement, a result of relatively late (17th-19th centuries) migration to a 
previously sparsely populated steppe region of speakers of Ukrainian dialects from 
the west and speakers of south Russian dialects from the north; similarities be- 
tween the Ukrainian and Russian dialects located on both sides of the border are 
thus the result of secondary convergence, as well as the influence of standard Rus- 
sian on both groups of dialects. Most research on the contemporary Ukrainian- 
Russian border region has focused on national identities and language attitudes on 
the Ukrainian side (Zhurzhenko 2006), rather than the impact of the border on lan- 
guage use and dialect landscapes. In general, it seems, residents of the Ukrainian- 
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Russian border region, particularly those with cross-border family or economic 
ties, have somewhat ambivalent attitudes toward the border, although they are 
increasingly coming to regard it as a "natural" and necessary part of their local en- 
vironment. How the shifting attitudes toward the border, and the discontinuity in 
national identities it symbolizes, are reflected in processes of dialect maintenance 
and change on the two sides of the border is a topic that awaits further research. 


5. Conclusion 


Despite the lofty rhetoric of a “borderless Europe,” the actual experience of popu- 
lations living in many modern European border regions suggests that modern state 
or "national" borders will in all likelihood continue to play a significant role in 
processes of dialect maintenance and change in Europe for the foreseeable future. 
Indeed, as Auer (2005: 28) notes, even after political borders have ceased to serve 
as significant physical barriers to the movement of people, goods and ideas, their 
socio-psychological effects may persist for generations, if not for centuries. Inas- 
much as political borders mark the territorial limits of “imagined communities" 
and the linguistic repertoires which in various ways symbolize national identities, 
the sociolinguistic study of border effects must thus be based not only on system- 
atically collected, representative linguistic data, but also grounded in the study of 
the subjective and ideological dimensions of linguistic variation. As the body of 
sociolinguistic research on European borders and borderlands develops in size and 
scope, it will ultimately be possible to undertake comparative studies employing a 
uniform theoretical apparatus and methodology, making it possible to identify the 
full range of social and structural factors that promote or impede dialect divergence 
and convergence across political borders. 


Notes 


1 As noted by Auer (2005) and Gal (2010) the field of dialect geography was often ideo- 
logically complicit, whether wittingly or not, in promoting these political projects. 

2 The changing status and functions of European borders over the course of the last century 
are discussed by O’ Dowd 2003. 

3 See, for example, Donnan and Wilson 1994; Wilson and Donnan 1998; O'Dowd 2003; 
Lundén 2006, among others. 

4 These dialects are referred to as Torlak (torlaSki) in Serbian dialectology, and are some- 
times referred to as Shop (shopski) or western Bulgarian dialects in the Bulgarian dialec- 
tological tradition. 


520 Curt Woolhiser 


References 


Anderson, Benedict 
1991 Imagined Communities: Reflections on the Origin and Spread of Nationalism. 
Rev. ed. London: Verso. 
Anderson, Malcolm 
1996 Frontiers: Territory and State Formation in the Modern World. Cambridge: 
Polity Press. 
Angelov, Angel 
2000 The political border as a factor for language divergence. Paper presented at 
ICLaVE I, Barcelona. 
Auer, Peter 
2005 The construction of linguistic borders and the linguistic construction of borders. 
In: Markku Filppula, Juhani Klemola, Marjatta Palander and Esa Penttilà (eds.), 
Dialects across Borders: Selected Papers from the 11th International Conference 
on Methods in Dialectology, 3-30. Amsterdam/Philadelphia: John Benjamins. 
Auer, P. and Frans Hinskens 
1996 The convergence and divergence of dialects in Europe. New and not so new 
developments in an old area. Sociolinguistica 10: 1-30. 
Britain, David 
2007 Geolinguistics — diffusion of language. In: Ulrich Ammon, Norbert Dittmar, 
Klaus Mattheier and Peter Trudgill (eds.), Sociolinguistics: An International 
Handbook of the Science of Language and Society. Vol. 1: 34—48. Berlin/New 
York: Walter de Gruyter. 
Busch, Brigitta and Helen Kelly-Holmes 
2004 Language boundaries as social, political and discursive constructs. In: Brigitta 
Busch and Helen Kelly-Holmes (eds.), Language, Discourse and Borders in 
the Yugoslav Successor States, 1-12. Clevedon: Multilingual Matters. 
Carrera, Aitor 
2008 Fets d'anivellament dialectal en una zona de frontera. El cas de l'occità de la 
Vall d' Aran. In: Johannes Kabatek and Claus D. Pusch (eds.): Variació, poli- 
glossia i estàndard, 1—22. Aachen: Shaker. 
Ciscel, Matthew H. 
2007 The Language of the Moldovans: Romania, Russia, and Identity in an Ex- 
Soviet Republic. Lanham, MD: Lexington Books. 
Donnan, Hastings and Thomas Wilson 
1994 An anthropology of frontiers. In: Hastings Donnan and Thomas Wilson (eds.), 
Border Approaches: Anthropological Perspectives on Frontiers, 1-14. Lan- 
ham, Maryland: University Press of America. 
Fort-Cafiellas, María Rosa 
2007 Sociolinguistics on the Aragon-Catalonia border. IJSL 184: 109-119. 
Francard, Michel 
2009 Regional languages in Romance Belgium: The point of no return? IJSL 196/ 
197: 99-119. 
Gal, Susan 
2010 Language and political spaces. In: Peter Auer and Jürgen Erich Schmidt (eds.) 
Language and Space: An International Handbook of Linguistic Variation. Vol. 
1, 33-50. Berlin and New York: De Gruyter Mouton. 


Border effects in European dialect continua 521 


Gerritsen, Marinel 
1999 Divergence of dialects in a linguistic laboratory near the Belgian-Dutch- 
German border: Similar dialects under the influence of different standard lan- 
guages. Language Variation and Change 11: 43-65. 
Gilles, Peter 
1999 Dialektausgleich im Létzebuergeschen: zur phonetisch-phonologischen Fo- 
kussierung einer Nationalsprache. Tübingen: Niemeyer. 
Giesbers, Charlotte 
2008 Dialecten op de grens van twee talen — een dialectologisch en sociolinguistisch 
onderzoek in het Kleverlands dialectgebied. Dissertation, Radboud Universi- 
teit Nijmegen (http://dare.ubn.kun.nl/handle/2066/67849). 
Glauser, Beat 
2000 The Scottish-English border in hindsight. International Journal of the Sociol- 
ogy of Language 145: 65—78. 
Glauser, Beat 
1974 The Scottish-English linguistic border: lexical aspects. Bern: Francke. 
Goebl, Hans 
2000 Langues standards et dialectes locaux dans la France du Sud-Est et l'Italie sep- 
tentrionale sous le coup de l'effet-frontiére: une approche dialectométrique. 
International Journal of the Sociology of Language 145: 181—215. 
Greenberg, Robert 
2004 Language and Identity in the Balkans: Serbo-Croatian and its Disintegration. 
Oxford: Oxford University Press. 
Hannan, Kevin 
1996 Borders of Language and Identity in Teschen Silesia. New York: Peter Lang. 
Harnisch, Rüdiger 
2010 Divergence of linguistic varieties in a language space. In: Peter Auer and 
Jürgen Erich Schmidt (eds.) Language and Space: An International Handbook 
of Linguistic Variation. Vol. 1, 275—295. Berlin and New York: De Gruyter 
Mouton. 
Herson Finn, Viktoria 
1996 What is "Nash"? Toward a theory of ethnolect in the South Slavic dialect con- 
tinuum. Unpublished Doctoral Dissertation, Ohio State University, USA. 
Hinskens, Frans 
1993 Dialekt als lingua franca? Dialektgebruik in het algemeen en bij grensover- 
schrijdend contact in het Nederrijnland en Twente. In: Ludger Kremer (ed.), 
Diglossiestudien. Dialekt und Standardsprache im niederlündisch-deutschen 
Grenzland, 209—245. Vreden: Landeskundliches Institut Westmünsterland. 
Kloss, Heinz 
1967 Abstand languages and Ausbau languages. Anthropological Linguistics 9: 


29-41. 
Kremer, Ludger 
1999 The Netherlands-German national border as a subjective dialect boundary. In: 


Dennis Preston (ed.), Handbook of Perceptual Dialectology. Vol. 1, 31—36. 
Amsterdam/Philadelphia: John Benjamins. 
Kremer, Ludger 
1979 Grenzmundarten und Mundartgrenzen. Kóln: Bóhlau Verlag. 


522 Curt Woolhiser 


Kremer, Ludger and Hermann Niebaum (eds) 
1990 Grenzdialekte. Studien zur Entwicklung kontinentalwestgermanischer Dialekt- 
kontinua. Hildesheim: Olms. 
Llamas, Carmen 
2010 Convergence and divergence across a national border. In: Carmen Llamas and 
Dominic Watt (eds.), Language and Identities, 227—236. Edinburgh: Edin- 
burgh University Press. 
Lundén, Thomas (ed.) 
2006 Crossing the Border: Boundary Relations in a Changing Europe. Eslóv: Fór- 
lags ab Gondolin. 
Nikotevic, Lidija 
2005 Myth, rhetoric and political relations at the Istrian border. In: Thomas M. Wil- 
son and Hastings Donnan (eds.), Culture and Power at the Edges of the State, 
229—253. Münster: LIT Verlag. 
O’Dowd, Liam 
2003 The Changing Significance of European Borders. In: James Anderson, Liam 
O'Dowd and Thomas M. Wilson (eds), New Borders for a Changing Europe, 
13-36. London: Frank Cass. 
Pinheiro, José Luís Palmeiro 
2009 Transborder cooperation and identities in Galicia and Northern Portugal. Geo- 
politics 14: 79-107. 
Pohl, Jacques 
1978 Communication field and linguistic field: the influence of the border (France 
and Belgium) on the French language. International Journal of the Sociology 
of Language 15: 85—90. 
Preston, Dennis (ed.) 
1999 Handbook of Perceptual Dialectology. Vol. 1. Amsterdam/Philadelphia: John 
Benjamins. 
Scheuringer, Hermann 
1990 Sprachentwicklung in Bayern und Ósterreich. Hamburg: Buske. 
Schifferle, Hans-Peter 
1990 Badisches und Schweizerisches Alemannisch am Hochrhein. Germanistische 
Linguistik 101: 3, 315—340. 
Schlobinski, Peter 
1997 The sociolinguistics of Berlin Urban Vernacular. In: Proceedings of the Second 
Workshop: Divergence and Convergence of Dialects Across Political Borders, 
223-246. Strasbourg: European Science Foundation. 
Sloboda, Marián 
2006 Folk views on linguistic variation and identities in the Belarusian-Russian bor- 
derland. In: Frans Hinskens (ed.), Language Variation: European Perspec- 
tives, 217—231. Amsterdam/Philadelphia: John Benjamins. 
Wilson, Thomas M. and Hastings Donnan (eds.) 
1998 Border Identities: Nation and State at International Frontiers. Cambridge: 
Cambridge University Press. 
Woolhiser, Curt 
2005 Political borders and dialect convergence/divergence in Europe. In Peter Auer, 
Frans Hinskens and Paul Kerswill (eds.), Dialect Change: Convergence and 


Border effects in European dialect continua 523 


Divergence in European Languages, 236-262. Cambridge: Cambridge Uni- 
versity Press. 
Zhurzhenko, Tatiana 

2006 The new post-Soviet borderlands: Nostalgia, resistance to changes, adaptation: 
a case study of three near-border villages, Kharkiv Oblast’, Ukraine. In: 
Madeleine (ed.) 

2006 Borderland Identities: Territory and Belonging in Central, North and East 
Europe. Eslöv: Förlags ab Gondolin, 57-87. 


28 Non-standard varieties 
in the areal typology of Europe 


Adriano Murelli and Bernd Kortmann 


1. Areal variation and areal typology 

2. Areal patterns in European dialect syntax 
2.1. Subject pronoun doubling in Dutch dialects 
2.2 Subject clitics in Lombard dialects 

2:3: Transitional zones 

3. Relative clauses in European dialects and non-standard varieties 
3.1. Word order 

32. Nature of the relativizer 

3.3. Syntactic role of the relativized item 

4. Final remarks 

1. Areal variation and areal typology 


Areal variation is central to both dialectology and areal typology. Language contact 
does not primarily take place between the (written) standard varieties of languages, 
but in day-to-day language contact between the spoken (typically non-standard) 
varieties of the adjacent languages involved. Hence the inclusion of dialects is 
bound to offer new insights to areal typology, i.e. concerning the areal distribution 
of individual syntactic variables in regionally restricted convergence areas. This is 
established practice for studies on, for example, the Balkan Sprachbund, but it is 
necessary also in the exploration of other contact superposition zones in (or invol- 
ving) Europe, such as the Circum-Baltic area and the Mediterranean. On a larger 
scale, the systematic study of grammatical variation in dialects has the potential for 
making a significant contribution to exploring whether Europe, or a large part of 
continental Europe, does or does not form one large linguistic convergence area 
(a veritable Sprachbund Europe, as it were). And yet, as pointed out by van der 
Auwera in Chapter 15 (this volume, section 4.1), dialects played at most a marginal 
role in the EUROTYP-project (1990-1994), just as “the social dimension of lan- 
guage diversity", in general, has been neglected by areal linguistics, according to 
Dahl (2001: 1460). The ultimate questions which the inclusion of dialects in Euro- 
pean areal typology might therefore answer are the following: (i) What does the 
"real", i.e., the vernacular, spoken non-standard, landscape of Europe look like? 
(ii) What is the typological profile of “real” Standard Average European, and in 
what way does it differ from SAE as it emerged in EUROTYP (once again cf. 
Chapter 15, this volume, section 3; cf. also Haspelmath 2001 and Kónig 2010)?; 
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and (iii) Is it possible, in the wake of works by Chambers on vernacular universals 
(for a discussion cf. Kortmann and Szmrecsanyi in press), but also Decsy (1973) 
and Haarmann (1976) on Europemes and Europeanisms, respectively, to identify 
vernacular Euroversals? It is this larger perspective which will stand at the centre 
of this chapter, especially the questions in (1) and (ii). 

As has been shown in numerous studies in European dialect syntax over the last 
10—15 years, the grammars of the (vast majority of the) non-standard varieties of 
a given language often differ strikingly from the grammar of the standard variety. 
Negative concord is a notorious case in point: it is typical only of the non-standard 
varieties of the Germanic languages, but not for the standard varieties (for details 
cf. section 4.2 in Chapter 15, this volume). Another, more subtle example can be 
given from that domain of grammar which will figure prominently in the present 
chapter, namely relative clauses. In typological studies of relative markers in the 
European languages, English is characterized as a language using, like almost all 
(standard varieties of) European languages, relative pronouns. It has even been 
claimed that the relative pronoun strategy is “a remarkable areal typological fea- 
ture of European languages, especially the standard written languages" (Comrie 
quoted in Haspelmath 2001: 1495). However, in non-standard varieties of English, 
as Hermann (2005) has convincingly shown, there is not a single dialect that does 
not use the relative particle(s) that, what, as/at more frequently than the relative 
pronoun De, the wh-strategy). Similar observations have been made for other 
morphosyntactic features, too, both in English and other European languages, so 
that the general claim has been formulated that findings from studies on dialect 
syntax may well serve as a crucial corrective for typological classifications (e.g. in 
the World Atlas of Language Structures (WALS) when, for the individual maps, 
passing judgement on the appropriate choice of the relevant symbol and colour 
of the one measuring point representing English, Dutch, Italian, or any given lan- 
guage). At the same time, language typology may serve as an important (addi- 
tional) reference frame and source of inspiration for dialectology when, for 
example, selecting phenomena to be investigated and appropriate methodologies, 
shedding light and interpreting the observable variation in light of what is known 
about patterns and limits of large-scale cross-linguistic variation (cf. the contribu- 
tions in Kortmann 2004 on how dialectology and typology may fruitfully interact 
and complement each other). Take, for example, the interpretation of dialect vari- 
ation in the domain of relative clauses against the background of Keenan and Com- 
rie's (1977) NP Accessibility Hierarchy, i.e. that very hierarchy which made the 
concept of hierarchies such a successful and firmly established one in functional 
typology. This approach has been adopted in various studies on relative clauses in 
English and German dialects (notably in Herrmann 2005 and Fleischer 2004), and 
will be adopted on a much broader scale for relative clauses in dialects and non- 
standard varieties of European languages in section 3. It is in this central section of 
this chapter that we will try to show with the help of one particular structural fea- 
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ture and almost classic object of study in late 20% century language typology, 
namely relative clauses, what can be gained from an areal typological view when 
systematically looking at a wide range of regional dialects and non-standard var- 
ieties of European languages. Before, however, some examples of areal variation 
in Europe on a smaller scale will be given from the domain of dialect syntax. 


2. Areal patterns in European dialect syntax 


In terms of areal variation, dialect (morpho)syntax is not different, in principle, 
from what is known about areal variation in accents and dialect vocabularies. Dis- 
tinct areas of syntactic variation can be identified for dialects, too, even if they are 
typically larger than those known from dialectological studies on phonological and 
lexical variation. This has clearly emerged from past and current research projects 
in some 20 European countries on morphosyntactic variation in (largely, but 
not exclusively, Germanic and Romance) dialects over the last 10—15 years. (For 
an overview cf. Kortmann (2009: 181—184), and the regularly updated homepages 
of what may justly be considered the hub of current research on dialect syntax in 
Europe: the Meertens Institute in Amsterdam, especially its project Edisyn, which 
has been instrumental in establishing a Europe-wide research network on dialect 
syntax: http://www.dialectsyntax.org/, http://www.meertens.knaw.nl/projecten/ 
edisyn/). In sections 2.1 and 2.2, two examples of such small-scale areal variation 
will be given, followed by a brief account of transitional zones in such morphosyn- 
tax-based areal patternings in section 2.3. 


2.1. Subject pronoun doubling in Dutch dialects 


From the study of Dutch dialect syntax, one phenomenon which yields nice areal 
patterns is subject pronoun doubling (cf. De Vogelaer 2008). Here are some 
examples of subject doubling in sentences with inverted (VS) word order (1a) and 
normal (SV) word order (1b and c): 


(1) a. Ga-de (gij) naar Brussel? 
£0.2SG-YOU gitic YOU song tO Brussels 

‘Are you going to Brussels?’ 
b. Ge-gaat (gij) naar Brussel. 
you, -g0.28G YOUsrong tO Brussels 

“You are going to Brussels.’ 
c. Gij gaat gij naar Brussel. 
YOU one £0.2SG YOUsrong tO Brussels 


“You are going to Brussels.’ 
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Map 1: Subject doubling in Dutch dialects, inverted word order 


Relevant for the present purposes will only be subject doubling as in (1a), i.e., sen- 
tences with inverted word order, an obligatory subject clitic -de, and an optional 
pronoun (gij) which doubles the subject clitic for pragmatic or stylistic reasons. In 
some Dutch dialects, subject doubling is found only for a limited set of subject pro- 
nouns, e.g., first and second person singular as well as second person plural in the 
city dialect of Antwerp. Map 1 shows that three broad areas can be distinguished 
on the basis of whether or not a given dialect may use all personal pronouns in con- 
texts such as (1a), and if not, which personal pronouns are allowed: just second 
person (singular and plural), or first person singular as well. Roughly, the three 
areas follow a west-(north)east axis. (For findings concerning this and other 
doubling phenomena in the syntax of European dialects, cf. Barbiers et al. 2008.) 


2.2. 


Subject clitics in Lombard dialects 


Another interesting areal pattern in European dialects is observable in Northern 
Italy. The data base for these dialects is the Atlante Sintattico d'Italia (ASIt; form- 
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erly ASIS; http://asis-cnr.unipd.it/). For Lombard dialects, a tripartite division into 
Alpine, Eastern and Western varieties is traditionally assumed on the basis of pho- 
netic features. This distinction is confirmed when morphosyntactic features are 
considered (Murelli 2006). In particular, the analysis of subject clitics on the basis 
of the ASIt reveals that the three groups of dialects differ in the number of subject 
pro- and enclitic forms they possess (see Table 1). In the examples, note that pro- 
clitics are used in affirmative sentences (2a) and enclitics in questions (2b). 


(2) a. Ti te mengä e te beef tüt al di. 
WOH eene yOu,;;, eat2sc and YOU... drink.2sG all the day 
“You eat and drink all the day.’ (Villa di Chiavenna, Alpine) 
b. Ti la cümprä-t? 
YOU strong it buy.2SG-YOU gitic 


“Are you buying it?’ (Villa di Chiavenna, Alpine) 


In Alpine dialects four subject proclitics occur regularly; enclitics range from one 
to six and are relatively frequent. In most varieties the 1PL-clitic derives from the 
substantive homo and the third person shows gender distinction both in the singular 
and in the plural. These three persons also have the same verb ending in the pres- 
ent, so the corresponding clitic may be seen as functionally motivated. Eastern dia- 
lects can be split into two areas: in the North-East there are four proclitics which 
occur regularly, in the South-East only three. As for enclitics, three to five are used 
in many varieties; however, especially in the South-East, there are varieties with no 
enclitics, at all. All varieties show gender distinction and formally identical end- 
ings for the third person singular and plural. The 1PL-clitic derives from Latin 
homo in the north-eastern dialects. In Western dialects three proclitics occur regu- 
larly, but no enclitics are attested. The overall profile of this area is quite varied: in 
particular, mountainous, isolated communities display a richer clitic system than 
urban varieties. Gender distinction is limited to the third person singular. 

Upon closer inspection (see also Map 2), the original tripartition may even be 
merged into two basic oppositions: ‘mountainous vs. plain’ and ‘urban vs. rural’. 
In fact, north-eastern (mountainous) varieties share many features with the Alpine 


Table I: The distribution of subject pro- and enclitics in Lombard dialects 
(+ = clitic is attested in the great majority of dialects; +/— = clitic is attested in 
about half of the dialects; — = clitic is attested in at most 1 variety). 


dialect group n proclitics enclitics 


ISG 2SG 3SG IPL 2PL 3PL 1SG 2SG 3SG IPL 2PL 3PL 


Alpine ll +=- + + + +- + +- + + +- +- +- 
Eastern 10 +- + + +- +- + +- +- + +- + 
Western 10 +- + + +- +- + 
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Alpine dialects 
4 proclitics 
| to 6 enclitics 


North-Eastern dialects 
4 proclitics 
3 to 5 enclitics 


MILANO 
H 
Western dialects 


3 proclitics 
0 enclitics 


South-Eastern dialects 
3 proclities 
0 to 3 enclitics 


Map 2: The distribution of subject clitics in Lombard dialects (Italy) 


ones (four obligatory proclitics, preservation of enclitics, gender distinction in 
third person, 1PL-clitic originating from Latin homo, three identical verbal end- 
ings), while south-eastern varieties, spoken in the plain, share some features with 
rural western varieties (three obligatory proclitics, reduced presence of enclitics, 
two identical verb endings). Similarly in the western area, urban varieties (Milano, 
Lecco, Bellinzona) never show more than three proclitics; instead, conservative 
varieties spoken in mountainous or rural areas rely more strongly on subject clitics 
as devices of person marking. 

When taking into consideration the whole diatopic axis, dialects can be seen as 
interacting with Standard Italian, which has no subject clitics. The use of clitics can 
be measured in terms of the degree to which the dialects converge towards Stan- 
dard Italian: converging dialects show a reduced inventory of subject clitics (here 
the prototypical example is Milanese, with only two subject proclitics and no en- 
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: clitic 6 5 4 3 2 desinence | 


Brione Bormio Monno Casalmaggiore Standard ; 
Semogo Malonno Mantova Italian : 
(Alpine var's) (north-eastern var's) (south-eastern var's) Milano 
Cevio district of Milano 
Locarno (western varieties) 


(Swiss varieties) 


Figure 1: The 'clitic-desinence' continuum in Lombard dialects 


clitics); non-converging varieties like the Alpine dialects show a nearly complete 
subject clitic paradigm. The Lombard dialects can be arranged on a person-mark- 
ing continuum ranging from a ‘clitic’ to a ‘desinence’ pole, according to which 
morphosyntactic form is preferably adopted for person marking. Converging dia- 
lects are close to the *desinence' pole, where Standard Italian is placed; non-con- 
verging dialects are close to the ‘clitic’ pole. The two poles of this person-marking 
continuum and the progressive decrease in the number of clitics attested in dialects 
are represented in Figure 1. 

Similar areal patternings along north-south or east-west axes can be observed 
for dialect-syntactic variation in many other European countries (e.g. for England 
cf. Kortmann 2010: 849—852 and Kortmann and Langstrof in press). 


2. Transitional zones 


In identifying such dialect areas for individual syntactic phenomena as in the two 
previous sections, what typically also emerges are transitional zones. Recall, for 
instance, subject pronoun doubling in Dutch dialects in section 2.1. Another 
example, taking us directly to that morpho-syntactic feature which will take centre 
stage in section 3, is the distribution of relativizers in Lower and Upper Sorbian 
dialects (cf. FaBke 1996: 170-176), as depicted in Map 3. In the Lower Sorbian 
speaking area the particle ako clearly dominates (90%); in the Upper Sorbian 
speaking area the relative pronoun kotryz (50 96) and the relative particle kiz (45 96) 
are most frequently used. Transitional dialects show an ambiguous behaviour: 
in the north of the Upper Sorbian core area the use of kotryZ and kiz decreases, 
in the south of the Lower Sorbian core area the use of ako decreases, both at the 
expense of a third particle, hac/(h)az, which becomes more frequent the further 
one moves away from both core areas. So, the most frequent relativizers in Upper 
and Lower Sorbian never occur in the same transitional zone. However, a major 
relativizer may be attested as a minor one in a transitional zone, e.g. kofryZ, or 
a minor relativizer of both core areas may be attested in contact areas, where it 
can also represent the one which is most frequently used, as is the case for coz 
Sto(j)Z. 
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Map 3: The distribution of relative elements in Sorbian dialects (based on Fake 1996: 176) 


The typical situation for syntactic variation, especially on a supraregional scale, 
can be likened to a cline or slope. The following properties can be observed in such 
transitional zones (cf. especially Seiler 2005), each individual transition zone ex- 
hibiting at least one, but typically a combination and (upon close investigation) 
perhaps even all of these properties: 


i. Frequency rates for the morphosyntactic phenomenon at hand (be it text fre- 
quency in corpora, frequency in terms of survey locations, or frequency of 
mention by informants) are somewhere in between the relevant rates for the 
clearly distinguishable syntactic areas. 
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ii. Where two dialect areas clearly delineated on syntactic grounds differ in terms 
of the number of variants they allow, the number of variants to be observed in 
the transition zone lies between the maximum and minimum in the two neigh- 
boring zones (consider again, for instance, Map 1). It is also possible, however, 
that more variants are allowed in the transition zone than in either of its neigh- 
boring zones. This is the situation in (iii): 

iii. The variants of a given syntactic phenomenon which are found exclusively or 
strongly preferred in clearly identifiable syntactic areas (e.g., variant A in zone 
1 and variant B in zone 2) are found side by side in the transition zone (i.e., both 
A and B) and may even combine (AB). As illustrated in Chapter 15 (this vol- 
ume, section 4.2), triple negation as used by many East Flemish dialect speak- 
ers (en niemand niet ‘not no-one not’) can be explained as a typical transition 
zone phenomenon (vis-à-vis en niemand ‘not no-one’ in West Flemish dialects 
and niemand niet ‘no-one not’ in the dialects of Brabant and Limburg). What 
may also be observed is a pronounced regional skewing in the transition zone 
as regards additional options: if, say, variant A is the sole or strongly preferred 
variant in zone 1 and, correspondingly, variant B in zone 2, then what can be 
Observed in a transition zone is, for example, that there is a shift from variant A 
to variant B as the variant preferred by informants as we move away from zone 
1 towards zone 2. 

iv. Ina transition zone where both variants of the dialect zones it separates can be 
found, the use of these variants is typically subject to syntactic constraints, 
such that each of the variants A and B can be used only in a subset of the con- 
texts in which they are used in zones 1 and 2 respectively. 

v. Aminor feature in the two dialect areas becomes a major one in the transition 
zone (as shown in Map 3 for což and sto(j)Z in the transitional dialects of Upper 
and Lower Sorbian). 


3. Relative clauses in European dialects and non-standard varieties 


Typological studies investigating relative clauses in Europe usually rely on data 
from (written) standard varieties. It is in this section that we will report on some of 
the major results of an in-depth investigation of relative clauses in European dia- 
lects and non-standard varieties (Murelli 2009), showing that in these varieties 
several other strategies are attested than in the standard varieties of European lan- 
guages. As a consequence, claims like Haspelmath's (2001: 1494) that Standard 
Average European languages have postnominal relative clauses introduced by an 
inflecting relative pronoun or a relative particle will need to be significantly quali- 
fied. Three cross-linguistically significant parameters of variation in relative 
clauses will briefly be considered: the word order (3.1), the nature of the relativizer 
(3.2), and the syntactic role of the relativized item (3.3). 
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3.1. Word order 


The prototypical European relativization strategy is postnominal, i.e. it follows the 
head noun within the same NP. In non-standard varieties at least one alternative 
word order is attested, i.e. the so-called correlative strategy: the relative clause is 
not embedded within the NP, but is adjoined to the main clause. This strategy is at- 
tested in a small number of European non-standard varieties: Ukrainian, as in (3), 
and Russian, as in (4). (For these and all of the following examples in section 3, see 
Murelli 2009 for full references on their sources.) 


(3) V jaku storonu vinok | poplyve, tudy divka zamiz 
in RPRO.ACC.E.SG side garland will.flow there girl in.marriage 
pide. 
will.go 


"Ihe direction that the garland flows on water, in that direction the girl will 
get married.’ (Dnipropetrovsk dialect, Mel nyčuk 1962: 114) 


(4 I kotoryj mal'cik ubil volka, on dal emu tri rublja. 
and RPRO.NOM.M.SG boy killed wolf he gave him three ruble 
‘And the boy who killed the wolf, to him he gave three rubles.’ (Rjazan’ dia- 
lect, Šapiro 1953: 107) 


3. Nature of the relativizer 


Concerning the kinds of relativizers attested in the European area, Comrie and 
Kuteva (2005: 498) claim that “the relative pronoun strategy dominates"; Cristofaro 
and Giacalone Ramat (2007: 84) state that “invariable relative elements are exclus- 
ively found in Western Europe". Still, the analysis of non-standard varieties reveals 
that there is more to it. First, relativizers that convey the syntactic role and possibly 
the gender and number of the relativized item in the standard variety may fail to do 
this in non-standard varieties. For instance, the Standard Rumanian sentence (5a) 
appears both as (5b) and as (5c) in the Muntenian dialect. The relative pronoun care 
remains uninflected in (5b), so it doesn't convey plural agreement; the syntactic 
position relativized is conveyed through the insertion of the preposition /a. In (5c) 
care acts as a relative particle: it remains uninflected and does not convey the syn- 
tactic role of the relativized item, a task which is taken over by the clitic le. 


(5) a. oamenii cărora le faci bine. 
men.DET RPRO.DAT.PL CL.DAT.3PL you.do good 
(Standard Rumanian) 

b. oamenii la care le faci bine. 
men.DET to RPRO CL.DAT.3PL you.do good 
(Muntenian dialect, Rusu 1984: 177) 
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c. oamenii care le faci bine. 
men.DET RPTL CL.DAT.3PL you.do good 
"The people to whom you do something good.' (Muntenian dialect, Rusu 
1984: 177) 


A similar phenomenon is attested in Bulgarian: the inflected masculine form of the 
pronoun kojto (accusative kogoto, as in (6a) and (7a); dative komuto) is replaced by 
the base form kojto for the relativization of subjects and direct objects (6b); the 
same form is preceded by a preposition when lower positions of the Accessibility 
Hierarchy are relativized (7b). 


(6) a. Covekdt,  kogoto (*go) vidjaxme ... 
man.DET RPRO.ACC.M.SG CL.ACC.M.3SG we.saw 


b. Covekät, ` Fotto go vidjaxme ... 
man.DET RPRO.M.SG CL.ACC.M.3SG we.saw 
"The man we saw.’ (Bulgarian, Miseska-Tomié 2006: 271) 


(7) a. Covekät, na kogoto (mu) dadoxme pismoto. 
man.DET to RPRO.ACC.M.SG CL.DAT.M.3SG we.gave letter.DET 


b. Covekät, na kojto (mu) dadoxme pismoto. 
man.DET to RPRO.M.SG CL.DAT.M.3SG  we.gave  letter.DET 
“The man we gave the letter to.’ (Bulgarian; Miseska-Tomié 2006: 271) 


In non-standard varieties, so-called specialized relativizers, i.e. elements that 
can relativize a single syntactic position (e.g. Standard English where or Stan- 
dard German wo can only relativize locatives), may fail to encode this position 
and/or be used for relativizing other syntactic positions, too, thus functioning 
as relative particles. Take Alemannic wo in (8) or Serbian/Croatian gdje in (9), 
both of which relativize the syntactic position of subject, and not locatives. Simi- 
larly, the syntactic position relativized by French dont in (10) is that of direct 
object: 


(8) Das isch e Fisch wó fliegt 


that is DET fish RPTL flies 
"That is a fish that flies.’ (Alemannic dialect, Fleischer 2004: 75) 


(9) Ali on, kad vidi majku gdje place, navali još većma i 
but he when he.sees mother RPTL cries assails still more and 
stane je zaklinjati, da mu kaže kto je. 


starts her beg.INF COMP to.him tells who is 
‘But when he sees his mother crying, he becomes even more insistent and 
begs her to tell him who s/he is.’ (Serbian/Croatian, Dmitriev 1972: 245) 
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(10) Dans cette ville, ilya dr règles dont toute personne 
in this town theres ten rules RPTL every person 
doit respecter et savoir. 
must follow.INF and know.INF 
“In this town there are ten rules that must be known and followed by every- 
body.’ (French, Gapany 2004: 189) 


The most pervasive phenomenon in non-standard varieties is perhaps the great 
number of relative particles, which, contrary to what Cristofaro and Giacalone 
Ramat (2007: 84) claim, also occur in East-European languages. This can be seen 
in Table 2, where all languages displaying more than one relative particle are 
listed. 


Table 2: European languages in which more than one relative particle is attested 


language relative particles 
Bosnian-Croatian-Serbian Sto, te, da 

Bulgarian deto, Sto(to), deka, da 
Czech co, jak, Ze 

English that, as, what, which 
Lower Sorbian kenz, což, ako, hac, (h)az 
Macedonian Sto, da 

Polish CO, Ze 

Rumanian ce, de, cd, sd 

Upper Sorbian kiz, kož, StojZ 


It must be pointed out that where several particles are present, they usually do not 
show random distribution. Interchangeability is quite restricted: for instance, the 
Czech relative particle jak may be substituted by co in sentences like (11). Still, 
Czech is quite an exception in this respect. 


(11) To je ten chlapec, jak jsem ti o nem říkal. 
that is that boy RPTL Lam to.you about him spoken 
"That's the boy I told you about.’ (Czech, Gotab & Friedman 1972: 37) 


More commonly, the distribution of relative particles varies either diatopically or 
semantically. In the former case, different particles are used in different dialectal 
areas. For instance, this is the case in Bulgarian: the particles det and detu are at- 
tested in eastern dialects, the particles deka and sto in western regions (Stojkov 
19933: 273). Similar cases are attested also in English (Herrmann 2005: 25-26) and 
in the Lower-Upper Sorbian dialectal area (cf. section 2.3 below). In other cases 
particles show semantic variation: some particles are used to encode "pure relative 
relations, other to encode *mixed' relative-consecutive or relative-final relations. 
This is the case with Rumanian ce/de as opposed to cd/sd and Macedonian Sto as 
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opposed to da. The use of sd in (12) and da in (13) gives the relative clause a non- 
factual value. 


(12) da om să doarmă cu ochii deschişi n- am văzut. 
but man RPTL sleeps with eyes.DET open not Lhave seen 
‘But a man who sleeps with open eyes, well, I have never seen this.’ 
(Non-standard Rumanian, Vulpe 1980: 136) 


(13) Imam braka da mi pomognat. 
Lhave brothers RPTL me help 
‘I have brothers who help me.’, ‘I have brothers to help me.’ 
(Macedonian, Miseska-Tomié 2006: 435) 


Finally, in non-standard varieties we are also likely to encounter combined relative 
elements: they may consist of two simple, i.e. monomorphemic, relativizers, a 
simple relativizer and a non-relative particle, or a simple relativizer and a resump- 
tive element. In (14) and (15) we find relative elements which are composed of a 
relative pronoun (ds dean, u kakoj) and a relative particle (wos, Stu). 


(14) s Hulz, ás deon wos dds gmächt is. 
DET wood from RPRO.DAT.N.SG RPTL that made is 
"The wood with which it is made.’ 

(North Bavarian dialect, Fleischer 2004: 65) 


(15) Dyk edit’ Za on von na  toj-ta losadi, stu u 
PTL goes PTL he there on ` that-PTL horse RPTL at 
kakoj dva Z'rjabénka Z’rjabilis’. 

RPRO.GEN.F.SG two foals were.foaled 


"There he goes on that horse that gave birth to two foals.’ 
(Kursk dialect, Russia, Akimova 1964: 142) 


In (16) and (17) the relative particle co and the zero-marker combine with a non- 
relative particle (to). 


(16) Duzy Albin, co to doskonale czyta. 
big Albin, RPTL PTL wonderfully reads 
‘Big Albin, who can read very well.’ 
(Polish dialect, Urbańczyk 1939: 31) 


(17) Popravilsja tot mužik Ø zmij-to ukusil? 
recovered that peasant snake-PTL bit 
‘That peasant who was bitten by a snake, is he well now?’ 
(Russian dialect, Šapiro 1953: 58) 
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Finally, in (18) and (19) a relative pronoun and a relative particle combine with a 
resumptive element (in both examples a pronominal PP). 


(18) am crezut că ia... lucruri de-a lu... femeia / cu care- 
Lhave believed COMP he.took things from DET woman.DET with RPRO 
o trăit cu ea// 


he.has lived with her 
‘I thought he took ... things from ... the woman he has lived with.’ 
(Non-standard Rumanian, Vulpe 1980: 129) 


(19) Toj stary dom, Sto ja u jaho zašla,  nichto ne pomnic’. 
that old house RPTL I in it entered nobody not remembers 
“Nobody remembers the old house where I entered.’ 

(Belarusian dialect, Akimova 1964: 142) 


In sum, even this short sketch demonstrates that in European dialects and non-stan- 
dard varieties many more relativizers are attested than just the relative pronoun. 
Moreover, uninflected relativizers seem to be evenly distributed in all European 
languages (with the noticeable exception of Hungarian and Estonian); they are 
clearly not confined to Western Europe. 


3:9. Syntactic role of the relativized item 


Having discussed how the syntactic role of the relativized item may be coded 
within the relative clause, Cristofaro and Giacalone Ramat (2007) identify two 
clear-cut areas in Europe: 


[The first area] roughly cover[s] Eastern Europe, and includ[es] Slavic languages, Finn- 
ish, Hungarian, and Rumanian. German and Greek also follow this pattern. Languages 
in this area typically present an inflected relative element that can be used for all syn- 
tactic roles. 

[The second area] roughly corresponds to Western Europe, and includes Romance and 
Germanic languages (except German), Greek and Irish. In this area, a variety of strat- 
egies are found that provide an overt indication about the syntactic role and possibly the 
gender and number of the relativized item [...]. These strategies alternate with strategies 
that provide no indication about the syntactic role, gender and number of the relativized 
item. (Cristofaro and Giacalone Ramat 2007: 83—84) 


However, when considering non-standard varieties, no evidence can be found for 
such a bipartite division. On the contrary, the vast majority of European languages 
exhibit both strategies, 1.e. those making the syntactic role explicit and others not 
making it explicit. In particular, the following phenomena are attested in almost all 
European non-standard varieties we investigated: 


i. Decumulation: the syntactic role of the relativized item is made explicit by 
means of a resumptive. This happens with low positions of the Accessibility 
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Hierarchy, like oblique (see (19) above), but also with higher positions, like 
subjects, as in (20). 


(20) Znaci Casnika maladzen’kaha, sto én Zonki ne mae? 
you.know butler young RPTL he wife not has 
‘Do you know the young butler who has no wife?’ 
(Non-standard Belarusian, Atrachovié 1966: 609) 


ii. Double encoding: the syntactic role of the relativized item is conveyed twice, 
usually by means of a relative pronoun and a resumptive element. Again, this 
strategy applies to all positions of the Accessibility Hierarchy, both high (sub- 
ject, as in (21)) and low (oblique, as in (18) above). 


(21) si el sS- o dus la ceilalti/ la C.V./ la B./ 
and he REFL has gone to others to C.V. to B. 
care ei... bea vin la B./ 
RPRO they drank wine at B. 
‘So he went to the others, to C.V, to B., who ... were drinking wine at B's." 
(Non-standard Rumanian, Vulpe 1980: 129) 


iii. No encoding (or: gapping): in this case the syntactic role of the relativized item is 
not made explicit. As in both previous cases, this strategy applies both to higher 
and to lower positions on the Accessibility Hierarchy, e.g. subjects, like in (22), 
and genitives, as in (23) - cf. also (17), where a direct object is relativized. 


(22) A eto vot synok Ø v armii sluziw vo flote tdk. 
and this PTL son in army served in navy PTL 
“And this here, this is the son who did his military service in the navy.’ 
(Russian dialect, Lapteva 1976: 332) 


(23) Vot  stul-ot Ø noga-to  izlomana. 
PAR chair-DET leg-PTL broken 
‘Here is the chair which has a broken leg.’ 
(Russian dialect, Lapteva 1976: 332) 


A special instance of the strategy in (iii) is the so-called case matching: here the 
syntactic role of the relativized item remains unexpressed because the relativized 
item has the same syntactic role or the same case form in the main and the relative 
clause. In (24) the head noun and the relativized item in the relative clause have the 
same syntactic role (oblique): for this reason, the syntactic role is not expressed in 
the relative clause. 


(24) Won jezo z tym awtom, ako cora jo Je. 
he goes with DET car RPTL yesterday he.is gone 
“He goes with the car he went with yesterday.’ (Lower Sorbian dialect, Janas 
1976: 187) 
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Finally, when considering again Keenan and Comrie's (1977) Accessibility Hier- 
archy, it may be stated that non-standard varieties are sometimes more consistent 
than their standard counterparts. Take, for instance, Standard English and Standard 
Swedish: here subject relativization by means of a zero-marker is not possible, al- 
though it is for object relativization. This runs counter to the prediction of the Ac- 
cessibility Hierarchy that if a position further down on the Hierarchy can be rela- 
tivized by a given strategy, then all positions to its left can be relativized by that 
strategy, too. The situation is different, however, in English and Swedish dialects: 
here subjects, too, can be relativized by a zero-marker, as shown in (25) and (26). 
Hence, in these dialects the zero-marker strategy fully patterns with the Accessi- 
bility Hierarchy. 


(25) Iseen a chap at Broughton Moor @ got his leg took off. 
(Herrmann 2005: 69) 


(26) Vi ha en ganska häfti kyrkoherde Ø spela golf. 
we have DET pretty cool vicar plays golf 
*We have a pretty cool vicar who plays golf.' 
(Burträsk dialect, Sweden, Klarsson and Sullivan 2002: 104) 


In sum, what the evidence from non-standard varieties presented in 3.1 to 3.3 
shows is that it is necessary to moderate the exceptional status ascribed to Euro- 
pean relative clauses in cross-linguistic investigations (e.g. by Comrie and Kuteva 
2005 on the dominating relative pronoun strategy in European languages). Euro- 
pean languages clearly do exhibit the typologically rare relative pronoun strategy, 
but they also exhibit a number of typologically more common strategies like gap- 
ping (see iii above) and pronoun retention (see strategies i and ii in section 3.3). 
Additionally, non-standard varieties also possess strategies that seem to go against 
the principle of economy: in examples like (19) the syntactic role of the relativized 
item is encoded twice within the relative clause, which can be regarded as redun- 
dant. Although exceptional, this strategy is attested in at least twelve European lan- 
guages distributed over the Romance, the Slavic and the Balkan area. 

The occurrence of each strategy may significantly vary from language to lan- 
guage: for instance, in Russian dialects the relative pronoun, the relative particle, 
the zero-marker, and the correlative strategy are attested (Sapiro 1953); in English 
dialects only the relative particle and the zero-marker are attested, whereas relative 
pronouns are virtually absent (Herrmann 2005). However, there is no evidence for 
the bipartition proposed by Cristofaro and Giacalone Ramat (2007): strategies that 
make the syntactic role explicit and those that do not make it explicit can both be 
found all over Europe. Surely, we may expect that each strategy shows a different 
frequency from language to language; quantitative language-specific studies are 
needed to establish the ranking of relativization strategies within (non-standard 
and standard varieties of) individual languages. As yet, the necessary electronic 
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corpora for such quantitative studies are lacking for the vast majority of (even the 
otherwise best-described) dialects and non-standard varieties spoken in Europe. 


4. Final remarks 


The two most important conclusions to be drawn from the previous sections are 
these: dialects and non-standard varieties, in general, should be given systematic 
coverage in language typology (at least for those languages for which solid ac- 
counts for these spontaneous spoken varieties are available), and dialects and non- 
standard varieties must be given systematic coverage in areal typology, especially 
so in the areal typology of Europe. After all, the dialects (as well as regiolects and 
emerging sub-standards; cf. Auer, chapter 26, this volume) of European languages 
are probably the best described and researched in the world, both synchronically 
and diachronically. Compare also Goebl (2001: 1484) on the great potential of a 
fruitful collaboration between areal typology and dialectology, not only in terms 
of what kinds of data are considered, but also in terms of methodology (e.g. in the 
domain of dialectometry). 

Anyone wanting to shed light on either small-scale linguistic convergence 
areas in Europe (or, for that matter, contact superposition zones; cf. Waelchli, 
chapter 17, this volume), or on large coherent parts of Europe constituting (one) 
large linguistic convergence area(s), such as Standard Average European, is well 
advised to include what is currently known about phonological and, especially, 
morphosyntactic and lexical variation in the (primarily regional) dialects of Eu- 
rope. As for the idea of a “Sprachbund Europe”, the systematic inclusion of data 
from dialects will help us evaluate previous observations and claims concerning 
distinctive, maybe even uniquely, European features, i.e. features not or only rarely 
observed in languages spoken in other parts of the world. In principle, the same 
applies to statements in the areal typological literature concerning structural fea- 
tures which are considered to be constitutive of a given linguistic convergence area 
in a small(er) part of Europe. Take, for example, the prototype of a Sprachbund 
within Europe, the Balkan Sprachbund: many alleged balkanisms can be observed 
outside the Balkan peninsula as well. And in many cases it is above all in dialects 
and non-standard varieties of non-Balkan languages that the very same or closely 
related features can be found (double object constructions, for instance, can also be 
found e.g. in colloquial Russian, Polish, or Czech, dialects of German and English; 
cf. Hinrichs 1999: 88). 

European linguistics is currently facing the unique opportunity to make the 
exploration of the structural properties of dialects and non-standard varieties, 
in general, a systematic part of typological research. The necessary prerequisites 
and overall research climate for this *micro-orientation", as van der Auwera calls it 
(chapter 15, this volume), could hardly be better. Not only has the data situation 
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strongly improved and is it constantly improving, due to a range of recent research 
initiatives in different parts of Europe (e.g. via the building up of large data resources 
and electronic dialect corpora offering recordings of dialectological/socioling- 
uistic interviews, but also authentic spontaneous spoken language and different 
kinds of, e.g. oral history, interview data). This will allow ever more fine-grained 
and large-scale (both qualitative and quantitative) studies to be conducted. At the 
same time the new generation of dialectologists and dialect syntacticians is highly 
versed in modern syntactic theories and debates as well as in (e.g. corpus-lin- 
guistic, statistical, or cartographic) methods for analyzing, interpreting and repre- 
senting their data and data analyses. This interest in current syntactic theory is 
matched (and in no small part fuelled) by a significant change in the overall climate 
in language typology and syntactic theorizing. Especially since the late 1990s the 
interest in structural variation on the micro-level has strongly increased. Syntacti- 
cians working within a formalist framework, for example, have turned “micro” in 
the guise of the study of microparametric variation. But also more and more adher- 
ents of functionalist-driven and usage-based theories have discovered the signifi- 
cance of data from non-standard varieties (for an overview of current theories and 
methods in European research on dialect syntax, cf. Kortmann 2009). In general, 
there is widespread agreement across all theoretical factions and research tradi- 
tions that variation (both on the macro- and the micro-level) has become the core 
explanandum of linguistic research. 

The true challenge now lies in the joining of forces of all the above-mentioned 
parties to the benefit (not only, but especially) of significantly advancing our 
knowledge in the areal typology of Europe at the grass-roots level, as it were. At 
present, the maps resulting from European dialect research projects are still maps 
of individual countries (just witness all the maps in this chapter). Truly com- 
parative research on the micro-level of language variation does not exist yet or is at 
most slowly budding (e.g. research on syntactic doubling phenomena under the 
umbrella of the European Dialect Syntax (Edisyn) project; cf. section 2 above). 
However, only once research on European dialects and non-standard varieties has 
reached truly pan-European dimensions in the sense that the relevant research no 
longer stops at the national boundaries and is (ultimately) able to provide maps 
showing the distribution of structural features and patterns across smaller and 
larger areas in Europe, will we have the opportunity of going beyond Sprachbund 
claims for the ausbau end of the European languages, i.e. for the standard (written) 
varieties, and identifying smaller and larger convergence areas among the Euro- 
pean vernaculars. For “big” typology, too, this may let linguistic Europe appear in 
a different light when considered against what is known about patterns and limits 
of variation among the languages of the world. 


See also the following chapters in this volume: 15 by van der Auwera, 16 by 
Tomić, 17 by Waelchli, 18 by Sansó, and 26 by Auer. 
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5. Outlook: on language politics 

1. Introduction 


In terms of figures, linguistic diversity in Europe does not really amount to any- 
thing much on a world scale. Yet, linguistic diversity is high on the European 
political agenda. In their quest for diversity in unity, the Council of Europe and the 
European Union show a special concern for the preservation and promotion of mi- 
nority languages in times of globalization. The reasons for this concern are mani- 
fold and range from the cultural value attached to languages to the symbolic role 
of languages in ensuring the political stability in language minority settings. With 
the European Charter for Regional or Minority Languages (1992) the Council of 
Europe developed a legal tool aiming at a culturally and politically inspired pre- 
servation and promotion of linguistic diversity on the territories of its member 
states. Against the background of a general description of the situation of language 
minorities in Europe, this contribution will especially focus on the European 
Charter. It will briefly describe its contents and will scrutinize the way in which the 
Charter links up with current language planning theory and practice. To conclude, 
this contribution will tackle a number of research desiderata that could advance 
both the contact linguistic study of language planning and language politics as well 
as the actual situation of Europe’s indigenous language minorities. 


2. Language minorities in Europe 


The public and political discourse surrounding language minorities in Europe dis- 
tinguishes in a rather generalizing way between allochthonous (or ‘new’) minor- 
ities and autochthonous (or ‘old’) minorities. The ‘new’ minorities consist of 
migrant workers or asylum seekers who recently (i.e. in most cases in the second 
half of the 20* century or later) settled in a European state. Examples are the Turks 
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in Belgium, the Portuguese in Luxembourg or the Moroccans in Spain. The "old 
minorities consist of communities that have lived in their respective territories for 
centuries. Examples are the Aromanians in Greece, the Welsh in the UK, the Sami 
in Sweden, the Livonians in Latvia, and the Hungarians in Slovakia. In this chapter 
the focus will be on the ‘old’ minorities. 

Although no commonly agreed definition of the term ‘autochthonous language 
minority' exists (cf. Extra and Gorter 2006 for a discussion), there is a consensus 
among contact linguists that the characteristics of an autochthonous language mi- 
nority are mainly to be seen as a difference in terms of its linguistic and cultural 
distinctiveness, and the inequality concerning its social status and its position 
vis-à-vis the dominant majority (cf. Rindler Schjerve 2006: 108). In Section 1 of 
Euromosaic I. The production and reproduction of minority language groups in the 
European Union, Peter Nelde, Miquel Strubell and Glyn Williams (1996: 10-12) 
expound the link between the differences in social status and power relations of the 
minority vs. the majority, on the one hand, and the role of the minority language as 
opposed to the majority language, on the other hand. Within a language sociologi- 
cal framework they convincingly argue that the differences in social status and 
power position as they exist between the minority and the majority are reflected in 
the lower prestige, the lower status and the less developed (in some cases: poorly 
developed or even non-existing) legitimisation and institutionalisation of the mi- 
nority language vis-à-vis the majority language. As a consequence, language very 
often develops into a significant symbol of social conflict in minority settings, even 
when it may not be the direct cause of the conflict (cf. Inglehart and Woodward 
1967; Nelde 2006). Especially during the ethnic revival in the 1960s and 1970s 
many simmering conflicts in indigenous language minority settings came to the 
surface and illustrated the great destabilisation potential of language minorities 
and the specific role of language in (neutralizing) social conflict. It is hardly a 
coincidence that precisely in the decades following the ethnic revival the situation 
of indigenous language minorities increasingly appeared on national and inter- 
national political agendas. 

Shortly after its first direct election in 1979 the European Parliament passed a 
series of resolutions on the need to preserve the language and culture of European 
autochthonous language minorities (cf. EBLUL 2003 for an overview). In 1984 the 
Council of Europe organised a public hearing on regional and minority languages 
in Strasbourg. In 1995 that same Council published the Framework Convention on 
the Protection of National Minorities. This convention entered into force in 1998. 
It grants individual rights to members of national minorities, including a relatively 
small number of language rights. A document that, however, solely focuses on lan- 
guages and more specifically on the preservation of minority languages in most of 
the aspects of the life of its speakers is the Council of Europe's European Charter 
for Regional or Minority Languages. It was published in 1992 and entered into 
force in 1998. The Charter holds an important position as a frame of reference in 
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European discussions on the preservation of linguistic diversity. Parayre (2008: 
125), for example, points out that the European Parliament in its 2006 Resolution 
on Multilingualism called on its Member States to comply with the Charter and to 
cooperate in a closer way with the Council of Burope's Language Policy Division. 

Since the Charter influences the development of a minority language policy on 
the level of the EU and the Council of Europe's member states, it seems appropri- 
ate to elucidate this document against the background of contemporary views on 
language planning. First, however, the discussion turns to a general description of 
its contents, based both on the text of the Charter and the explanatory report that 
accompanies it. 


3; The European Charter for Regional or Minority Languages 


Next to a preamble the Charter consists of five parts: general provisions (Part I), 
objectives and principles (Part II), measures to promote the use of regional or 
minority languages in public life (Part HI), application of the charter (Part IV) and 
final provisions (Part V). 

The Charter's overall objective is that parties who ratify the Charter will 
respect and will attune or continue to attune their policies, legislation and practice 
to the preservation and promotion of all regional and minority languages on their 
territory (Art. 7). Regional or minority languages are defined as languages that are 
(1) traditionally used within a given territory of a state by nationals of that state who 
form a group numerically smaller than the rest of the state's population, and (ii) 
different from the official language(s) of that state (Art. 1). Regional languages are 
languages spoken (either by a majority of the citizens or not) in a limited part of the 
territory of a state. Minority languages are languages either spoken by persons who 
are not concentrated on a specific part of the territory of a state or spoken by a 
group of people that, though geographically concentrated on a part of the state, is 
numerically smaller than the population that speaks the majority language. The 
Charter does not specify which languages correspond to the concept of regional or 
minority languages. Yet, Art. la states that the charter includes neither dialects of 
the official language(s) of a state nor the languages of migrants. 

As far as the measures to promote the use of regional or minority languages in 
public life are concerned, the Charter lists slightly more than 100 measures that 
pertain to the following areas: education (Art. 8), judicial authorities (Art. 9), ad- 
ministrative authorities and public services (Art. 10), media (Art. 11), cultural ac- 
tivities and facilities (Art. 12), economic and social life (Art. 13) and transfrontier 
exchanges (Art. 14). States who ratify the Charter commit themselves to imple- 
ment a minimal of 35 measures in respect of each language specified at the time of 
ratification. To ensure that states select measures from each of the areas in articles 
8-13, three measures must be chosen from each of the articles 8 and 12, and one 
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from each of the articles 9, 10, 11 and 13. Whereas Part II of the Charter applies to 
all regional or minority languages on a state's territory, states are free to decide 
to which regional or minority languages on their territory the measures of Part III 
will apply. Thus they can choose to start with a limited number of beneficiaries and 
extend that number at a later stage. Article 2 even leaves open the option to sign 
the Charter without selecting any language for the purposes of the application of 
Part III. 

Within the year following the Charter's entry into force, each state has to pres- 
ent a report to the Secretary General of the Council of Europe in which the policy 
pursued in accordance with Part II of the Charter and the measures taken (Part III) 
are explained. The other reports have to be submitted in intervals of three years. 
A Committee of Experts examines these public reports. This committee prepares a 
report for the Council of Europe's Committee of Ministers. That report is accom- 
panied by the comments that the states have been requested to make and may be 
made public by the Committee of Ministers. The Committee of Ministers itself 
hands its recommendations over to the states. The whole monitoring process is 
documented on the website of the Council of Europe (www.coe.int). 

At present 23 of the 47 member states of the Council of Europe have ratified the 
Charter and 10 have signed it. Of the BU's 27 member states 16 have ratified, 3 
have signed, and 8 have neither signed nor ratified it. In the case of those EU 
Member States who signed but did not ratify the Charter, the ratification process 
has not really started yet (the case of Malta) or is mainly hampered by constitu- 
tional and/or political problems (the case of France and Italy, though the latter 
seems to be close to ratification). In case of the EU Member States that did not sign 
the Charter the reasons are more diverse: 


— The Belgian government seems to fear that an acceptance of the Charter and 
especially of the minority language criteria used in the Charter could under- 
mine the territorial linguistic equilibrium as it has been cemented in the lan- 
guage laws of the 1960s — laws that helped to create political stability after a 
period of civic turmoil following World War II. 

— In Bulgaria and Greece it seems hard to find a consensus on those regional or 
minority languages to which the Charter could or should apply. Language- 
related ethno-religious conflicts and the fear of a fragmentation of the national 
fabric to the example of some of the neighbouring Balkan states complicate the 
discussions in Greece. 

— The Baltic States are still too preoccupied with the reassessment of their 
national languages and the repositioning of the Russian language to devote 
much attention to the Charter. 

— The problem in Ireland is that Irish has a double status: it is the only minority 
language but at the same time it is the first official language of the Republic. 

— With its estimated 10,000 speakers of Mirandese and its estimated 5000 speak- 
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ers of Caló (an Iberian Romani language) Portugal apparently does not really 
feel the need to complement the existing legislation on minority languages with 
provisions from the Charter. 


After this short and generalizing sketch of some states’ reluctance to sign the 
Charter, the following paragraphs turn to a discussion of the link between the 
Charter and existing views on language planning. 


4. The Charter against the background 
of language planning theory and practice 


Both the Charter and the explanatory report broadly correspond to the state of the 
art of language planning theory at the beginning of the 1990s. Following the basic 
corpus-status distinction as made by Haugen (e.g. 1969) and Kloss (e.g. 1969) and 
the extension provided by Cooper (1989), the Charter proposes measures in all 
three branches of language planning: corpus, status, and acquisition planning. Cor- 
pus planning mainly implies the standardization and/or elaboration of the lexicon, 
grammar and the orthography of a given language. Status planning aims at chang- 
ing the societal status and the functional range of a given language without neces- 
sarily aiming at an increase of the number of people actually using this language or 
language variety. Acquisition planning, finally, aims at an increase of the number 
of users of a given language. 

In the field of acquisition planning the Charter recommends the availability of 
education in the regional or minority language on all levels of education ranging 
from kindergarten to adult education (Art. 8). To raise the status of a language the 
Charter (Art. 9—13) promotes measures in a number of domains of language use 
that have featured prominently in sociolinguistics since the early 1960s and were at 
that time adapted from the work of Georg Schmidt-Rohr (e.g. 1933). Corpus plan- 
ning in the context of the Charter is mainly related to translation and terminologi- 
cal activities (1) supporting the role of the language in the media and the courts and 
(ii) aiming at the maintenance and development of administrative, commercial, 
economic, social, technical or legal terminology (cf. Art. 9 and 12). Those 
measures directed at the training of minority language teachers and the availability 
of minority language education also imply (a concern for) corpus planning issues. 

By leaving the choice of appropriate measures for each of the selected minority 
languages on their territory to the member states themselves, the Charter acknowl- 
edges the necessity of tailor-made approaches in language planning. This is rooted 
in Haugen's ecology of language and repeated in Joshua Fishman's groundbreak- 
ing work on Reversing Language Shift (1991). The fact that the CoE's member 
states who ratify the charter are obliged to select language planning measures 
in the different areas of language use mentioned in articles 8-13 can be linked to 
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Haugen and Fishman's plea for the development of complementary and mutually 
reinforcing language planning measures in different domains of language use 
(cf. also Cooper 1989). Moreover, the system of reporting and monitoring links the 
Charter to the necessity for language planning evaluation (cf. Rubin 1983). 

This short sketch makes clear that the skeletal structure of the Charter beyond 
any doubt corresponds to the basics of language planning theory at the beginning 
of the 1990s. Yet, the Charter also contains a number of flaws. 

Although Art. 1a of the Charter and point 32 of the explanatory report ex- 
plicitly state that the Charter does not concern local varieties or different dialects of 
one and the same language, the explanatory report in the same point also empha- 
sizes that the charter does not pronounce itself on the often-disputed question of 
which forms of language constitute separate languages. No wonder, then, that the 
extensive lobbying of inventive pressure groups results precisely in the official rec- 
ognition of such local speech forms as regional or minority languages (cf. the situ- 
ation of Limburgish and Lower Saxonian in The Netherlands). By passing on the 
language-dialect question to the member states without even providing compul- 
sory criteria that should be used to underpin their decisions, the Charter not only 
opens the door for the recognition of what Hans Goebl (2002) — building on Kloss’ 
original Abstand-Ausbau distinction — has referred to as dream and ghost lan- 
guages. It also provides the member states with the possibility to ‘recover’ regional 
or minority languages as mere dialects of the national language. 

Linguists not only frown upon the way in which the Charter subverts the 
demarcation of its planning object, i.e. regional or minority languages. They also 
criticise the fact that on the level of the Council of Europe no instruments seem 
to exist that could help bridge the at times considerable gap between the theoreti- 
cally envisaged straightforward process of selecting, elaborating, implementing 
and evaluating appropriate language planning measures and tortuous daily state 
politics. This critique is backed up by jurists' analyses of the Charter and the way it 
is treated by the Council of Europe's member states. 

From Pfeil (2003: 32-33) one can derive that the selection of language plan- 
ning measures by the Council of Europe's member states is certainly not always 
the result of a dedicated search for the most appropriate measures. The selection is 
rather influenced by budgetary constraints and the efforts governments are willing 
to invest in the promotion of an indigenous language minority as only one of many 
points on the agenda during a period of government. As a consequence, the temp- 
tation to opt for at least a number of measures that can easily be elaborated and 
implemented is never far away. Furthermore, the elaboration and implementation 
of those language planning measures that are eventually selected is slowed down 
because of deficient communication between central and regional/local authorities 
and/or uncertainties regarding the overall and final (financial) responsibility for 
the implementation of the Charter — a responsibility that is vested with the states 
(cf. Parayre 2008: 128). Although the defects in the process of selecting, elaborat- 
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ing and implementing measures are not entirely kept out of the states’ reports, criti- 
cal reading is needed to balance the reports (cf. de Varennes 2008: 33). The same 
creativity with which states report on the implementation of minority language 
planning measures also shines through the way in which they manage to ignore 
part of the recommendations made by the Committee of Ministers (cf. Parayre 
2008: 129). 

With the ultimate aim of smoothening the whole minority language planning 
process envisaged by the Charter, the European Parliament and especially its In- 
tergroup for Traditional National Minorities, Constitutional Regions and Regional 
Languages (founded in 1988) try to exert pressure on the Council of Europe's 
member states. The Intergroup thus adds to the indirect diplomatic pressure on 
member states as it is also exerted by the Council of Europe's Committee of Ex- 
perts that — as Parry (2008: 269) expresses it — has the advantage of being able to 
fuel national/regional debates with external and international detached expertise. 
Yet, the Committee of Experts is in need of more means and instruments to effec- 
tively accomplish its task. The same applies to the Language Policy Division of the 
Council of Europe with which the Committee of Experts cooperates. 

As early as in 2001, on the occasion of the international conference ‘From The- 
ory to Practice — The European Charter for Regional or Minority Languages', Gijs 
de Vries, at that time State Secretary of the Ministry of the Interior and Kingdom 
Relations in The Netherlands, suggested to create a database with the details 
of everything European countries are "doing to improve the position of regional 
or minority languages and to implement the intent of the charter" (cf. Council of 
Europe 2001: 10). He considered a database the appropriate means to gauge and 
compare measures in the field of language minorities and to develop "best prac- 
tices’. Fernand de Varennes shared his opinion and Donall O’Riagain, the former 
president of EBLUL, added on to it by suggesting the establishment of a European 
centre for linguistic diversity that ^would serve as a clearing-house for language 
planning, for data gathering and for the sharing of expertise" (cf. Council of Eu- 
rope 2001: 18; 26). Similar propositions were presented at an international confer- 
ence on the Charter in Swansea in 2006 (cf. Council of Europe 2008). 

Anticipating a more systematic collection of best practices in minority lan- 
guage planning, the next paragraphs list a number of general points of interest in 
relation to the three branches of language planning in European minority settings 
today: 


Status planning — Generally speaking, the main concern of language minorities in 
Europe most certainly is the increase of the societal status and the spread of the 
functional range of the minority language in order to try and ensure the survival of 
the minority language. Regions such as Catalonia, Wales and South-Tyrol serve as 
longstanding shining examples on how to proceed. Language planning agencies in 
these regions consciously and successfully aim at a coordinated harmonization of 
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language use in such sectors of society as education, social welfare, adminis- 
tration, culture and (local) economy, starting out from the belief that in terms of 
language use supply creates demand rather than its axiomatic converse. Next to co- 
ordinated efforts, however, spontaneous (or grassroots) efforts also are largely re- 
sponsible for language revival and maintenance in Catalonia, Wales and South- 
Tyrol. Since spontaneous activities are hardly documented, the chances that they 
will be ignored or only briefly touched upon in a list of best practices is rather 
great. Yet, a stronger consideration for the systematic and comparative analysis of 
(un)successful (un)planned language planning activities at the grassroots level in 
well-known as well as hardly known minority settings could especially be of inter- 
est to such language minorities that — contrary to the Catalans, the Welsh or the 
South-Tyroleans — do not have a strong autonomy and do not have a large number 
of financial and institutional means at their disposal. The SMiLE-report, which 
examines 17 different (grassroots) interventions in favour of regional or minority 
languages in various settings and specifically looks at the goals, the outcomes and 
the cost-effectiveness of the interventions, could serve as an example on how to 
proceed (cf. Grin et al. 2002). 


Acquisition planning — Language minorities facing a situation in which the inter- 
generational continuity of the minority language in the home, family and neigh- 
bourhood is interrupted tend to strongly focus on acquisition planning to maintain 
and/or increase the number of minority language users. Certainly in Europe it 
seems that acquisition-planning measures have the potential to substantially con- 
tribute to language shift reversal. In their critical analysis of the regional results of 
the PISA-study, Weber and Nelde (2004) point out that above all regions like Cata- 
lonia, Wales and South-Tyrol make high scores. One reason for this is that these re- 
gions have successfully gone through a period of devolution and have managed to 
positively influence the attitudes vis-à-vis the minority language by promoting the 
position of the minority language in the local economy. Another reason for the high 
PISA-scores is that the regions mentioned managed to reform their educational 
system in compliance with sound pedagogical concepts aiming at the promotion of 
minority languages. Considerable attention was (and still is) paid to the share of 
the minority language in the curriculum, the goals of minority language education, 
the methods chosen to pass on the minority language, the organisation of minority 
language teacher training, the continuity of minority language education on all 
school levels, and the link between education and extra-curricular activities of all 
sorts are all well-considered. A catalogue of best practices should especially high- 
light these pedagogical considerations if it is meant to help not only the strong mi- 
nority language communities to transcend the (in language minority surroundings 
often experienced) pre-occupation with strengthening the language in the folklor- 
istic space and to relegate the intergenerational transmission of the minority lan- 
guage to kindergartens and/or primary schools. Good examples on how to mould 
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general principles on minority language education in such a way that they fit the 
own language minority setting can be found on the website of the Istitut Régional 
de Recherche Educative in Val d’Aoste (Italy) (www.irre-vda.org). 


Corpus planning - In the European political discourse on language minorities cor- 
pus planning, i.e. the codification (involving graphization, grammatication and 
lexication) and elaboration (involving terminological modernization and stylistic 
development) of minority languages, is hardly mentioned. It seems that corpus 
planning is thought of as something that goes without saying. Nothing is, however, 
further from the truth. Corpus planning even poses a constant challenge to estab- 
lished institutions such as the Fryske Akademy (in Ljouwert/Leeuwarden, the Ne- 
therlands), Euskaltzaindia (the Basque Language Academy in Bilbo/Bilbao, Spain) 
and the Sorbski Institut (in BudySin/Bautzen, Germany). Corpus planners working 
in these institutions do not only have to codify and elaborate their minority lan- 
guage in a setting that is characterized by intense language contact and subject to 
the pressure of English as a lingua franca. They also experience that their work is 
not purely linguistic but is heavily intertwined with the social context they work in. 

This context influences the selection of a norm that precedes codification and it 
also influences the elaboration of the codified language. Corpus planners are 
doomed to experience how different forces in society attach emotional values to 
specific historically grown linguistic varieties and sometimes experience the rejec- 
tion of specific features of their own variety almost as a denial of their linguistic 
identity. The emotional factor attached to codification explains why in the process 
of codification most often attempts are made to merge features of the existing var- 
ieties. Yet, this not always proves to be an ideal solution. 

In a recent attempt to create a common Ladin standard in Northern Italy the 
corpus planners of SPELL (Servisc de Planificazion y Elaborazion dl Lingaz 
Ladin) were confronted with the choice of either promoting one of the written var- 
ieties used in the valleys of the Ladin Dolomites to the Ladin standard, or trying to 
develop a new standard Ladin, based on common characteristics of all the written 
varieties used in the valleys. They finally decided to choose the second option. 
Building on the work of the Swiss professor Heinrich Schmid the SPELL-team de- 
veloped a grammar for and a dictionary in standard Ladin (cf. wwww.spell- 
termles.ladinia.net). Hope existed that SPELL's endeavors would lead to a general 
acceptance of ‘Ladin Dolomitan' as a unified written variety of Ladin that could 
help to further status and acquisition planning measures (e.g. the production of 
teaching materials, the availability of laws and administrative documents in the 
minority language, etc.). This hope, however, was in vain. Shortly after Ladin Do- 
lomitan was launched, the government of South-Tyrol decided to formally exclude 
it from all its legal texts, publications and websites. At the local level, Ladin Do- 
lomitan is also hardly used in the administration and its use in schools is blocked in 
favor of local idioms. 
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The Ladin example shows that the road to the spread of the new norm is a hard 
and weary one and quite some institutional support and a positive social and politi- 
cal climate is needed for it to settle down in the language community. A list of best 
practices in language planning should not ignore the issue of corpus planning and 
could especially benefit from in-depth research on the sociology of corpus plan- 
ning and especially on the political forces that (try to) influence corpus planning 
decisions. Not only the study of corpus planning, but also the study of status and 
acquisition planning could profit from a greater consideration of the way in which 
political forces at all levels of society (try to) influence language planning deci- 
sions. 


5. Outlook: on language politics 


Research on the relationship between language and politics has long concentrated 
on the treatment of language issues on the institutional political level and the level 
of party politics, i.e. those levels of society at which official language policies are 
given shape!. Pioneering work in this respect was done by the Canadian political 
scientist Kenneth D. McRae. Inspired by Inglehart and Woodward's (1967) explo- 
ration of the relationship between linguistic concentration and political stability, 
McRae started a series on Conflict and Compromise in Multilingual Societies in the 
1980s. It currently consists of three volumes: one on Switzerland published in 1983, 
one on Belgium published in 1986 and one on Finland published in 1997. In his 
series, McRae for each of the selected countries identifies when and how basic divi- 
sions in social structure become politicized, how they escalate into political con- 
flicts, when and how attempts are made to moderate and/or resolve these conflicts, 
and to which degree social or political cleavages become incorporated in the struc- 
ture of formal institutions. McRae shows a primary concern for the language di- 
mension as a form of social cleavage. The framework to investigate the language di- 
mension in each of the three volumes is organized around four main headings: (1) 
historical traditions and developmental patterns, with special reference to the evol- 
ution of the language groups (including language minorities) and language policy; 
(2) the social and demographic structure of the language groups, and relationships 
between language divisions and other social divisions; (3) the perceptions and at- 
titudes of the language communities, and how these attitudes have been mobilized 
or manifested in political parties or other associations; and (4) constitutional and in- 
stitutional arrangements concerning languages (McRae 1997: 1—2). Although 
McRae from the 1980s onwards influenced the works of such linguists working on 
language conflict or those interested in discourse analytical approaches to language 
ideology and language tensions (cf. recent work in the Journal of Language and 
Politics edited by Ruth Wodak and Paul Chilton and published by John Benjamins), 
he long remained one of the very few political scientists to systematically tackle the 


Minorities, language politics and language planning in Europe 557 


relationship between language and politics. In recent years, however, this field of 
study has gained vigor. In the wake of the EU’s debate and focus on regionalism, 
political scientists mainly interested in ethno-politics, the nation-state-question 
and/or multi-level governance increasingly started to pay some of their attention to 
the interplay between (the debate on) regional autonomy and (the shaping of) lan- 
guage politics, policy and planning in the EU (cf., Keating 1998, De Winter et al. 
2006). More importantly, however, political science in recent years also produced a 
number of volumes explicitly devoted to the possible contribution of political the- 
ory and policy analysis to the societal management of linguistic diversity. 

In 2003 Will Kymlicka and Alan Patten edited a volume on Language Rights 
and Political Theory. The book examines the issue of linguistic diversity and lin- 
guistic rights from the perspective of normative political theory. Following Kym- 
licka and Grin (2003: 16), the leading question behind this approach is what so- 
ciety should do in favor of language preservation and why some actions may be 
said to be more justified than others. Spurred on by Francois Grin, this rather the- 
oretical-philosophical approach has been complemented by a language policy 
analysis approach that studies how society’s goals regarding linguistic diversity 
can be reached and if some way of reaching them is preferable to other ways. It 
does so by analyzing and interpreting various legal, international, political, econ- 
omic, demographic, historic and cultural factors determining language policy. This 
analysis takes place against the background of larger systems of values, beliefs and 
political ideologies (cf. Grin 2003; Kymlicka and Grin 2003: 19-21). 

As can be witnessed in the case studies listed in the book Nation-Building, Eth- 
nicity and Language Politics in Transition Countries (2003) edited by Farimah 
Daftary and Francois Grin, the language policy analysis approach that covers is- 
sues related to the status, the corpus as well as the acquisition of languages not only 
addresses the institutional level and the level of party politics but all politically rel- 
evant levels of society. It thus meets the long standing desideratum that political 
science should extend its scope and should not only focus on the institutional side 
or the content side of politics but also on politics as a formal, and above all infor- 
mal or sometimes hidden process (cf. von Alemann 1995: 544). 

The future will show whether the still relatively young language policy ap- 
proach will gain firm footing, in how far it will be able to influence and alter the 
existing views in the largely descriptive research on language politics, policy and 
planning and in how far it will be able to influence the actual language political, pol- 
icy and planning practice. In terms of its obvious contribution to the development of 
best practices it would be a lost opportunity if all those interested in (the study of) 
the preservation of linguistic diversity would fail to make use of its potential. 


See also the following chapters in this volume: 23 by Breu, 25 by Extra, 30 by Gil- 
les, 34 by Wodak and 44 by Wright. 
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Note 


1 Along the lines of francophone literature on language policy and planning, language pol- 
icy is understood here as the sum-total of choices and objectives that — following a politi- 
cal process — are meant to regulate (the use of) languages in social life. Language plan- 
ning aims at the realization of these choices and objectives through concrete measures 
(cf. Labrie 1996 for a broader discussion). 
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1. Introduction 


This chapter explores the sociolinguistic dynamics in speech communities located 
on the Germanic-Romanic language border by presenting three case studies for 
Belgium, Luxembourg and the province of Lorraine in France. The specific multi- 
lingual situation in these countries is the result of a long lasting (socio)linguistic 
contact between Germanic and Romance language varieties. In the first section the 
overall development of the language border will be discussed. The following sec- 
tions will then deal with the developments of the language situation for the three 
speech communities. It will become clear that socio-political developments have 
led to different kinds of contact situations and different multilingual societies. 


2. The development of the Germanic-Romance language border 


The language border separating Germanic from Romance languages/varieties 
emerged during the late Roman Empire and stabilized in the 11‘ century. It begins 
geographically in the west at the border between France and Belgium (near Bou- 
logne and Calais) and then runs more or less horizontally through Belgium. Close 
to the Belgian eastern frontier, the language border then moves southwards along 
the western border of Luxembourg, Lorraine and Alsace. It then enters Switzerland 
and divides the territory, roughly speaking, into a western French-speaking area 
and an eastern German-speaking area. In southern Switzerland the language border 
turns eastwards and enters Italy (van Durme 2002). Surprisingly, this language 
border has been relatively stable since the Middle Ages. Nevertheless, on both 
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sides transition zones have emerged which provide evidence for the fact that a lan- 
guage border in the strict sense does not exist. Due to this kind of language contact 
several multilingual speech communities have developed. 


3. Case studies on language policies at the language border 


"Language policies’ are regarded here as regulatory mechanisms that affect the 
ecology of languages and language usage in a certain country or society. These 
policies can either be explicit or implicit and official or unofficial. Language laws, 
for example, are a clear case of explicit and official policies, whereas company- 
internal language regulations are instances of explicit and unofficial policies. Lan- 
guage policies do not only influence the dimensions of language usage, they 
also establish hegemony. Language policies are the result of a “institutionalization 
of language as a basis for distinctions among social groups (classes)" (Tollefson 
1991: 16). 

The case studies below begin with a short history of the territory and emerg- 
ence of multilingualism at the language border, followed by a discussion of lan- 
guage policies (e.g., legislation, school system, regional issues). Most policies 
presented here concern the (sociolinguistic) status of languages; corpus planning 
(e.g., the elaboration of language learning materials) will only be touched upon 
briefly. 

The following map illustrates the region of the language border under dis- 
cussion. The language border is indicated by a dashed line; state borders are drawn 
with a straight line. 


3.]. Belgium 


Officially, Belgium is a trilingual country, where Dutch, French and German are 
spoken in different regions. At first sight, this brief description thus sketches a 
straightforward picture of territorial multilingualism. Nevertheless, as is well 
known, organizing and maintaining Belgian multilingualism has become an enor- 
mous political endeavor creating several social problems. Throughout the national 
history of Belgium from 1830 onwards language policy issues have played a major 
and also very conflictual role in the relationship between the Flemish and Walloon 
populations. 

A glance at the history of Belgian language policies helps illuminate this con- 
flict. In 1794 the territory belonged to France and, in accordance with its mono- 
lingual policy, the French language was assigned the status of the only possible 
administrative language. In 1830 the central administrative positions to govern the 
newly founded state were in the hands of a small French-speaking elite. This was 
also true for Flanders, where French was the only officially allowed administrative 
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the case studies 


language. Consequently local Flemish varieties were suppressed and had the 
status of low-prestige spoken dialects. Right from this time a Flemish movement 
emerged to enhance the status of Dutch by improving language teaching and learn- 
ing in schools. This lasted until 1889 when for the first time French and also Dutch 
were declared the two official languages of Belgium. The following years up until 
today are characterized by a growing separation of the two major language groups. 
Instead of applying the official bilingualism policy to the whole territory, largely 
monolingual territories were defined. These language policy developments were 
accompanied by modifications in other public domains: Flanders and Wallonia 
have largely independent administrative and governmental structures (e.g. inde- 
pendence in legislature, jurisdiction, school authorities, and cultural matters). Vari- 
ous modifications in the constitution (1970, 1980, 1988, 1993) and language laws 
(e.g. 1932, 1963; see Verdoot 1996: 1115-1117.) led to a strong federalization of 
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Belgium. This development also called for an accurate fixing of the language 
border, i.e. the demarcation of the linguistic territories. Also, in order to maintain 
‘linguistic peace’, language censuses have generally been prohibited. 

With regard to the structure of the linguistic territories, four types can be de- 
fined, each of them applying partly different language policies. 

(1) Monolingual regions: Dutch is the language of the northern region of 
Flanders, sometimes also called ‘Flemish’ (Vlaams), neighboring The Netherlands 
(5.9 million inhabitants). With regard to language, the region of Flanders and The 
Netherlands established a ‘language union’ (Nederlandse Taalunie) in 1980 to pro- 
mote a unified discussion on linguistic issues, language policies and language teach- 
ing. The two national varieties of Dutch are quite similar, although some tendencies 
for divergence are noticeable. French-speaking Wallonia (Wallonie) is located to the 
south of the language border (3.3 million inhabitants). Through the Communauté 
francaise de Belgique it is a member of the ‘International organization of French- 
speaking countries' (Organisation internationale de la francophonie). 

The strict monolingualism in these two regions manifests itself in, for example, 
different place names on official road signs. Thus, several toponyms have different 
names according to region (in Wallonia: Anvers, Liége, Bruxelles, Gand; in 
Flanders: Antwerpen, Luik, Brussel, Gent). With regard to the labeling of products, 
the laws for consumer protection state that the label of a product has to be in the 
language(s) of the region where the product will be sold. For reasons of conveni- 
ence most products show bilingual labels in French and Dutch, sometimes even 
German is included. At least in this domain of language use the official multilin- 
gualism of Belgium is demonstrated. 

A measure to ensure national multilingualism has been implemented in the 
school system (Verdoot 1996: 1114). Besides the main language, Dutch or French 
respectively, pupils are encouraged to choose the language of the other region as 
the first foreign language. Although until the 1990s Dutch was the first choice in 
Wallonia, one can observe a clear tendency towards the choice of English as the 
first foreign language instead. Further activity to promote multilingual compet- 
ences is underway in the form of newly created immersion programs, mainly in the 
Walloon region. 

(2) Bilingual region: A special status has been attributed to the Belgian capital 
Brussels (0.9 million inhabitants). Geographically located in Flanders, it is never- 
theless officially bilingual (Dutch/French). Being originally a Dutch-speaking 
town, French has become the most frequently used language in Brussels. Although 
the Dutch-speaking population forms a minority today, all public administrative 
services have to be offered equally in the two languages, making the city of Brus- 
sels an example of an extremely complex linguistic situation with its own language 
legislation (Treffers-Daller 2002). 

(3) Monolingual regions with linguistic facilities: Communities located around 
the capital of Brussels or along the strict language border have been awarded lin- 
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guistic ‘facilities’. Members of minorities in these communities have the right to 
use their language in certain administrative domains. For example, French-speak- 
ing inhabitants of the Flemish community of Sint-Genesius-Rode/Rhode-Saint- 
Genése to the south of Brussels have facilities for the use of French. The reverse 
is true for the Dutch-speaking inhabitants of the Walloon community Mouscron/ 
Moeskroen or for the German-speaking inhabitants of the Walloon community 
of Malmedy/Malmünd. Negotiating the linguistic facilities for these approx. 30 
“communes à facilités’ has led to several severe language conflicts on the local and 
even on the national level (Bouillon 2008). 

Furthermore, the German-speaking community in the east of Belgium also 
belongs to this group of monolingual regions with linguistic facilities (for details 
see below). 

(4) Monolingual regions without linguistic facilities: For various reasons some 
smaller areas with linguistic minorities have not been provided with linguistic fa- 
cilities. The Arlon area (‘South Old Belgium’; see below) and the community of 
Bocholz/Beho (‘Central Old Belgium’) belong to this group. 

In the remainder of this section, the structure of the German-speaking commu- 
nities in Belgium will be discussed in more detail (see also Darquennes 2004). 
A few predominantly German-speaking communities located on the language 
border in the east have belonged to Belgium ever since its foundation in 1830. This 
area, called ‘Old Belgium’, consists of ‘Old Belgium North’ bordering with the 
Netherlands in the north, where one can find Limbourgian varieties, “Old Belgium 
Central’, consisting only of the community of Bocholz/Beho, and ‘Old Belgium 
South’, i.e. the region around the city of Arlon. 

The Arlon region (Areler Land) formerly belonged to Luxembourg and was part 
of the so-called ‘quartier allemand’, where German was taught in school and Lux- 
embourgish was the spoken variety. French-German bilingualism was quite com- 
mon, especially among the élites. After becoming part of newly founded Belgium in 
1839 the influence and usage of French increased due to several factors: deliberate 
relocation of administrative personnel from the French-speaking parts into the 
Arlon region; post-war reactions to the attempts of the German occupying forces 
during the two world wars to reduce French and force German-only policies; 
expulsion of German and also spoken Luxembourgish (which was regarded as a 
dialect of German) from the school system; and, more importantly, negative atti- 
tudes towards these languages. The massive decline and decrease has turned the 
Areler Land of today into a nearly completely French-speaking part of Wallonia. 
Germanic varieties are restricted to the old generation, are neither learned as mother 
tongue languages nor taught in school, and are limited to some functional contexts 
(i.e. family). How many speakers still use or understand these Germanic varieties 
today is less clear; Darquennes (2005) assumes that they still amount to 16,000 
to 21,000. Nevertheless, these varieties are highly endangered. Beginning in the 
1970s, various attempts at strengthening or revitalizing Luxembourgish have taken 
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place in this area. In 1990 the local association Areler Land a Sprooch (‘Arlon 
region and language’, ALAS) succeeded in getting the local variety accepted as one 
of the regional varieties (‘Décret relative aux langues régionales endogenes de la 
Communauté francaise"). This also means that Luxembourgish is protected in the 
Arlon region by the ‘European Charter for Regional or Minority Languages’. Due 
to the fact that Belgium has not signed this charter yet, the outcome of this status for 
the actual use of the language is quite small. On the local level, language learning by 
immersion has been implemented only for Dutch, English and German, with the in- 
tentional exclusion of Luxembourgish. On the other hand, there is a noticeable 
upswing of Luxembourgish due to the fact that many people living in this area are 
cross-border workers in the Grand-Duchy where they can expect more attractive 
jobs (see Klein 2005). Interestingly, learning the regional language here is not con- 
nected with language maintenance or revitalization but rather with pure economic 
reasons. Despite these changes in the context of the linguistic need for the local lan- 
guage(s) in the workplace, the Areler Land finds itself in a similar condition as the 
Germanic-speaking part of the Lorraine (see 3.3). In both cases the language is 
highly threatened because of 'intergenerational disruption’, i.e. there is no trans- 
mission of the language within families (see Darquennes 2005). 

While the use of Germanic varieties in ‘Old Belgium’ is decreasing, we find a 
quite stable German-speaking population in so-called *New Belgium' (approx. 
70,000 speakers), which consists mainly of the cities of Eupen and St. Vith and 
their surrounding villages. These territories originally belonged to Prussia and 
were assigned to Belgium after WWI. This area is the officially acknowledged Ger- 
man-speaking area of Belgium, and is renowned as the "best protected minority 
in the world'. Alongside (Standard) German, various German-based dialects are 
in use (east Limburgian in the north, Ripuarian and Luxembourgish dialects in 
the south). In administration and local government German is the official and most 
frequently used language. French is present due to permanent contact with the 
neighboring Walloon areas and due to several French-speaking people living in 
the German-speaking area. In terms of language usage, this area is therefore best 
characterized as triglossic. This language situation is maintained due to at least two 
decisive factors: (1) loyalty to Standard German and the dialects within the Belgian 
territory is very strong and thus favors intergenerational transmission; and (2) the 
language policies of the school system are capable of ensuring German language 
learning and teaching. This small area thus provides a sound example for the suc- 
cessful application of language policies to maintain a minority language efficiently. 

It should become clear from this brief description of the language policies in 
Belgium that this country is characterized by a high degree of linguistic diversity. 
According to Darquennes (2004: 5) the complicated system of territoriality, which 
evolved through long-lasting legislative processes, was in the end appropriate to 
overcome the severe language conflicts of the 20" century. 
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3.2. Luxembourg 


In 1839 the Grand-Duchy of Luxembourg (483,000 inhabitants) was founded in its 
present-day borders as a result of the Vienna Congress (1815) and the Belgian rev- 
olution (1830). Until the beginning of the 19% century, larger parts of the territory 
were situated in the Romance-speaking area while the other parts were in the pre- 
dominantly Germanic-speaking area. Thus, language contact between Romance 
and Germanic varieties has been a long-standing issue. From mediaeval times on- 
wards varieties of both genetic groups have been used as written languages in of- 
ficial documents (e.g. charters), thus creating a complex system of multilingualism. 

After several modifications to the size of the territory (parts went to France in 
1659, to Prussia in 1815, and to Belgium in 1839), the present-day country is con- 
fined to a so called 'quartier allemand', with a population predominantly using 
West Germanic languages (i.e. Luxembourgish and Standard German). The state 
border in the west is constituted by the language border (with the exception of the 
"Areler Land’, a small region around the city of Arlon, which for strategic reasons 
was assigned to Belgium in 1839; see 3.1). Throughout history, French has also al- 
ways played an important role as an administrative language in the *quartier allem- 
and' and gained a lot of prestige. It is this long-lasting presence of French and Ger- 
manic varieties on the territory of Luxembourg that gave rise to the present-day 
multilingual society. The local language Luxembourgish (Létzebuergesch) is of 
Central Franconian origin (Newton 1990). At the beginning of the 19th century it 
was regarded as a dialect of German with low prestige that served only as a spoken 
language. Consequently, Luxembourgish was not mentioned in the constitution of 
1848, which declared Luxembourg to be a bilingual country (French/German). 
With the evolution of national identity in the 19t century and the experience of oc- 
cupation by Germany during the two world wars, Luxembourgish now finds itself 
in an ongoing process of becoming by and large structurally and sociolinguistically 
independent from Standard German (Gilles 2000, Moulin 2006). Elaboration in the 
domains of grammar, vocabulary and functional domains allows the classification 
of Luxembourgish as an “Ausbausprache’ (Kloss 1967, Gilles and Moulin 2003). 
As far as attitudes are concerned, Luxembourgish is accorded a high emotional 
position, as it is considered an important component of national identity. 

This process of ‘emancipation’ (Gilles 1998) of Luxembourgish from the dia- 
lects of German cumulated in the language law of 1984, in article 1 of which Lux- 
embourgish was officially declared to be the (only) national language (‘La langue 
nationale des Luxembourgeois est le luxembourgeois'). Administrative languages 
are French, German, and Luxembourgish (article 3). For all purposes related to 
legislation and jurisdiction French is the only decisive language (article 2).! The 
language law was an important step not only to accept Luxembourgish officially as 
the national language but also to promote the trilingualism of Luxembourgish, 
French and German as the basis of the society. In doing so it was possible to avoid 
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assigning Luxembourg to either the French or German-speaking countries.? The 
language law of 1984 itself contains virtually no precise language policies or lan- 
guage planning issues. Rather, it was intended to confirm the status quo of the lan- 
guage use in the Grand-Duchy in the second half of the 20th century. 

Language use today in Luxembourg is characterized by trilingualism with a 
high degree of triglossia (Horner and Weber 2008). Until the 1970s Luxembour- 
gish was the most important spoken language, while French and German were con- 
sidered written languages. Since then massive immigration (mostly from Portugal; 
constituting 15 % of the overall population) and cross-border workers from France, 
Belgium and Germany (140,000, forming a third of the overall workforce) have 
brought radical modifications into the multilingual situation. Within this multicul- 
tural and multilingual society, the total amount of foreigners is 46.2 %, and it be- 
comes difficult to establish a language-per-domain description. According to the 
work of Fernand Fehlen (e.g. 2009) the most adequate way to capture this multi- 
lingual situation therefore lies in applying a Bourdieu-type analysis of the ‘lin- 
guistic market’ in Luxembourg. In general, an increase of French is noticeable 
in public life, because several foreigners and cross-border workers prefer to use 
French. Public signposting and advertising is predominantly in French, which 
often gives the visitor the (false) impression of a French-speaking country. French 
still has the high prestige of a language conveying culture" in general. Many 
Luxembourgers, though, feel unease in speaking French, which is still regarded 
by many as a foreign language. In addition many Luxembourgers are irritated 
by using French in most shops, because large parts of the personnel come from 
French-speaking countries. However, French has evolved as a ‘lingua franca’. Ger- 
man, on the other hand, has lost several domains since WWII and today it is mainly 
employed as a receptive language: most newspapers are written in Standard Ger- 
man and the most preferred TV channels are also German. 

Against the background sketched above, it seems evident that the small lan- 
guage Luxembourgish is struggling under the predominance of French. This, how- 
ever, is definitely not the case. Luxembourgish has not only maintained its status as 
the most important spoken language, but also managed to conquer new domains. 
Besides its secure status as the language of the family and among friends, Luxem- 
bourgish is still the most widespread spoken language in administration, parlia- 
ment and government, and on the radio. Especially in the domains of New Media 
(SMS, email, chat, private homepages, Facebook) Luxembourgish has become the 
preferred language. Indeed, it seems that Luxembourgish is the only language 
possible here — provided the participants are luxembourgophone (Gilles in press). 
These innovations in communication are helping this small language to develop a 
niche of written language use, which gains more and more importance for every- 
day life. Future developments in communication will show an increase of digital 
and informal media, and this will become a crucial factor in the language mainten- 
ance of Luxembourgish. 
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The present-day situation is thus characterized by an overall increase of multi- 
lingualism — one has to manage the three languages on a sufficiently high level to 
participate on all domains of the public life. This trend is reflected in the somewhat 
idealistic slogan of the Ministry of Education: ‘our true mother tongue is multi- 
lingualism' (Réajustement de l'enseignement des langues 2007). 

This high capital of multilingual proficiency has to be guaranteed by an appro- 
priate language teaching and learning system at school. The implicit language po- 
lices at work here aim in the first place at ensuring high proficiency in German and 
French (and also English). German is introduced as the first language and taught as 
a mother tongue. Bearing in mind that more than 40 % of the pupils have different 
mother tongues than Luxembourgish, it is obvious that these pupils often en- 
counter problems at school. The rationale behind this ‘German first’ policy lies in 
the assumptions that (1) Luxembourgish is quite close to German so that children 
with this mother tongue have no problems in learning this language during their 
first years in school and (2) children with another mother tongue learn Luxembour- 
gish in their pre-school education. French is then introduced from the second year 
onwards. Although Luxembourgish was introduced in the school curriculum as 
early as 1912, it is still not fully integrated. While in primary school Luxembour- 
gish is used as a language of instruction, it is largely absent in secondary school. At 
this school level only one hour weekly is devoted to the mother tongue. To sum up, 
then, the Luxembourg school system offers a nice example of the ‘cost of multilin- 
gualism' as large portions of compulsory schooling are spent on language learning. 
For a long time the language policies of the school system have been subject to 
constant criticism (Weber 2009). The main critical point relates to the German-first 
approach, which clearly penalizes pupils with non-Germanic mother tongues. Por- 
tuguese children, in particular, suffer from the Luxembourgish school system. In 
order to overcome this problem the establishment of two ‘language tracks’, a Ger- 
man and a French one, has been proposed. In this system the children would have 
the choice between German and French as their first language. The educational in- 
stitutions are reluctant to adopt this system, however, as they fear the emergence of 
two language groups in Luxembourg, as is the case in Belgium. Thus, for the sake 
of the unity of society, the traditional school system will in principle be retained. 
However, measures have been established to offer additional language training for 
non-Luxembourgish children. The future of the comparatively conflict-free multi- 
lingual situation depends strongly on the appropriateness of the school system; 
much still needs to be done to adjust the system to the extremely heterogeneous 
and multicultural Luxembourgish society of today. 

The last domain of language policies discussed here relates to the labor market. 
It is not possible to generalize over all the different workplaces, but a few trends 
can be mentioned here. For most workplaces French is indispensable today, Eng- 
lish is increasing, especially in internationally oriented businesses like the banking 
sector, with German maintaining its medium position (Klein 2005). An increase of 
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Luxembourgish is noticeable in the last 10 years particularly in the tertiary sector. 
More and more employers expect their employees to master the ‘three languages of 
the country’ (les trois langues usuelles du pays) (Pigeron and Fehlen 2005). This 
implicit language policy has as one consequence that more and more people are 
starting to learn Luxembourgish. The demand for language classes is booming, not 
only in the Grand-Duchy itself but also in the neighboring countries. Several lan- 
guage classes in France, Belgium and Germany even receive funding from the 
Luxembourgish Ministry of Education. 

In order to increase the use of Luxembourgish, to facilitate integration and to 
minimize language conflicts at the workplace, a ‘language leave’ (congé linguis- 
tique) was launched in 2009. This (paid) leave will allow all employees and self- 
employed people to attend language classes in Luxembourgish (maximum 200 
hours). 


3.3. Lorraine 


In France the language border lies in the east of the country, coming from the north 
and dividing the Moselle state (Département de la Moselle) into a western, French- 
speaking, part and an eastern, Germanic-speaking, part. The regional varieties of 
the Lorraine area around the cities of Thionville, Boulay and Sarrebourg belong to 
the central Franconian dialects and, from a historical point of view, they form a lin- 
guistic continuum with Luxembourgish and the dialects of neighboring Germany 
(for language history see, e.g., Hudlett 2001). Several names exist for the regional 
language, among them Lorraine francique, Platt and patois. Carrying partly a very 
low prestige made it easy for French language policies to constantly reduce these 
Germanic varieties, with the result that the regional language is seriously endan- 
gered today (Laumesfeld 1996). 

Lorraine is a typical border region, exhibiting an intricate territorial history as 
it belonged to several different countries in the course of history. In this Romance- 
Germanic bilingual transition zone the population frequently experienced the pre- 
dominance of either the French or the German language. From 1794 onwards, as a 
consequence of the French Revolution, only French was allowed in official docu- 
ments and in administration. From 1871 to 1919 the Lorraine region belonged to 
Prussia and German was taught in school regularly. During this time the public use 
of the dialect was stigmatized. After the end of WWI German was again oppressed 
and stigmatized, and also the regional language, which was considered a dialect of 
German. French took over all influential domains, e.g. schools and administration. 
The opposite took place between 1940 and 1945, when the Nazis prohibited the use 
of French. An anti-German attitude after WWII brought along a massive reduction 
of the Franconian varieties, again accompanied by an increase of French. From 
then on German was not taught in school anymore, a situation that lasted until the 
1980s, when German was introduced again as a foreign language (Kieffer 2003: 
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236). In general, we can see a decline in the use and the status of Germanic var- 
ieties in Lorraine. 

A few language societies like Gau un Griis are trying to save and maintain the 
regional language by promoting it as part of the regional culture, history and iden- 
tity that will soon be lost if awareness is not increased. Nevertheless, the active 
use of Francique is fairly restricted to the older generation. According to Hughes 
(2005: 136), less than 20 % of speakers younger than 15 speak the regional lan- 
guage. Without a sound intergenerational transmission it will become difficult to 
maintain the regional language. In school French is the main language, and Ger- 
man classes are optional. Some schools offer classes in the regional language 
(langues et cultures des pays mosellans), but the impact on language learning 
within families or society is low (Kieffer 2003: 238). Hughes (2005: 149) ident- 
ified only little effort of the speakers to defend their language. Thus, the vitality of 
the language is even worse than in the southern Alsatian area, where severe lan- 
guage loss can be observed. Statistical figures about speakers and language use in 
the Lorraine are scarce, as is reliable sociolinguistic research. 

A slight improvement can be noticed, though, with respect to cross-border 
workers from the Lorraine commuting to Luxembourg or the Saarland region in 
Germany. In order to succeed on the labor market mastering Francique (which is 
linguistically quite similar to Luxembourgish and the Saarland dialect, and partly 
also to Standard German) is regarded as advantageous in finding more lucrative 
employment. As in the case of the Arlon region in Belgium it is nevertheless un- 
clear whether this economically motivated language behavior will help the lan- 
guage situation. The use of the regional language here is externally motivated and 
seems to be having repercussions on the usage of Francique in the Lorraine itself. 

With regard to official language policies by the French (local or national) insti- 
tutions, it has to be stated that there is (unfortunately) only little official support in 
favor of the Germanic varieties in the Lorraine. Although a Conseil Académique 
des Langues Régionales was founded in 2003 with three members from Lorraine, 
and although France signed the *European Charter of for Minority or Regional 
Languages’ (but never ratified it) (Darquennes in this volume) no official language 
planning measures have been taken to maintain or strengthen the situation of Fran- 
cique in Lorraine. For the national language policies of France the local varieties of 
Lorraine are regarded as local, spoken variants of German and they thus cannot be 
attributed official status, e.g. in the school system or as an alternative language in 
administration. It is felt in the population of Lorraine that the strong presence and, 
of course, the high prestige of French will eventually make the Germanic varieties 
vanish and, as a result, the language border will move further east and ultimately 
coincide with the state border. 
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4. Discussion 


In its 1000 years of evolution, the Germanic-Romance language border led to a 
variety of contact situations and nation states with specific multilingual situations. 
Language policies (explicit or implicit, official or inofficial) are at work in order to 
cope with the linguistic, but also social and cultural diversity. The three case 
studies offer quite different policies. Belgium is the classical example for the prin- 
ciple of territoriality (another instance would be Switzerland). Although on a 
national level the country is classified as multilingual, regulations on the federal 
level gave rise to three largely monolingual regions. Quite the opposite is the case 
for Luxembourg, where the three languages have to be mastered by the whole so- 
ciety. Lorraine, finally, is an example for language decay due to the lack of official 
policies to protect minority languages. 

For all these *border communities', we shall finally discuss the status (or 'real- 
ity’) of the language border. For Belgium the ‘taalgrens’ appears to be very real, as 
itis part of everyday life and particularly in politics. In Lorraine, on the other hand, 
the language border is not prominent at all, or rather it coincides with the borders of 
the nation state, because the centralistic and monolingual policy of the French gov- 
ernment largely ignores indigenous minority languages. For Luxembourg, the lan- 
guage border also seems to play no role. This, however, is not because of a general 
neglect of the border. Here, the traditional multilingualism as well as the massive 
cross-border commuting minimizes the impact of the real border by turning 
the whole country into a zone of multilingualism (Treffers-Daller and Willemyns 
2002: 2-3). 


See also the following chapters in this volume: 24 by Troester-Mutz, 25 by Extra, 
26 by Auer and 29 by Darquennes. 


Notes 


1 The text of the language law is available on-line at http://www.legilux.public.lu/leg/a/ 
archives/1984/0016/a016.pdf#page=6 [1. 3. 2009]. 

2 Nevertheless, Luxembourg, like the French-speaking part of Belgium, is member of the 
“Organisation internationale de la francophonie’ (http://www.francophonie.org/). 
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1. Introduction 


After a general introduction addressing the concepts ‘feminism’ and ‘language pol- 
icies’, this article will give a short historical overview of various feminist language 
policies in Europe (section 1). In the second section, four different stages of fem- 
inist language politics in Europe will be presented: a fact-finding stage, a strategy 
proposal stage, a stage of implementation, and a stage of evaluation. For all of these 
stages, the influence of different political, cultural and linguistic aspects and factors 
will be explored. The central elements to be considered are the existing political 
systems and ideologies, the role and public perception of women's movements, and 
governmental regulations of sexist discrimination. A detailed analysis of these 
points will allow for a better understanding of the various developments of feminist 
language politics in Europe. What will be discussed in the third and final section are 
recent developments and challenges to feminist language politics, including the 
impact of new developments in gender theory, models of interdependent forms of 
oppression, pragmatic language concepts, and the influence of national and trans- 
national politics on feminist language politics. 

Feminist language politics were initiated in all European countries in the 1970s 
by feminists in the women's movement and from outside of academia. It wasn't 
until the 1980s that feminist academics adopted feminist political targets and 
claims, theorized them and introduced them into linguistics as an academic field. 
The goal was to institutionalize feminist language politics in a systematic way and 
to establish it as an academic sub-discipline of linguistics. As of now this has not 
been successful in most European countries yet: “In fact, ‘mainstream’ literature 
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on language planning either ignored or denied the existence of feminist language 
planning until Cooper's (1989) work on language planning and social change 
which includes the American non-sexist language campaign as one of its case- 
studies" (Pauwels 2003: 552). 

There are, however, a large number of feminist political activities concerning 
language use which can be found even earlier than the second women's movement 
in Europe, namely in the works of female authors deploring the lack of female 
voices and words in language, and the lack of space in which to express them- 
selves. Take, for example, Virginia Woolf's A room of one's own (1929) as well as 
several novels by the French philosopher and writer Monique Wittig (see Tudor 
2010a for a detailed account of Wittig's literary oeuvre and its impact on feminist 
language theories). One important and early source of feminist language politics 
thus is fictional writing by women, often science fiction, in an effort to find new 
wordings and new forms of communication (e.g. the science fiction works of Susan 
Elgin, a linguist, who developed a new language called Ládaan, in her work Native 
Tongue (1984)). Another early novel, translated into several languages and still fa- 
mous for its creative linguistic feminist changes, is Gert Brantenberg's Egalias 
dotre (1987). In addition to female fiction, numerous feminist political groups have 
taken into account the violating aspects of certain language forms and usages and 
have made this a major issue in their political work. 

A common starting point for all kinds of feminist language politics is the ob- 
servation that language and sexism are closely connected. However, the way they 
are connected differs substantially in different approaches. Here are important fac- 
tors to be kept in mind: 


e the way language is conceptualized: either as a language system, on the one 
hand, and language use applying that language system, on the other hand, or 
solely as language use with no language system as a reference point, as it is the 
case from a perspective-pragmatic point of view (see Hornscheidt 2006, 2008). 

e the way the relationship between language (use) and reality is conceptualized: 
is language (use) seen as a (possible) reflection of an outside reality, represent- 
ing a perception of the world which might be more or less accurate? Or is it 
better described as a tool for actually constructing world views which then con- 
stitute reality, i.e. not ‘referring’ to an ‘outer’ reality but as constantly re/con- 
structing it? 

e the way gender is conceptualized: Is gender seen as a twofold natural system 
that is independent from but has to be adequately reflected in language, or is it a 
rather artificial social perception initiated by language use, whose dichotomous 
structure has become naturalized by social and cemented by linguistic conven- 
tion? This last idea was perhaps formulated most strongly by Butler (1990). 

e the way feminism is conceptualized: is gender equality the ultimate aim of fem- 
inist language politics, i.e. that *women' and *men' should have the same op- 
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portunities to be represented in language (use)? Are femaleness and maleness 
seen as different realities which have to be linguistically represented in differ- 
ent ways, or is gender seen as a powerful construct which can (and should) be 
challenged and deconstructed by raising awareness and providing alternatives 
to the present linguistic conventions? 


Not all of these different aims and traditions are compatible with each other, and 
not all of them would like to see themselves discussed under the heading of fem- 
inist language politics. Following research developments in the last few years, pre- 
ferred labels would rather have been ‘critical gender and language research’, or 
*approaches at the intersection of language and forms of oppression', with the rel- 
evant representatives no longer subscribing to a traditional notion of feminism. 
This aspect will be taken up in section 3 of this chapter. Just as feminist theory has 
constantly changed and different theories and approaches have multiplied over the 
last five decades, linguistics has undergone changes and finer differentiations with 
regard to theoretical concepts and new subdisciplines, too, so that the dynamic in 
both fields is (still) contributing to ongoing changes in the way feminist language 
politics are implemented today. In addition to that, there are political and cultural 
differences and a variety of political and economic developments in Europe which 
have also contributed to varying conceptualizations of feminist language politics 
within the different European countries. Obviously, the diverse starting points 
taken in feminist language politics do not only lead to different analyses with re- 
spect to the interplay of language (use) and sexism; they also lead to different atti- 
tudes towards language change and, consequently, to different strategies of lan- 
guage change, and how they are implemented and evaluated. 


2. The scope of ‘traditional’ feminist language politics 


In this section, I would like to highlight a variety of aspects which have had some 
influence on how feminist language politics are conceived and implemented in the 
various linguistic communities and concrete situations. I will present them in chro- 
nological order. For each of these different phases, there are a large number of im- 
portant aspects to be taken into account: 


e differing conventionalized possibilities to refer to or, in a pragmatic model, to 
interpellate persons verbally with regard to making a gendered classification 
explicit or not; 

èe diverse political and historical conditions in different linguistic communities 
and for different languages under investigation, which lead to a variety of con- 
ceptualizations and applications of feminist language politics; 

e varying attitudes towards language as a societal and/or political tool; 

€ diverse attitudes towards politics. 
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2.1: Creative, mainly literary resistance 


A first stage in feminist language politics is characterized by various forms of non- 
academic political and cultural negotiations, linguistic actions, proposed changes 
and challenges to certain linguistic norms which have been understood and ana- 
lyzed by those who have experienced sexist verbal discriminations, i.e. conven- 
tionally speaking *women'. Taking these instances as a first phase of feminist lan- 
guage politics profoundly broadens the scope of linguistically relevant opposition 
to hegemonic linguistic norms and makes a variety of different forms of resistance 
and actions taken against these norms more visible, even within linguistic histori- 
ography. This idea has been developed here on the basis of postcolonial theories 
(i.e. Loomba 1998; Mills 1997), which point out that opposition towards norms can 
take many different forms and that it is important to change our perspective on that. 
Such linguistic (re)actions can materialize in written and oral texts. Even forms of 
not interacting verbally, of non-addressing and silencing, can thus be regarded as 
important linguistic strategies to oppose sexism and thereby as parts of feminist 
language politics. To investigate these different and continuously applied lin- 
guistic actions more systematically and to include them into the notion of feminist 
language politics is an academic task which has yet to be undertaken. As already 
mentioned in our introductory remarks, literary works have had an important im- 
pact on the reflection upon and change of androcentric language. An important 
focus of many forms of feminist language politics are person reference forms or, 
from a pragmatic point of view, person appellation forms (see Hornscheidt 2006 
for a longer discussion on the conceptual differences between the terminologies of 
person reference forms and person appellation forms and the consequences for lin- 
guistic research). They encompass all forms which are used to interpellate per- 
sons — in direct speech situations (1st and 2nd person forms) as well as in talking 
and writing about others (3rd person forms). The largest group which is under in- 
vestigation here is that of nouns interpellating people with regard to their status, 
functions and roles, such as feminist, linguist, cyclist, racist. Livia (2001) gives 
many examples from her linguistic research on literary works, where in order to 
fight linguistic sexism these forms have been challenged by feminists and changed 
in creative ways (e.g. gyn/ecologist, herstorian, s he). 


2.2. Systematic analysis of the sexist dimensions of language 


Pauwels (1998) draws a distinction between three dimensions of feminist language 
politics. Implicitly, however, she refers solely to person appellation practices and 
does not take into account other forms of feminist language politics like oral com- 
munication conventions. These dimensions, which will be discussed in some detail 
below, only emerged after the first stage of feminist language politics sketched in 
section 2.1. I would like to add that this first phase still exists in most linguistic 
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communities — and that it often exists quite independently of linguistic discussions 
and research on the topic. This in itself is an interesting observation on the degree 
to which feminist language politics is (or is not) embedded in broader social move- 
ments. "A first major stage in the planning process is to identify, document and de- 
scribe the problematic language issue. This forms the fact-finding stage" (Pauwels 
1998: 16). Pauwels regards this phase as largely fulfilled for most European lan- 
guages. An important academic publication in this field is Hellinger and Bubmann 
(2001/2002/2003), in which more than 40 languages were systematically investi- 
gated with regard to how person reference forms (using their terminology) can be 
used for gendered or gender-neutral references. The state-of-the-art reports col- 
lected in these volumes give detailed insights into the different stages of feminist 
language politics in a wide range of countries. From a European perspective, 
though, it needs to be said that there are, for instance, no accounts of the situation 
in many Eastern and South-Eastern European languages, like Albanian. Fur- 
thermore, even the most fundamental linguistic analyses are missing for many 
minority languages in Europe, like Roma and Saami. Even if Pauwels (1998, 2003) 
calls this phase “fact-finding-stage”, I would like to add that the postulation of its 
fulfilment is largely based on the idea that feminist language politics are more or 
less restricted to person reference forms and work on the assumption that there are 
only two (natural) genders. Both presuppositions are not uncontroversial, as men- 
tioned above. If these presuppositions are questioned, the so-called “fact-finding- 
stage" can for several reasons not yet be postulated to have reached its end or to be 
close to it. Maybe this stage will never be fulfilled: more deconstructive and inter- 
sectional notions of gender will make it necessary to question linguistic gender 
norms continuously, as will be argued in the last part of this chapter. 

Relevant research within this fact-finding stage can be differentiated for differ- 
ent underlying concepts of gender, language, and the relationship between gender 
and language in feminist language politics, on the one hand, and the effects these 
different definitions have on how feminist language politics are conceptualized, on 
the other hand. There are crucial differences between approaches which under- 
stand language as a tool for (re)constructing and therefore even changing reality, 
and approaches which understand language as a mere post-hoc reflection of a real- 
ity that is more or less independent of language and concrete language usage. This 
major difference, which is a characteristic division between many linguistic 
schools and approaches, is important for a better understanding of different atti- 
tudes, models and targets within feminist language politics. For example, Hel- 
linger and Bußmann (2002) understand feminist language change as a reaction to 
already changed societal gender relations and norms. From their point of view, lan- 
guage is more conservative than social realities are. This implies that social real- 
ities are independent of linguistic conceptualizations and perceptions; hence, fem- 
inist linguistic changes are taken to aim at a more adequate reflection of a social 
reality. 
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Gender-related language reform is a reaction to changes in the relationships between 
women and men, which have caused overt conflicts on the level of language compre- 
hension and production. Reformed usage symbolizes the dissonance between traditional 
prescriptions such as the use of masculine/male generics and innovative alternatives. 
Guidelines are based on the assumption that a change in behaviour, i.e., using more in- 
stances of non-sexist language, will be attended by a change in attitude so that positive 
attitudes towards non-sexist alternatives will develop. [...] Conversely, positive atti- 
tudes will motivate speakers to use more non-sexist language. (Hellinger and BuBmann 
2002: 18-19) 


From this point of view, social or societal changes are necessary to initiate lin- 
guistic changes which themselves, as a secondary effect, could have some effect on 
the interactants’ conceptualizations of persons as (un)gendered. 

However, this perspective on language and its relation to reality is questioned 
from a linguistic-pragmatic point of view, and this in turn has consequences on 
how feminist linguistic change is conceptualized. From the point of view of prag- 
matics, these processes are understood to be much more interdependent. Language 
is an important medium to (re-)produce images or constructions of reality. Hence, 
feminist language politics is understood as an instrument, and not as a conse- 
quence, of feminist politics. Feminist language politics as such can lead to import- 
ant new ideas and conceptualizations, even of models of gender, and are a crucial 
aspect of social change. 

By applying recent models of the interdependent complexities of forms of 
structural oppression to the complexities of gendered realities (Walgenbach et al. 
2007; see Hornscheidt 2007 for a discussion on the linguistic dimensions of this 
approach), traditional strategies of feminist language change are challenged and 
have to be reformulated. After all, they have to take into account the complexities 
in the interplay of different dimensions of oppression, like for example sexism, 
racism and lesbophobia in the case of person appellation forms. By taking into 
account interdependent forms of discrimination concerning linguistic actions, 
the discriminations (re)produced in strategies of feminist language change can be 
evaluated and changed accordingly. 


2:3. Recommendations for linguistic change 


A first step in achieving language reform is thus some consensus in the innovating 
speech community, in our case, socially conscious women, that a particular innovation 
is an appropriate non-sexist solution. The next step is the development of non-sexist lan- 
guage guidelines incorporating these solutions. (Ehrlich und King 1992: 154) 


Taking a two-gender-model as a given, one of the main questions which has been 
debated among linguists concerning the different language communities in Burope 
is which strategy of change is to be preferred and for what reasons: linguistic 
gender-neutralisation or linguistic gender-specification? These are the two strat- 
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egies which are mainly found and are usually proposed as the only possible alter- 
natives for the majority of linguistic communities. The strategy chosen is partly de- 
pendent on the different conventionalized possibilities to linguistically interpellate 
people within the different language communities. An important factor in this re- 
spect is that different European languages have different conventionalized ways of 
using, for example, grammatical gender differentiations for coding so-called natu- 
ral (in traditional approaches) or social gender differentiations. But still, there are a 
variety of strategies to choose from in the various language communities even if 
they share the same linguistic situation in terms of language structure. For 
example, this can be observed in the differing language policies found in Austria, 
West Germany and the former East Germany. Whereas in West Germany gender 
differentiation has been the most favoured linguistic change strategy by feminists 
working on this topic, in East Germany gender neutralisation was the only feminist 
language change strategy propagated. In contrast to both, in Austria, the so called 
"Binnen-I" (as in StudentInnen ‘university students’ — to denote female and male 
students instead of the generic masculine form Studenten) is the feminist language 
change strategy which has been most successfully implemented in public language 
usage over the last 15 years. This shows that the conventionalized possibilities to 
interpellate people and to explicate gender are only part of the answer. Together 
with linguistic conventions, gender ideologies as well as feminist positions and 
aims are obviously important, too, once it comes to the concrete strategies propa- 
gated and applied in a given community. 

‘Traditional’ strategies of language change advocated by feminists aim to im- 
plement systematic linguistic changes, which are implicitly indebted to a struc- 
turalist understanding of a language system as the basis of concrete language use. 
Within this approach to feminist language politics, it is the language system which 
needs to be systematically changed. Feminist linguistics thereby positions itself 
against traditional structuralist linguistics, which is conceptualized as being solely 
descriptive. According to Schoenthal (1989), feminist linguistics in Germany 
works in a tradition of linguistic criticism that structuralist linguistics is opposed 
to. This conflict, however, has not often been discussed and has led to a number of 
misunderstandings as the different epistemological starting points of these tradi- 
tions have not been made explicit. At the same time, however, this branch of fem- 
inist linguistics is still indebted to the idea of systematic change and thus tries, in a 
somewhat paradoxical gesture, to adopt structuralist norms. Moreover, these more 
traditional strategies of feminist language change are strongly decontextualized 
since they focus on the systematic change of single words and person appellation 
forms. The change of these forms is propagated without taking into account differ- 
ent communicative situations in which utterances are used. 

This focus on changes of the language system reflects that traditional feminist 
linguistics takes the language system to have the symbolic power to structure the 
world, in this case with respect to gender. Inherent in this assumption is that this 
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symbolic power is implicitly understood as being discursively constructed. The 
second stage in Pauwel's model therefore encompasses different, mostly system- 
oriented, strategies of language change. It is the aim and implicit norm of almost all 
feminist language changes propagated in the 1970s and 1980s to establish a sym- 
metrical language gender reference system, offering women and men the same op- 
portunities to be identified and addressed by linguistic forms, and/or to make 
gender invisible as a form of address and reference. (Note that all these strategies 
reproduce the idea of a natural two-gender-system. This has been challenged by 
more recent language change strategies, and will be presented at the end of this 
chapter.) It is perhaps this stage of feminist language politics which was most vis- 
ible and known to a broader public, and has even resulted in a number of system- 
atic language changes across different contexts in different speech communities. In 
some languages, like English or Swedish, single words which were analysed as 
being extremely androcentric were replaced by other forms, like e.g. English 
chairman which has been replaced by chairperson in public and official docu- 
ments; in Sweden, the term riksdagsman (lit. ‘man of parliament’) was replaced by 
riksdagsledamot ‘member of parliament’, but the latter change has not been very 
successful — both forms still exist in Swedish public language use (see Hornscheidt 
2006 for a longer and more detailed account of this). Another important change 
was the abandoning of the form Frdulein in German speaking communities, a noun 
denoting a non-married woman (see Hornscheidt 2008 for a discussion of this form 
and its resignifications). A similar change has been propagated for English, where 
the address forms Miss and Mrs are meant to be replaced by Ms — thus making the 
asymmetrical classification of women with regard to their marital status invisible 
and ultimately irrelevant. This change too, however, has only partially been suc- 
cessful (cf. section 2.4 below). 

More generally speaking, many European languages these days provide vari- 
ous guidelines for non-sexist language use which focus on person appellation 
forms. The scope of such guidelines varies from single institutions and companies 
propagating linguistic change for their institution, to different national governmen- 
tal bodies! on different levels, reaching up to working groups within the European 
parliament?. Still, large differences can be observed between European countries 
with regard to whether or not any such recommendations or guidelines exist, who 
has published them, and to what effect. As a general observation, there are far more 
guidelines for the Western and Northern European national languages and coun- 
tries than for the Eastern and South-Eastern European countries. The more we 
move to the West and to the North of Europe, the more strongly institutionalized 
are recommendations for a non-discriminatory language use with regard to a 
monolithic gender concept. This suggests connections between political systems, 
economic wealth and a public reflection on the sexist aspects of language (use). 
But even within the subgroups of Western and Northern European countries there 
are profound differences. For example, the differences between Austria and Ger- 


Feminist language politics in Europe 583 


many with regard to the institutionalisation of non-sexist language changes are as- 
tonishing for two countries so close to each other and sharing the same national 
standard language. With regard to non-sexist language change, Austria has many 
more recommendations and guidelines and even systematic changes with regard to 
person reference/appellation forms than Germany. Some forms which are still 
negatively associated in most contexts in Germany, like the so called Binnen-I, are 
fully established in public and official language use in Austria today. 


2.4. Implementation of strategies 


Fasold (1984) proposed a four-phase-model for language planning which Pauwels 
(1998, 2003) applied to (traditional) feminist linguistic change. However, even 
Pauwel's model cannot be fully transferred to feminist language politics for all 
European languages. One of the major differences is the absence of the phase of 
thorough implementation of feminist language change strategies on the level of 
public language usage. With the exception of English speaking countries, the phase 
of implementation has only randomly been carried out. Instead, proposed changes 
(2nd phase) have been directly evaluated (4th phase). This might be explained by 
social power relations, i.e. accepted sexism in institutionalized political positions 
of power, which therefore contributed to a non-implementation of feminist lan- 
guage change proposals. 

Even Labov, in his classic discussion on language change (1972), arrived at the 
conclusion that the scope of change is dependent on the social status that a certain 
group favouring this change has in society. “If the group in which the change orig- 
inated was not the highest-status group in the speech community, members of the 
highest-status group eventually stigmatized the changed form through their control 
of various institutions of the communication network" (Labov 1972: 179). In ad- 
dition to that, aspects of more general public attitudes towards the ideological basis 
of the proposed changes are of some importance as well. Schräpel (1985) sees 
prestige — the positive association a term needs in order to be used in a certain so- 
cial group —, power, group norms, and resistance as important factors for the suc- 
cessful implementation of linguistic change. According to Schräpel (1985: 217), 
linguistic change starts in certain social groups favouring this change, and is only 
later adopted by other people. 

Whereas there have been many official guidelines for non-sexist and non-racist 
language use in U.S. American universities and publishing houses since the 1970s, 
there have been no similarly widespread and thorough efforts to implement femin- 
ist language change in the European context. This applies even more to systematic 
strategies of non-racist language change, which barely exist in the European con- 
text. (What this means with regard to feminist language changes will be discussed 
in section 3.) With regard to monolithic feminist language changes, we can find 
either more general guidelines, as in the case of Swedish, or more specific guide- 
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lines, handbooks or manuals which have been published by individual local com- 
munities, interest organisations or institutions like, for example, newspapers or 
journals. In no case do they have a legally enforcing status, they are merely created 
and formulated as recommendations. This situation, too, needs to be seen in the 
context of the variety of linguistic cultures within Europe and the varying public 
attitudes towards governmental and/or official regulations as well as the diverse 
societal self-images concerning how social change happens. 

Sweden, regarded as the European country with the highest standards on 
gender equality, has only recently introduced some more official guidelines (see 
for example Milles 2008), whereas Norway already introduced official national 
non-sexist language guidelines in the early 1980s. Still another important aspect is 
the influence of the European Union's policies on strategic language changes. 
Since the European Commission makes a strong claim of propagating an anti-sex- 
ist policy, some of the newer European member states have recently started to ac- 
tively propagate feminist language changes on the governmental level, as for 
example countries formerly constituting Yugoslavia. 


2.5. Evaluation of (systematic) linguistic change 


For the European language communities, systematic evaluations of strategic anti- 
sexist language changes are largely absent. There exist only a small number of in- 
vestigations, all conducted in the 1990s, on the effects feminist language change 
had on conceptualizations and reality constructions. After that, research more or 
less stagnated. One of the reasons is an almost European-wide lack of an institu- 
tionalisation of gender and language research in universities and research institu- 
tions. This topic has been widely ignored and this has led to a situation in which 
new investigations in this field are only rarely undertaken. Investigations of oral 
conversations and gender constructions and stereotypes in conversation represent 
exceptions. This in itself leads to a disregard of the importance of language as an 
instrument for constructing realities and discrimination. The few studies which do 
exist focus mainly on English language communities. All international organi- 
sations on language and gender are also very much focused on English. Two aca- 
demic associations to be mentioned in this respect are FLING (Feminist lin- 
guistics) and IGALA (International Gender and Language Association). The latter 
has even announced a research award for the best annual publication in the field of 
language and gender. It has, however, to be published in English, and is therefore 
bound to reproduce certain linguistic norms. 

Cameron (1995) investigates rhetoric books for women and concludes that a 
non-sexist society does not necessarily lead to a non-sexist language. As a conse- 
quence of national and transnational politics, Cameron holds, a certain degree of 
non-sexist language use is politically correct today. But this does not lead to 
changes in personal attitudes. A ‘reformed’ language use can exist on the surface 
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without any effects or thorough reflection. As a consequence, feminist language 
change strategies, if applied in a given community, can make sexist attitudes even 
more invisible and ‘normal’. Similar results can be found for the introduction of 
the aforementioned form Ms, which has not been successful in eliminating concep- 
tualizations of women with regard to their marital heteronormative status. In 
contrast to that, the German noun Fräulein and the Swedish form fröken, both de- 
noting not (yet) married women, have been widely abandoned in public usage. 
However, fróken still is the form most frequently used by children to address their 
female teacher in kindergarten — certain gendered norms of marital status are thus 
implicitly (re)produced here (see Hornscheidt 2006, 2008 for more detailed dis- 
cussions). In addition to that, European flight companies have only recently re-in- 
troduced a term (‘Miss’) for unmarried women on their internet-based flight-sale- 
sites. A comparable form for men, however, is still lacking. 

As a consequence, those researchers in the 1990s who explored the conse- 
quences and success of feminist language politics mostly asked whether a strategic 
linguistic change was meaningful or whether it was/is rather the case that “[...] the 
interpretation of terms (neutral or not) seems to be heavily influenced by the ideol- 
ogies of an individual or speech community rather than by the particular pronouns 
used in a given context” (Ehrlich and King 1992: 156). Here again, we can see an 
underlying attitude towards language as a medium purely reflecting societal 
changes which are situated beyond language and language use. 

There is a huge public interest in feminist language changes. Many people are 
still strongly involved in feminist language politics in Germany although changes 
are only rarely propagated today. As an example from Germany, take the large 
number of negatively informed discussions on the German Binnen-I in the media 
during the last 25—30 years, culminating in a computer programme which can be 
downloaded to free texts from Binnen-I-forms. Even though this form has not yet 
been accepted by a broader German public, the intensive discussions show that the 
form is widely known. In addition, the strong opposition to this kind of language 
change shows the high social importance of linguistic naming practices. 

Independent of whether a person is a feminist or not, it is the self-identification 
and self-positioning which is most significant for propagating and implementing 
feminist strategies for language change (see e.g. Rubin and Greene 1991). Whether 
the relevant language changes lead to changes in a larger social group seems to be 
mainly dependent on the social position these persons take on. 


3. Future challenges 


In the last section of this chapter, I will try to give an outlook on future challenges 
in the field of feminist language politics with regard to a variety of aspects. 
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3.1. A pragmatic-constructivist language concept 


[Language] matters so much precisely because so little matter is attached to it; meanings 
are not given but must be produced and reproduced, negotiated in situated contexts of 
communication. (McConnell-Ginet 1998 [1989]: 49) 


If a subject becomes a subject by entering the normativity of language, then in some im- 
portant ways, these rules precede and orchestrate the very formation of the subject. Al- 
though the subject enters the normativity of language, the subject exists only as a gram- 
matical fiction prior to that very entrance. (Butler 1997: 135) 


Whereas linguistic change, in its traditional understanding in linguistics, is a reflec- 
tion of social change, a pragmatics-informed view understands linguistic change as 
an aspect and tool of social change. From this point of view, reality is always 
relative, and different linguistically manifested conceptualizations can be in com- 
petition with each other. Language usage is considered as the continuous manipu- 
lation of symbolic forms, which in itself has effects on the structure of physical and 
social experiences. Hence, language change is taken to be a continuously ongoing 
process of language use: there is no ‘correct’ and ‘true’ linguistic reference to a 
reality existing prior to language use, but instead the idea of appellation as a con- 
tinuous and dynamic process of resignification. A constant reiteration of certain lin- 
guistic changes can lead to a perception of certain forms as systemic and indepen- 
dent of concrete linguistic usages — as for the appellation forms ‘woman’ and ‘man’ 
for example, which are not questioned normally with regard to their correct refer- 
ence. “What happens in linguistic practices reflects or mirrors what happens in so- 
cial orders conceived as external to discourse itself" (Butler 1997: 157). 

Linguistic changes mean the change of the structure of the interactants’ ex- 
periences as a meaningful conceptualization of reality, as it is developed in cogni- 
tive pragmatics (see Marmaridou 2000). According to this approach, strategic lan- 
guage changes are a conscious effort to influence the structure of social experiences 
and, as a consequence, to change conventionalized conceptualizations, such as the 
deeply entrenched conceptualization of two sexes. Strategic feminist language 
change is thus a form of social change (see Hornscheidt 2006, 2008). The implemen- 
tation of a systematic language change within a social group can be regarded as a 
political strategy for enhancing a certain awareness within this group and for chang- 
ing it eventually. This helps to understand why feminist language change strategies 
are so strongly and intensively opposed in public discourse: the proposed changes 
are not only "linguistic", but do in fact question social conceptualizations and 
categorisations which have been experienced as ‘normal’ and neutral up to this 
point. 

From a discourse-analytical perspective (see e.g. Mills 1997), strategic femin- 
ist language changes can thus be regarded as discursive moments in the negotiation 
of certain ideas about language and gender as well as a fight for discursive posi- 
tions and authority. Linguistic change becomes possible when different linguistic 
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coding or naming practices of reality are possible and the social role of the indi- 
vidual is under debate. “In this context, a precondition of social change is the ten- 
tative character of social reality, values and institutions, and the possibility of lan- 
guage change to construct and express the embodiment of emerging social 
meaning" (Marmaridou 2000: 60). From this point of view, feminist strategies of 
linguistic change can be understood as powerful processes aiming to change public 
perceptions. Feminist language politics are thus no longer regarded as effects of 
feminist politics but as one important aspect of these political movements. The ne- 
gotiation of feminist language politics in public discourse can hence be analysed as 
representative for the negotiation of the role of language for social reality as well 
as gender norms in certain social groups. "An ideological analysis would see 
women as victims of sexist language, unable to intervene in the process whereby 
they are oppressed" (Christie 2000: 51). 

From the point of view of pragmatics, however, strategic language change must 
focus on language use rather than on linguistic forms. 


3.2. Intersectionality 


Far from worrying about excessive ‘political correctness’. [sic] I find it both astonishing 
and disturbing that allegedly feminist handbooks are still, in 1995, so completely inat- 
tentive to the need for guidance on usage in areas where sexism interacts with other op- 
pressions, or where differences exist among women. (Cameron 1998: 158) 


Recent models of complex forms of discrimination (Walgenbach et al. 2007, Cren- 
shaw 1991) go by the name of intersectionality or interdependency models and 
question monolithic conceptions of gendered realities and norms. They ask for the 
normalized conceptualizations which are inherent in ideas of gender as a central 
and universalized categorisation, if no other differentiating factors (e.g. race, sex- 
uality, age, dis/ability) are explicitly taken into account. Taking this perspective as 
a starting point, feminist language politics themselves can be charged with certain 
monolithic gender norms and images they presuppose. As a result, they have to be 
more differentiated. For example, in criticizing certain proposed changes concern- 
ing implicit racism, feminist language politics today is faced with major challenges 
by new models and ways of linguistic change. To put it another way: the feminist 
linguistic debates in Western Europe so far have universalized a middle-class, 
middle age, heteronormative, white gender norm and have thus been racist, classist, 
hetero-sexist and age-discriminating. Interdependency models make it possible to 
critically reconsider how feminist language politics have participated in making 
white norms invisible and thereby universalizing them. If interdependency is taken 
seriously as an analytical perspective on structural discrimination and privileging, 
this means that all feminist language changes have to take into account linguistic 
racism as well, if they do not want to run the risk of being racist themselves. Fo- 
cusing on just one form of discrimination is no solution, as gender and race dis- 
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crimination are intertwined in a very complex way. Interdependency models thus 
hopefully contribute to major changes for future feminist language politics. 


3.3. Outlook 


There are many different forms of feminist language politics today. Many of them 
try to disrupt supposedly pre-given norms, perform constant challenges to naturali- 
zations, create de-dichotomizations, and so on. Another important point which has 
been constantly brought up by feminists is the aim to make different voices heard. 
This could be of some importance for feminist language politics beyond person 
naming forms and would thus substantially widen the scope of feminist language 
politics. “Critical feminist writings focusing on issues of difference and voice have 
made important theoretical interventions that call for recognition of the primacy 
of voices that are often silenced, censored, or marginalized" (Hooks 1995: 300). 
Hooks's statement was formulated with regard to different varieties of English. 
This also shows how important it is to keep in mind that forms of linguistic dis- 
crimination and privileging are multifaceted and most of all complexly interwoven 
with each other — as is very centrally the case for sexism and racism. 


Notes 


1 See for example http://www.bmwfj.gv.at/NR/rdonlyres/2233958B-D6E7-46E8-8350- 
FA435EBF1B92F/33438/Leitfaden nicht diskr Sprachgebrauch.pdf as a more recent 
Austrian example. Interestingly enough, not only sexism or gender equality is taken up 
here, but different forms of linguistic discrimination (see also section 3.2 below). 

2 See for example http://www.bmwfj.gv.at/ChancenGleichheit/Publikationen/default.htm. 
This publication is also interesting as it takes into account different targets of linguistic 
discrimination besides gender, like for example age, sexuality, race. 
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32 English as a threat to other European languages 
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1:2; English as a lingua franca in Europe in the 21st century 
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2.1. The socio-political perspective 
2.2. The linguistic perspective 
3. Conclusions 


This chapter is structured as follows: in part one the official language policy of the 
European Union and the status of the English language in Europe in the 21st cen- 
tury will be outlined. In the second part the question will be posed of whether the 
English language can be considered a threat to other European languages and to 
European multilingualism. This question will be discussed from two perspectives: 
a socio-political and a linguistic one. In a brief conclusion, the title question will be 
answered and a compromise position will be presented between the two extreme 
poles of viewing English as a threat to other European languages and uncritically 
praising its advantages as an easy means of interlingual communication. 


1. Language policy and language use in the European Union 


1.1. The official language policy in the European Union 
and the status of English in Europe 


As of February 2009, the European Union (EU) has 23 official languages (not 
counting immigrant languages), and all languages in the EU can also function as 
working languages, which have legal status, i.e., may function as legitimate means 
of communication. In different EU organs, however, the number of working lan- 
guage may vary. For instance, the EU Commission has three working languages: 
English, French and German, but operates multilingually for public information 
purposes. The official Journal of the EU publishes articles in all 23 official EU lan- 
guages, and in the European Parliament all documents are fully multilingual. 
To keep this multilinguality up, an immense translation and interpreting machinery 
is required. As I have provocatively suggested elsewhere (House 2003), it would 
surely be much more efficient to use one and only one language in EU organs, the 
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only realistic candidate being English. In a similar but more moderate stance, Van 
Els (2005) suggested that in all informal oral consultations the use of English be in- 
stituted since such a measure would greatly enhance operative efficiency. Simi- 
larly, Habermas (1998) pointed out that the use of English in the EU might turn out 
to be a useful unifying instrument in denationalising Europe. 

Such suggestions of a lingua franca solution for Burope have, however, never 
been taken seriously (cf. e.g. Ammon 2006) as they undermine the European 
dogma of diversity and its strong commitment to multilingualism. In the Charter of 
Fundamental Rights, the EU declares that it respects linguistic diversity, and that 
linguistic diversity is a fundamental value of the EU. Despite this commitment, 
there is no common European language policy because language policy is under- 
stood to be the responsibility of the European member-states. The member-states 
are however supported in their decisions on language policy by many EU institu- 
tions which explicitly encourage all their citizens to become multilingual and fol- 
low the so-called “M+2” principle (Europeans should master two other languages 
over and above their mother tongue). In support of this ideal, the EU has instituted 
a number of programs (Lingua, Erasmus, Socrates, etc.) to make sure young Euro- 
peans partake in this diversity, mobility, multilingualism and multiculturalism. 

It is ironic, however, that despite the self-congratulatory rhetoric of Europe’s 
linguistic and cultural diversity, it is an undeniable fact that some languages in Eu- 
rope — most conspicuously English — are more equal than others. According to a re- 
cent EU Eurobarometer, about 40% of Europeans speak English as a second or 
foreign language, and over 50% claim they are able to speak and understand it. 
Further, in 19 out of 29 countries included in the poll it is English which is the most 
widely known language after the respective mother tongues. The penetration of 
English is most noticeable in Scandinavia and in the Netherlands, where just below 
90 % of the population say they know English. European citizens also think that 
they speak English better than any other second or foreign language, and 77% of 
European citizens believe that their children should learn English, and that English 
is the uncontested number one language in Europe, although there are regional dif- 
ferences in the level of English knowledge: in the Northern and Western member- 
states of the EU, English is more widely known than in Eastern and Southern Eu- 
rope. For communication purposes it is thus clearly English which is most widely 
used in Europe. It is used by many European citizens as a means of trans-local, 
trans-regional and trans-national communication — just as it is used in this way 
worldwide. 

A glance at other major EU languages is also revealing: for instance, although 
German is the most widely spoken European language and is the L1 of 18% of 
Europeans in various European countries, only 14% of Europeans master it as a 
second or foreign language. And French, probably still the politically most promi- 
nent language in the EU, is the mother tongue of only 12 % of the European popu- 
lation, with 14% speaking French as a second or foreign language. 
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Given the spread of English across Europe, one wonders what the real reason 
might be for the unabating opposition against English, and why it should be the 
case that in many official policy recommendations, English is just not mentioned, 
e.g., in the so-called Whitebook that emerged from the European Commission 
(1995) pushing the M+2 objective for all citizens of Europe. One plausible inter- 
pretation of this surprising blindness to reality 1s the deliberate attempt to defend 
the linguistic interests of powerful European nation-states, most obviously France. 

Is the official M+2 goal a realistic one? High levels of competence in more than 
one language tend to occur primarily with persons who live in smaller member- 
states with more than one official language and whose mother tongue belongs to 
the lesser-used languages. As to the citizens who best fit the proclaimed ideal of the 
“multilingual European”, they are young, well-educated, born in a country which 
is not the country they currently reside in, who have a strong motivation to learn 
and use languages on the job. All others are (sadly) only proficient in their L1 — and 
have some rudimentary knowledge of English. 

As in many other parts of the world, it is mostly speakers of immigrant lan- 
guages who are truly multilingual. They speak their L1 and the majority language 
of their country of residence, and often also learn a third language, most frequently 
English, which they recognise as a useful lingua franca in Europe. 

With so much support for English "from below", the opposition in principle 
against the use of English also in EU institutions might well diminish in the future. 
Extra (2006: 37) makes the relevant point that the use of English as a lingua franca 
(ELF) in Europe need not at all be seen as clashing with the European tenet of 
multilingualism and multiculturalism. He suggests three steps towards multilin- 
gualism and multiculturalism in Europe: learning three languages, an early start to 
language learning activities, and offering a wide range of languages to choose 
from. Following these actions would open the door to an approach in which immi- 
grant languages in Burope would be included. And he continues "Although this 
may sound paradoxical, such an approach can also be advanced by accepting the 
role of English as lingua franca for intercultural communication across Europe" 
(2006: 37). 

An alternative suggestion to using ELF in the EU is EUROCOM or “Semicom- 
munication", i.e. making use of speakers' receptive competence in their interlocu- 
tors' language. In interactions with one another, speakers of different European 
Lis each speak their own language, and rely on their interactants’ receptive com- 
petence for successful communication to take place (cf. ten Thije and Zeevaert 
2007). How successful this procedure is, will critically depend on the typological 
distance of the languages involved, and of course on speakers’ competence in the 
respective L2s. The use of ELF as much more generally applicable option of se- 
curing communication and EUROCOM are thus not mutually exclusive. 
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1.2. English as a lingua franca in Burope in the 21st century 


English is the current lingua franca worldwide, and in Europe too. What do we 
mean by "lingua franca"? In its original meaning, a lingua franca — the term is 
widely thought to originate in the Arabic lisan al farang — was simply an intermedi- 
ary or contact language used, for instance, by speakers of Arabic with travellers 
from Western Europe. Its meaning was later extended to describe a language of 
commerce, a rather stable variety with little room for individual variation. This 
meaning is clearly not applicable to today's global English, whose major feature is 
its enormous functional flexibility and spread across many different linguistic, geo- 
graphic and cultural areas, as well as its openness for foreign forms. ELF can be re- 
garded as a special type of intercultural communication where there is no consist- 
ency of form that goes beyond the participant level, i.e., each combination of 
interactants seems to negotiate and govern their own variety of lingua franca use in 
terms of proficiency level, degree of code-mixing, degree of pidginisation, etc. One 
might even go as far as saying that with every Korean, Italian and German, Dutch 
and French individual who uses English as a mediating language, there arises a 
unique and genuine speech community where the roles and rules of mutual under- 
standing have first to be established. 

Such formal (and functional) flexibility, coupled with global spread, has led to 
another remarkable feature: That the number of non-native speakers of English is 
now substantially larger than its native speakers (the ratio is about 4 to 1, tendency 
rising). The fact that English is increasingly NOT owned by its native speakers 
makes for a parallel increase in the diversification of the English language through 
a series of acculturation and nativisation processes. The linguistic consequences of 
such large-scale processes of convergence are numerous non-native varieties. And 
this means that there is no monolithic “hegemonic” English voice — but rather a di- 
versity of different voices which reflect differences in the social, economic and 
political backgrounds of its speakers. For inner circle native speakers of English 
the dominant variety they speak is steadily losing influence. ELF is also not a lan- 
guage for specific purposes, a pidgin or a creole language, and it certainly is not 
some kind of "foreigner talk" or learner language. The interlanguage paradigm of 
the late 1960s with its focus on the deficits of learners of a foreign language vis à 
vis the native norm is not valid in the context of ELF. 

Instead of the learner, it is rather the multilingual individual and her “multi- 
competence" (Cook 1992) who is to be taken as a norm or yardstick for explaining 
ELF communication. Such a view is supported in the literature on bilingualism, 
where the notion of a "simultaneous activation" of speakers' L1 and ELF in the 
minds of multilingual subjects is widely accepted today. ELF might reasonably be 
described as a type of "register" — the idea being that when the English language as 
a “virtual language" is employed in different contexts of use, for different pur- 
poses, by different people, it fulfils different functions and is changed accordingly. 
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ELF's main characteristic is clearly its multiplicity of voices, its variability 
(Firth 2009). ELF is a language for communication, a medium that is given sub- 
stance with different national, regional, local, and individual cultural identities. 
The English language does not carry such identities; it does not — but potentially 
can — function as a language of identification. When English is used as a language 
for communication, it is in principle neutral with regard to the different socio-cul- 
tural backgrounds of its users, and has thus undoubtedly great potential for inter- 
national understanding — precisely because in ELF communication there is no 
fixed norm, and because lingua franca speakers must always work out anew — in 
different communities of practice — a joint linguistic, intercultural and behavioural 
basis for their communication. In recent corpus-based work on the nature of ELF 
(House 2009) there is converging evidence that ELF is indeed a sui generis kind of 
English where certain constructions are being re-interpreted and strategically used 
in ways that differ systematically from native English (see below 2.2.). 


2 Is English a threat to other European languages 
and to multilingualism? 


2.1. The socio-political perspective 


Why should it be English and not some other language that has become the world's 
foremost lingua franca? In answer to this question it is often pointed out that the 
British Empire, present day US global power, and an ever increasing technology- 
driven need for easy international border-crossing communication as well as a 
preference for a type of "neutral ground" have paved the way for the English lan- 
guage. This is of course particularly true in countries with very many different lan- 
guages and dialects (such as India). Another reason often given is that English is 
preferred by many people because it is supposedly “a simple code" — a rather du- 
bious reason. 

But what are we to make of the claim made by proponents of “ecolinguistics” 
(cf. Mühlhäusler and Fill 2000) or “liberation linguistics" (cf. Kachru 1991) that 
ELF poses a threat to other languages, that it is in effect a “killer language", a per- 
fidious agent of linguistic dominance and imperialism? (cf. e.g. Phillipson 1992; 
Skutnabb-Kangas 1999) Here one needs to point out that since ELF is a language 
with the currently widest communicative range, many people — if not most — 
choose to use it out of their own free will, and they more often than not make an in- 
tuitive distinction between ELF as a language for communication and their native 
language(s) as their language(s) for cultural and linguistic identification. These in- 
digenous languages and ELF are then NOT in competition, rather they supplement 
each other, there is never a stark either-or situation (cf. Gupta 2009). What about 
the linguistic and the human right to use one’s mother tongue in Europe whenever 
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one so wishes? This is of course always possible; there are no restrictions and sanc- 
tions on the use of any language. However, the decision to use or avoid using any 
given language depends on many situational, cognitive and affective factors. In 
contemporary Europe, individuals are in principle free to choose. As concerns 
ELF, speakers very often know what they are doing when they use ELF - and they 
do it in the majority of cases to their own advantage. De Swaan (2001) characte- 
rises the linguistic-human rights stance versus the utilitarian stampede for ELF in- 
sightfully when he writes: 


Recently, a movement to right the wrongs of language hegemony has spread across the 
Western world, advocating the right of all people to speak the language of their choice, 
to fight ‘language imperialism’ abroad and ‘linguicism’ at home, to strengthen ‘lan- 
guage rights' in international law. Alas, what decides is not the right of human beings to 
speak whatever language they wish, but the freedom of everyone else to ignore what 
they say in the language of their choice (2001: 52). 


The omnipresence of ELF in Europe and elsewhere also gives rise to the following 
paradoxical situation: Using ELF as a language for communication often strengthens 
the use of indigenous languages for identification purposes and as a vehicle of pro- 
test against ELF dominance. We therefore witness today strong counter-currents, 
even in the modern pop music scene, saturated with English language products and 
the Internet, once thought to be classic ELF dominated killer media. Instead of 
large-scale language loss, we are thus confronted with a merging of ELF and native 
varieties which can be interpreted as a kind of "appropriation" in reverse. 


2.2. The linguistic perspective 


One of the main arguments against the use of ELF in Europe and elsewhere is that 
ELF speakers are at a severe disadvantage vis à vis native speakers of English. 
They will never be able to express themselves with the finesse and subtlety they 
have at their disposal in their L1, and often suffer from the "reduced personality" 
syndrome. Here it is advisable to abandon ideologised expectations and rather look 
at the results of recent empirical studies that have examined how ELF speakers ac- 
tually perform in interactions (cf. Firth 2009; House 2009/20092; Mauranen 2009; 
Seidlhofer 2009). 

ELF interactions show surprisingly few instances of miscommunication, mis- 
understandings and repairs — a stark contrast to most native-non-native interactions — 
and a tolerant “Let-it-Pass” behaviour that makes them “robust” and “normal”. 

In the following I will illustrate some recent work inside an ELF project at the 
University of Hamburg. Here we have collected four different types of data from 
international students of various disciplines, namely (1) L1 English interactions, 
(2) interactions between L1 English speakers and ELF speakers, (3) ELF interac- 
tions between speakers of many different L1s and (4) post-hoc retrospective inter- 
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views for collaborative interpretation of data sets (1) to (3). The results of an analy- 
sis of the ELF interactions largely confirm previous findings, but also revealed four 
additional characteristics of the use of ELF: 


(1) Transfer of LI discourse conventions 

For example: East Asian speakers' specific topic management, which results in a 
number of non-sequitur turns-at-talk (House 2002) — a phenomenon consistently 
ignored by the other participants, such that it never leads to a breakdown of the 
conversation (cf. data excerpt 1 which shows two such non-sequitur exchanges). In 
this excerpt, as in the following ones, length of pauses and other non-verbal phe- 
nomena will be described in brackets, laughter via the symbol @, latching via the 
symbol =, and the points focussed on in the analysis given will be highlighted in 
bold print: 


Data excerpt 1 


Wei: I think English is easy to learn than for example German or Chinese. 
(3 sec) 

Mauri: YesIthink it is about the important meaning of the erm of the English lan- 
guage is is (2 sec) this is erm business language is very important but do 
you agree then that all the people of the world that they eh would speak 
English? 

Wei: No they don't have problem like French French people I think erm (2 sec) 
in China student he likes speak learn English and European languages. 

Mauri: They had to they have to they have to learn English because this is getting 
more and more important in business world so if you cannot speak Eng- 
lish so you're lost. 


(2) Use of multi-functional gambit " Represent" 

In the Hamburg ELF data (parts of) previous speakers’ moves are very often “re- 
presented". “Represents” (Edmondson 1981), in the literature also referred to as 
"echo-", “mirror-” or “shadow-elements, are discourse markers or gambits which 
fulfil various functions, they can act as a strategy with which speakers’ working 
memory in comprehension and production is supported, as a coherence-creating 
strategy with which lexical-paradigmatic clusters are systematically built up, as a 
signal of receipt and a confirmation of comprehension, and as a meta-communi- 
cative procedure that strengthens interactants’ awareness of their own and others’ 
talk (House 2002). Represents typically occur in genres such as psycho-therapeutic 
interviews, instructional discourse and aircraft control discourse, where in- 
formation is deliberately and routinely restated to create coherence and ensure 
understanding. The fact that ELF speakers use this convention shows that their 
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strategic communicative competence is very well developed, indeed. Consider 
data excerpts (2) below: 


Data excerpt 2 


Brit: And if erm things like Nigerian English, Indian English which is a sort of 
variety in itself it should be respected 
Mauri: Should be respected 


(3) Co-construction of utterances 

As is already evident in data excerpt (2), there is a strong and consistent demon- 
stration of consensus in this ELF talk adding to a feeling of community and group 
identity (House 2002). Consider data excerpt (3), where three participants join 
forces to build up the discourse: 


Data excerpt 3 


Wei: The most of the most of Chinese in foreign countries they speak not Man- 
darin they don't speak Mandarin but can only these erm 

Mauri: Dialects? 

Wei Yes dialects 

Joy: Dialects 

Mauri: Dialects their dialects 


(4) Re-interpretation of various discourse markers: "you know", "I mean", 
“T think", “I don't know", “yeah” 

The marker you know is used in ELF talk as a routinised self-serving strategy to 
make salient various coherence relations and to “fumble” for words. It is thus NOT 
used — as we know from the rich literature on discourse markers in general and the 
use of you know in particular — as a polite hedge or a sociocentric expression with 
which speakers appeal to knowledge shared with their addressees. Rather, the data 
shows an amazing frequency of the co-occurrence of you know with the conjunc- 
tions but, because and and. The analysis also reveals that you know is often used 
instead of these conjunctions taking over their functions in the discourse (House 
20092). 

The discourse markers J think, I don't know and I mean also tend to be re-inter- 
preted in ELF talk (Baumgarten and House 2007). As compared to native speakers 
of English, ELF speakers frequently use the constructions I think and I don't know 
in their prototypical meanings, preferring formal structures (main clause comple- 
ment structures) over the more grammaticalized structures and pragmaticalized 
meanings as these are expressed in the verbal routine forms of these constructions. 
In other words, / think expresses the speakers subjective opinion, and J don't know 
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is used as a marker of the speaker's insufficient knowledge about the topic of the 
discourse. 

The expression / mean is often used with a strong evaluative element in ELF 
talk over and above the main function of reformulation/clarification (Baumgarten 
and House 2007), i.e. / mean functions as a focalizing device in the speaker's con- 
tributions which serves as a point of departure for an explicit expression of a sub- 
jective evaluation, expressing affective involvement in the topic and the discourse. 

The gambit yeah/yes is employed by ELF speakers with a variety of different 
functions (Spielmann 2007) such as, for example, uptaking (alter's message), 
backchannelling (both supporting alter's message and signalling that no claim for 
turn take-over is being made), agreeing with alter's moves; structuring discourse. 
All of these uses are potent tools for making the discourse (appear) “normal”. So- 
ciable and face-saving communicative experience, the inoffensive undertone that 
comes with the use of yeah seems to render it particularly suitable for ELF dis- 
course, where interactants persistently strive for an overall positive mood, avoid- 
ing everything that could threaten someone's face or otherwise hinder conver- 
sation. Most instances of yeah found in the Hamburg data match at least one of the 
three major categories: backchannel signals, agreement markers and discourse 
structurers. Like a non-verbal Hmm or gestures like nodding one's head, yeah is a 
non-invasive way of showing support for the interlocutor and initiating a positive 
mood in ELF talk, a mood in which the speakers can concentrate on content rather 
than form without having to worry about being stigmatized as non-natives. 

ELF speakers often exploit the positive import of yeah/yes to downtone objec- 
tions and opposing opinions. Much like you know, yeah can also be used as a self- 
supporting strategy. The frequent use of yeah/yes in ELF discourse can be ex- 
plained with its polyfunctionality — a characteristic that makes its use “communi- 
catively effective" because a lot of pragmatic content is packed into minimal ver- 
bal form. This does not mean, however, that the use of yeah/yes in ELF talk is 
"simple" and that it is an indication of an underdeveloped competence in English. 

Given the evidence from the Hamburg ELF data as well as from other recent 
corpus studies, notably work from the Vienna VOICE corpus (Seidlhofer 2009) 
and the Helsinki ELFA corpus (Mauranen 2009), ELF users' strategic competence 
is admirably intact. The claim that ELF users suffer from a “reduced personality" is 
not confirmed in recent empirical work. 

Another often heard claim in connection with the use of English today is that 
the massive influx of English lexis into other European languages damages pro- 
cesses of thinking and concept formation in these languages, and that in certain 
areas it will be difficult to express both scientific and everyday concepts in them. 
This claim is compatible with the strong Humboldt-Sapir- Whorf hypothesis (the 
so-called "linguistic relativity hypothesis") which, put in a simplified way, ascribes 
one's native language a strong and unavoidable influence on thinking and behav- 
iour. This hypothesis cannot be upheld for three reasons: the universal possibility 
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of translation, the fact that languages in performative use are “anachronisms”, i.e., 
their forms do not normally rise to our consciousness; that multilinguals possess a 
"deep" common conceptual store to which “lower level" language-specific sys- 
tems are attached. In other words, there is always a way out of the conditioning in- 
fluence of one's L1, and indeed any other language, through language itself, its cre- 
ativity and flexibility and through the amazing richness of the human brain. 

This argument is supported by neurolinguistic studies of translation (Price, 
Green and von Studnitz 1999) showing that multilinguals move flexibly between 
L1 and L2. The two systems are distinct but essentially permeable. Further, there is 
no proof of the claim there is a link between only one particular language — the 
native tongue — and thinking and conceptualising. An increased use of ELF es- 
pecially at the tertiary level of education must therefore not necessarily inhibit 
knowledge in students' L1s. However, any unnecessary use of English, e.g. at con- 
ferences in non-English speaking European countries, where another language 
shared by participants is available, might well be avoided, and alongside inter- 
nationally recognised publications in the English language, scholars are well ad- 
vised to also publish in their respective L1s. 

Another frequently heard argument against ELF use is that it “contaminates” 
other languages. This argument can be relativised on the basis of the results of an- 
other empirical research project currently conducted at the University of Hamburg: 
The project “Covert Translation" supported by the Deutsche Forschungsgemein- 
schaft in its Sonderforschungsbereich Mehrsprachigkeit SFB 538) since 1999 (cf. 
Baumgarten 2007; Bührig and House 2007; Bóttger 2008; Becher, House and Kra- 
nich 2009). The project's initial hypothesis is as follows: Due to ELF's global 
dominance and the massive uni-directional translation movement from English 
into other languages, ELF also gradually influences — over and above the well- 
known lexical invasions — communicative norms and preferences in other lan- 
guages via translations. For example, German communicative styles (House 2006) 
will be adjusted to Anglophone ones such that conventionally transactional and 
detached communicative styles in German texts where direct, content-oriented, 
and explicit expressions used to be preferred are now moving towards Anglophone 
interactional and involved stylistic preferences of generally more indirect, more 
interpersonally oriented and more implicit expressions. 

The project hypothesis is tested using a tripartite diachronic (1978—2006) 1 mil- 
lion word corpus of original English and German texts as well as German trans- 
lations of English texts (to a lesser degree also French and Spanish originals and 
translations) in two (potentially vulnerable) genres: popular science and business 
communication. The analysis of this data consists of a combination of qualitative 
and quantitative methods, with the qualitative part consisting of contrastive text 
analysis (House 2009b), the quantitative method comprising frequency counts as a 
basis for a renewed qualitative method using detailed re-contextualised analyses of 
the translation relation in individual texts. 
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Qualitative analyses of some 80 texts published in the late 70s and 80s of the 
20th century reveal initially no influence of English communicative norms on Ger- 
man ones, analyses of texts dated from the beginning of the new millennium, how- 
ever, reveal an impact of Anglophone norms on German texts (but not on French 
and Spanish ones) in the areas of subjectivity and addressee orientation. These 
areas were operationalised for quantitative analyses and statistical operations to 
comprise such phenomena as personal deixis, coordination, modality, mental pro- 
cesses, and various markers of connectivity. Quantitative analyses show, for in- 
stance, changes in the use of the coordinate conjunction and and und in sentence- 
initial position over the past 25 years (Baumgarten 2007) but also in the use of 
other functional categories such as personal pronouns, expressions of modality and 
conjunctions. Recontextualised qualitative analyses of the translation relation be- 
tween 1st person plural personal pronouns start from the remarkable increase of 
these pronouns in the economic texts over time (cf. Baumgarten and Ozcetin 
2007). While in the majority of cases the English pronoun we was translated as the 
formal and functional equivalent wir in German, there were also interesting vari- 
ations, where, e.g., we is expressed in a non-equivalent structure. This is for in- 
stance the case when congruent realisations of states of affairs in English are ren- 
dered by grammatical metaphors in German, or when inanimate noun phrases in 
subject position (common in English) are dispreferred in German. These variations 
show that stylistic preferences of German speakers and language specific aspects 
of the German language are retained. The most surprising finding is however that 
there is a higher frequency of wir in the German economic texts, which thus exhibit 
a stronger orientation towards persons and a presence of writer-reader interaction. 

Taken together, certain linguistic means (speaker-hearer deixis, modality, con- 
nectivity) of realising subjectivity and addressee orientation in German trans- 
lations and comparable, original German texts have increased over the past 25 
years in popular science and business communication. This increase may event- 
ually change the nature of communicative styles in German texts. We cannot say 
however that this change is due to direct contact with ELF in translations because 
original texts have sometimes changed more than translations, and because mono- 
lingual diachronic reference corpora need to be taken into account which can give 
important additional information about language change. 


3. Conclusions 


The use of ELF in Europe is not necessarily a threat to multilingualism in Europe. 
ELF is a useful tool for interlingual and intercultural communication (probably 
more useful, and certainly more widely employed than the EUROCOM model) - it 
is an additional language, never a substitute for European citizens' native lan- 
guages. Neurolinguistic studies of translation and code-switching do not support 
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the assumption that ELF inhibits or damages conceptualisation in L1. It is the re- 
sponsibility of the intellectual elite of non-Anglophone countries to make sure that 
publications in English are supplemented by publications in their respective native 
languages. 

ELF is a hybrid language: Transfer from L1 into ELF is widespread. Because of 
ELF speakers’ inner dialogicity, evaluation norms should not be any longer inner 
circle English speakers’ competence, but rather multilingual experts’ use of ELF. 
Acceptance policies in publication organs and institutions employing teachers of 
English should therefore no longer follow rigid native English speaker norms or 
consider exclusively native English speakers for teaching posts. 

Studies of the influence of ELF on other languages show that for instance Ger- 
man norms — but interestingly not French or Spanish ones — are open to English 
influence to a certain degree and in certain domains. However, no claims for their 
origin can be made at the present time. 

There is an understandable association of ELF with global US economic power 
and cultural domination. It should not be forgotten, however, that the English lan- 
guage is here but an instrument. What we need are neutral descriptions based on 
research, rather than facile value judgements. Power transported via language can 
of course lead to a deplorable linguistic sameness, as can be witnessed today in the 
type of routinized, supposedly consumer-friendly service encounters in US Ameri- 
can owned chains all over the world. But luckily we also witness mounting resis- 
tance to such communicative nivellation foisted upon traditional communicative 
norms in other countries. 

Language policies in Europe should opt for a sensible "third way" steering 
clear of the extremes of fighting ELF for its dangerous "linguistic imperialism" and 
accepting ELF uncritically for its benefits. Today in Europe, ELF is not used instead 
of other European languages but rather as a kind of additional or “co-language” 
(cf. Fishman's 1977 early characterisation), employed in conjunction with, but not 
necessarily in opposition to, other European languages. 


See also the following chapters in this volume: 25 by Extra, 33 by Pahta and Taa- 
vitsainen and 34 by Wodak and Krzyzanowski. 
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33 English in intranational public discourse 


Päivi Pahta and Irma Taavitsainen 


1. Introduction 
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5: Conclusion 

1. Introduction 


The rapid spread of English as a global language after World War II has been en- 
hanced by socioeconomic globalization and technological developments contribu- 
ting to increasing translocal contacts and transcultural flows (see Pennycook 2007). 
It has brought about a global phenomenon, also attested in European non-Anglo- 
phone societies, whereby English, primarily learned as a foreign language through 
education for international and lingua franca communication, is increasingly used 
in intranational contexts of discourse traditionally reserved for domestic languages. 
English has penetrated these societies through business, politics, instruction, 
media, popular culture and advertising, and is visible in various ways in people’s 
everyday lives, in public spaces, at work and at home. It has been stated that English 
is gaining new functions and plays a new role in European citizens’ identities with 
layers of local and global components. At the same time, Europe is reinventing 
itself with a new focus for foreign language learning across the continent with 


a wider ideological project to improve citizens’? awareness of the multilingual nature 
of Europe, to encourage a positive attitude towards linguistic diversity, and to promote 
the learning of several languages. The expected benefits of such a programme include a 
better understanding between neighbouring nations, improved mobility of people in 
work and study, and an enhanced sense of a shared European identity. In this respect, 
21st-century Europe seems keen to roll back the centuries-long modernity project which 
created largely monolingual societies. (Graddol 2006: 92) 


The above view with its idealistic goals describes the situation in 2006, and Grad- 
dol estimated the implementation of the project to take 30—50 years. In the same 
year, the results of a special Eurobarometer survey on Europeans and their Lan- 
guages, including 29 European countries, indicated that English had firmly estab- 
lished itself as the most widely-spoken language in addition to the mother tongue 
in Europe. Recent research also shows that in some parts of Europe changes have 
already taken place to the extent that English has acquired an integral part in the 
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linguistic repertoire of the population, appropriated in various ways in both private 
and public communication in international and intranational functions. For many 
speakers of other European languages, English has become a natural resource in a 
range of settings, used instead or together with the national and other community 
languages, and mixed with them in various ways. 

English in intranational communication within non-Anglophone Europe, a lin- 
guistically and culturally pluralistic region, is not a monolithic entity. Its status, 
uses and functions, and the attitudes towards it vary from one European country to 
another, and have been examined in research accumulating over the past decades. 
This chapter provides an overview of the general trends and highlights in the body 
of research to date in this field. Our main focus is on language use in the public do- 
main, including contexts of discourse such as public spaces and public institutions, 
the media and the workplace. For illustration of the actual use of English in intra- 
national public discourse, we focus on the North of Europe, especially Finland, 
where the sociolinguistic situation in the post- World War II period of globalization 
and European integration has undergone a gradual, large-scale shift, with English 
emerging from a traditional foreign language to a new kind of second language, a 
resource language used extensively in both private and public discourse. 


2. Research history in time and space 


The role of English in European non-Anglophone countries was addressed in 
occasional studies from the 1940s to 1970s, but on a larger scale the topic was 
thematized in research in the 1980s. The early studies examined the influence of 
English on other European languages, specifically on their lexicon, in the form 
of "Anglicisms", defined by Górlach (2002) as “the import of English words 
and phrases" (see Viereck and Bald 1986; see also Rosenhouse and Kowner 2009). 
A useful tool for tracing this research is provided by An Annotated Bibliography 
of European Anglicisms (Górlach 2002), with a selection of publications in the 
field up to 1995, and some later references up to 1998. The entries are organised 
by country; the coverage is uneven, as it was not possible to find contributors in 
all Buropean countries, and qualitative criteria were also applied in the selection 
process (Górlach 2002: x). Another indispensable research tool is A Dictionary of 
European Anglicisms, recording the phenomenon in sixteen European languages 
(Görlach 2003). 

Research on Anglicisms has been particularly prolific in German-speaking 
countries and shows that the influence of English was seen in intranational com- 
munication in German contexts already in the 1950s, especially in press publi- 
cations. The rapidly increasing amount of research and public debate towards and 
during the 2000s suggests that English is present in several social domains in Ger- 
man-speaking parts of Europe, although the actual use of English in many con- 
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texts, including journalistic writing, is often overestimated. A corpus-based study 
of Anglicisms in Austrian and German newspaper language shows that, despite an 
increase in the number of English in special terminology in fields like information 
technology over time, the quantitative impact of English on general written Ger- 
man is in fact marginal (see Onysko 2007). The Nordic countries also have a long 
tradition on Anglicism research. Recent ventures include a large-scale project, 
initiated in 1999 by the Nordic Language Council, investigating the linguistic con- 
sequences of globalization, particularly Anglicization, in seven Nordic speech 
communities, i.e. Danish, Faroese, Finnish, Icelandic, Norwegian, Swedish in Fin- 
land and Swedish in Sweden. Results are reported in several publications (e.g. 
Kristiansen 2005 and Ostman in this volume). 

With the growing impact of English within both international and intranational 
contexts, areas of interest and approaches (see section 3) have gradually become 
more varied. Geographically, Western and Northern Europe have received the most 
attention from the very beginning, while Southern European countries are also 
examined in several studies. The 2000s saw a rapid growth in research on English 
in intranational use all over Europe, including research on Eastern Europe; numer- 
ous articles addressing issues in this field appeared especially in the journals Eng- 
lish Today and World Englishes. Some studies focus on English in intranational 
communication within individual Buropean countries, while others explore domain- 
specific uses of English, for example, in advertising, media, education, science, 
and various professional contexts of language use. 

Larger territories have been examined in a comparative frame in individual re- 
search articles, monographs, and thematic collections of articles. A collection of 
studies by Anderman and Rogers (2005) is concerned with the impact of English as 
the lingua franca and its relationship with the languages of Europe. Other compre- 
hensive treatments of English in Europe are a volume of studies edited by Cenoz 
and Jessner (2002), and a special issue of World Englishes edited by Deneire and 
Goethals (1997), including contributions on France, Italy, Germany, the Nether- 
lands, Belgium and Austria, and a valuable selected bibliography on research 
on various aspects of English in Europe. Some studies adopt a domain-specific 
comparative angle. A volume on the dominance of English in science is edited by 
Ammon (2001), and research on the role of English within the media worlds of 
European youth is reported on in Berns, de Bot and Hasebrink (2007), while Ger- 
ritsen et al. (2007) examine product advertisements in five Central and Southern 
European countries. A special issue of World Englishes, edited by Hilgendorf 
(2007), explores the role of English in five Central and Northern European coun- 
tries (Finland, France, Germany, the Netherlands, Sweden) and a range of issues, 
including the intranational uses of English in specific domains (education, busi- 
ness, advertising, Internet-based media), the bilingual's creativity with English as 
exemplified in code-mixing and lexical hybridization, and the conflicting attitudes 
towards English. 
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Major political changes, such as the collapse of the Soviet Union, the fall of the 
Iron Curtain and the expansion of the European Union and NATO, have affected 
the linguistic situation in Europe in various ways since the 1990s. These changes 
are also reflected in research activity on the status and uses of English in Europe. 
The rapid increase of English in intranational communication in Eastern European 
countries and social contexts where Russian earlier dominated as the lingua franca 
is discussed in several studies, illuminating the practices, for example, in Poland, 
Bulgaria, Macedonia and Hungary, and in Russia. 


3. Approaches 


The intranational uses and functions of English in Europe have been examined 
from various theoretical and methodological backgrounds. Early research largely 
concentrated on occurrences of words, expressions and linguistic structures, 
whereas analyses of the functions and meanings of English, the motivations for 
using it and attitudes towards it have gained emphasis in more recent work. The 
bulk of research to date is qualitative, although some surveys of the use of English 
in various countries apply statistical methods in quantitative assessments. 

In early studies on Anglicisms, investigating the role of English in the frame 
of contrastive linguistic analysis or lexical borrowing, the adopted items were 
normally assessed out of context, e.g. using word lists, and focusing on form rather 
than function (see Górlach 2002). In contrast, more recent lexical studies examine 
the embedding of the originally foreign elements into native language discourse in 
their communicative context, for example, in various media, like Onysko (2007), 
analyzing newspaper writings in German-language contexts. Formal consider- 
ations have also given way to functional analyses. Here new language elements are 
viewed as a linguistic resource, used for creating subtle meanings in the local con- 
text of discourse and appropriated for a variety of purposes and functions, includ- 
ing identity-work, as illustrated in research on the local meanings of English in fan 
fiction by Finnish writers and on multilingual advertisements in Germany. Some 
studies also consider the fuzzy nature of these lexical elements on the continuum 
from borrowing to code-switching, thus problematizing the theoretical notion of 
discrete linguistic boundaries. With these changes, the whole paradigm of lexical 
analysis has become broader and moved across to more macrolevel assessments. 
The analysis of practices in quoting English phrases and idioms without expla- 
nations, observed in public talk in TV, radio or public speeches at local institutions 
like the Parliament, or playing with English segments to create humorous mean- 
ings in various printed materials targeted to local audiences, provides evidence of 
the increasing use of English in intranational public contexts of discourse as a re- 
source language, and contributes to the paradigm shift in research (Taavitsainen 
and Pahta 2008). 
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The theoretical foundation of research on English in non-Anglophone contexts 
had a boost in the 1980s from Kachru’s theory of World Englishes, dividing them 
into three categories, the Inner Circle, the Outer Circle and the Expanding Circle, 
representing “the types of spread, the patterns of acquisition, and the functional al- 
location of English in diverse cultural contexts” (Kachru 1992). The model roughly 
corresponds to another well-known division of Englishes based on speaker back- 
ground and identity into English as a native language (ENL), a second language 
(ESL) or foreign language (EFL). In Kachru’s model, primarily stemming from the 
need to address issues in post-colonial ESL varieties, non-Anglophone Europe is 
placed on the Expanding Circle. The notion of linguistic imperialism, adopted in 
critical views on the hegemony of English as a killer language also in non-Anglo- 
phone European contexts, largely derives from Kachru’s theoretical frame (see e.g. 
Phillipson 2004; see also the chapter by House, this volume). However, the rapid in- 
crease of English in intranational functions also challenges the three-way catego- 
rizations, and in several recent studies research questions have been posed accord- 
ing to the widening of circles and shifting positioning of countries within the 
models (see Mesthrie and Bhatt 2008: 27-36). A new kind of situation is witnessed 
in several European countries as they have changed or are changing positions from 
EFL to ESL, with English as a new kind of second language. Some multilingual 
countries like Switzerland and Belgium have been noticed to employ English 
widely for intranational communication (Mesthrie and Bhatt 2008: 211-212), and 
already in the early 1990s a shift from EFL to ESL was seen under way in the Nordic 
countries. However, recent corpus-based research indicates that, so far at least, the 
increased use of English has not led to the development of an endonormative conti- 
nental variety of English, the so-called Euro-English; instead, Europeans still turn 
to native, Inner Circle varieties of English as their target model (see Mollin 2006). 

Surveys of people’s own judgements about their language attitudes and lan- 
guage skills are regularly conducted in European countries by questionnaire 
methods in the frame of Eurobarometer surveys, commissioned by Directorate 
General for Education and Culture of the European Commission. The survey on 
Europeans and their Languages (2006) in 29 European countries shows that 38 per 
cent of the non-English-speaking respondents knew English enough to be able to 
carry out a conversation in it; the percentages in individual countries ranged from 
23 (Bulgaria and Hungary) to just below 90 (Sweden 89%, Malta 88 %, Nether- 
lands 87 %, Denmark 86 96). English was the most widely-spoken language in ad- 
dition to the mother tongue in 19 of the countries covered in the survey, while the 
next two languages, German and French, held this position in three countries each 
(p. 63). These figures testify to the wide ability of Europeans, particularly in some 
countries, to adopt English in their verbal repertoire and appropriate it as their lin- 
guistic resource in various contexts of discourse. 

Survey methods have also been used for examining the intranational functions 
and uses of English in individual countries or specific regions using interviews or 
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questionnaires with different scopes and foci. Three extensive surveys have been 
carried out in the Nordic countries. Kristiansen (2005) reports on a survey con- 
ducted through telephone interviews in seven Nordic speech communities with 
500-1000 informants in each community, focusing on loanwords. A large-scale 
investigation into the use of English, and on attitudes and problems related to it, in 
everyday life in Denmark was carried out by Preisler (1999). The project consisted 
of two complementary investigations: (1) a quantitative analysis of responses to a 
questionnaire given to a random sample of the Danish adult population (N=856 re- 
spondents), and (2) a discourse analysis of ethnographic interviews with represen- 
tatives of some of the youth subcultures assumed to be spearheading the use of 
English in Denmark (Preisler 1999: 240). A national survey was conducted in Fin- 
land in 2007 by the Research Unit for Variation, Contacts and Change in English 
(Leppänen et al. forthcoming). This survey was designed to provide up-to-date, 
quantitative information about the status, influence and functional range of English 
in Finland and the attitudes towards it. This research design was chosen in order to 
complement earlier qualitative research providing some more narrowly focused 
glimpses into contexts where English is used quite extensively in the country, and 
to point out further questions for detailed discourse-analytic and ethnographic 
study. The target group were Finnish residents from the age of 15 to 79 (N=3000), 
and the return rate of the questionnaire was 49 per cent (N=1495). The results of 
the comprehensive surveys in both Denmark and Finland reveal frequent exposure 
to and use of English in a wide range of everyday communicative situations, in- 
cluding professional, educational and leisure-time contexts of discourse. They also 
indicate that majority attitudes towards English are in general positive. At the same 
time, the results indicate that English even within one European country is not a 
monolithic entity. For example, the Finnish survey reveals great variation between 
the respondents, particularly in the actual active use of English by age, profession, 
education and place of residence. 

Triangulation, the use of multiple methodologies, is becoming increasingly 
common in research on the uses and functions of English. As in Preisler's survey 
(1999), questionnaire methods are in some studies complemented by interviews, so 
that both quantitative (including significance tests and cluster analysis to indicate 
points for further investigation) and qualitative methods can be combined. Corpus 
methods, suited for the analysis of data available in electronic form, have been 
used to study the extent of the influence of English in, for example, the Netherlands 
and Austria (Gerritsen et al. 2007, Onysko 2007). Corpus methods were also ap- 
plied in a comparative analysis of English-Swedish code-switching in spoken dis- 
course collected from business meetings and casual conversations between young 
adults in Sweden (Sharp 2001). These studies show that corpus-linguistic analysis 
of large data pools, so far an underused method, has much to offer to research in 
this area, particularly in a variationist frame and combined with careful contextual 
analysis of the data. 
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A different approach to studying multilingualism and the increasing impact of 
English, called Linguistic Landscape (LL), has been developed more recently (see 
e.g. Shohamy and Gorter 2008). The notion refers to linguistic objects that mark 
public spaces, and before the name of the approach was coined, outdoor media 
were named as the object of research. The approach has its theoretical foundations 
in sociology and shows the successful cross-fertilization of different disciplines. 
With regard to its theoretical and methodological background, LL research draws 
from several sources such as ethnography of communication, geosemiotics, gram- 
mar of design, genre studies, and research on multilingual advertising. Multidis- 
ciplinary and multimodal views are advocated, and studies move at the interface of 
sociology, linguistics, and semiotics. Any written sign outside private homes, 
from road signs to private names to names of streets, shops or schools, may count 
as linguistic landscape items. The study of LL deals with these items from various 
points of view, for example according to the relative salience of languages used in 
these signs, assessing what languages are available for communication and what 
their symbolic functions are. Items in various languages can be interpreted from 
the point of view of status and power, and attitudes towards the attested languages. 
LL studies are directly relevant for concerns of intranational use of English in 
various European countries. Studies can have various foci of attention depending 
on the research question. The method has been adopted, for example, in research 
dealing with Eastern European countries from the ideological point of view after 
the post-communist period. In Finland, studies applying LL have paid attention to 
"streetwise English", i.e. to the use of English and quasi-English in the outdoor 
media in urban and rural areas, including trilingual practices in Helsinki, account- 
ing for the domestic languages Finnish and Swedish accompanied by English (see 
Taavitsainen and Pahta 2008). These studies show that English items in public 
spaces are mostly targeted at local audiences, though the increase of tourists and 
migrants not proficient in the local languages has to be taken into account as well 
as an explanatory factor. Nevertheless public spaces function locally, and a central 
issue of the LL approach is the relation of local and global elements in urban en- 
vironments. 


4. Communicative contexts: Examples of English in the media 


After the broad strokes of the previous sections, this section provides a closer view 
into actual use of English in one of the key domains where English has become par- 
ticularly prominent in intranational public discourse, i.e. the media. The examples 
derive from our own research on various types of mass media contexts in Finland, 
including commercial discourse in the outdoor and print media, and uses in news- 
papers and magazines. Our reasons for focusing on written mass media contexts 
stem from their nature as societal forums that are particularly visible and influen- 
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tial, and thus important agents in the dissemination of practices among local popu- 
lations, also contributing to sociolinguistic change. 

The prominence of English in commercial advertising and naming practices in 
posters, billboards, electric displays and shop signs is one of the most noticeable 
manifestations of the global spread of the language. It is also attested in Europe, 
and has been addressed in numerous studies on different European language com- 
munities. In Finland, like elsewhere, internationalisation has caused many com- 
panies with originally Finnish names to adopt English or English-sounding names 
as part of their new corporate image, but English is also common in the names of 
companies operating in the domestic market. In addition to the outdoor media vis- 
ible in the linguistic landscape, the phenomenon can be studied and traced in 
printed texts, including various types of promotional materials and the print media. 
Here English is often used in an emblematic, symbolic way to conjure up an image 
of something young, trendy, fashionable and dynamic. For example, the yellow 
pages in the Helsinki area telephone directories from the mid-2000s list a fair 
number of locally operating enterprises appropriating English in their image cre- 
ation. These include fitness centres (Finnish kuntosali) like Blue Fitness, Finn- 
body, City Gym, Hot Gym, Move! Wellness Center, Let's Go Center, and a large 
number of Fitness Clubs with different Finnish attributes. The long list of hair- 
dresser's and barber's shops in the same source contains items like Salon Hair 
West, Salon Sir, Clippers, Cuttin’ Club, Design Point, Hair Art, Hair Gate, Hair 
Space, Nice cut, Shaggy Head and Top-Hair. Restaurants with English names are 
numerous, although other languages also feature in this field. English has also 
found its way into less obvious fields. Companies providing temporary workers 
include Office Help, Banquet Service, Businesslike, Star People and Manpower. 
A service called Smart Repair fixes windscreens. Some names are a mixture of 
Finnish and English: instead of Star People or Manpower, one can hire staff from 
Apu Team (‘help team’); for an electrician, one can turn to Sähkö-Eagles (‘electric- 
ity eagles’); and for dance lessons, to PilviSteps (‘cloud steps’). The English words 
Center, Shop, Group, Service and Systems are commonly used to form compounds 
with Finnish general or proper nouns In commercial advertising aimed at Finnish 
consumers, English is frequently used for key words and slogans, often by the side 
of the domestic languages. Monolingual English advertisements are also common, 
and advertisers frequently make use of the Anglo-American cultural knowledge of 
consumers (Taavitsainen and Pahta 2008). Similar findings on advertisement are 
reported from other European countries. For example, in an empirical study of 
2384 product advertisements aimed at young women in Dutch-speaking Belgium, 
French-speaking Belgium, France, Germany, the Netherlands and Spain (Gerritsen 
et al. 2007), more than two-thirds of the advertisements contained at least one Eng- 
lish word, and 13 per cent were entirely in English. English was used especially for 
advertising products that can be associated with modernity. Gerritsen et al. (2007: 
293) point out that the results of their study "can be considered as one indication of 
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the penetration of English within society as a whole, since advertising is used 
across a society and is not preserve of a specialized group". 

As with any changes in patterns of language use, the increase of English in ad- 
vertising in Finland has taken place gradually over a long period of time. It has 
been claimed that the use of English has ranked high in strategies for giving 
fashionable names to Finnish companies and products since the 1980s, but in fact 
American influences prevailed in advertising as early as the 1960s, when modern 
consumer society was rapidly developing in Finland. A diachronic survey of multi- 
lingual practices in advertising on the business pages of the Helsinki telephone di- 
rectory from 1950 to 2003 shows that over the 50-year period English has become 
increasingly common in the names of many different kinds of local enterprises, in 
industries like beauty and fitness, entertainment, and IT, but also for companies 
providing, for example, security services or temporary workers. The study also in- 
dicates a clear trend from functional to symbolic, image-creating uses. In recent 
years, creative use of English has increased, resulting in names that contain puns or 
other types of word play. Examples are provided by names like Shortcut (a hair 
salon), /deal fit (a fitness centre) or My Way (a driving school). In this context, vari- 
ous types of hybrids and mixtures that can only be understood by speakers of Finn- 
ish have also become common. The most innovative creations include the name of 
an electrical goods shop, LIGHTinen, which plays on the translingual homonymy 
of the shop-owners' Finnish family name Laitinen and their merchandise. 

Another trend in the use of English in intranational functions in the business 
domain becomes obvious in the analysis of job advertisements in daily news- 
papers. For example, in Helsingin Sanomat, the biggest daily newspaper in Fin- 
land, job advertisements entirely in English by international, Scandinavian or 
Finnish companies are quite common. In the era of globalization this is hardly sur- 
prising, also considering the requirements in the English skills of the prospective 
employees. However, it is equally common to find job advertisements by com- 
panies operating on a more local scale with Finnish and English co-occurring in a 
fixed pattern: the text of the advertisement is in Finnish except for the actual job 
designation, which is given in English. The same pattern has also become a regular 
feature in business news in the reporting of appointments; in both contexts, English 
job titles often receive a Finnish inflection, sometimes resulting in surprising hy- 
brids, as in the first item in example (1). The inflection is attached to the last item in 
the noun phrase, as it would be in the corresponding Finnish title, but in the first 
item this yields a nonsensical expression. 


(1) NNon nimitetty [‘has been appointed’) Director of Software Developmen- 
tiksi [‘as development ], NN Account Directoriksi ja [‘and’] NN Technical 
Project Manageriksi (Helsingin Sanomat, July 2005) 


A similar bilingual pattern is also attested in job advertisements in several other Euro- 
pean countries, e.g. in France, Germany, Sweden, Switzerland and the Netherlands. 
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English frequently occurs in other kinds of notices in the print media. Examples 
of various types of use abound in the *Where to go" section of Helsingin Sanomat 
Nyt, a weekly magazine delivered with the newspaper. These include expressions 
of stance (example 2) and wordplay (3). Example (4) contains a humorous presen- 
tation of a long and complicated lecture title that, without a translation, requires 
considerable fluency from the readers, and a catch phrase that requires fluency of a 
different kind. 


(2) Paneelikeskustelua hostaa The Man, kansleri Kari Raivio! [^The panel is 
hosted by The Man, Chancellor of the University'] (Helsingin Sanomat Nyt, 
March 2006) 


(3) Klubilla nähdään stand up ja close up -taikuutta. Ei kai sentään sit down 
-magiaa? [*The club presents stand-up and close-up magic, but not sit-down 
magic, one hopes.'] (Helsingin Sanomat, Sept. 2006) 


(4) Kuka voisi vastustaa ... alustusta, jonka otsikko on niin napakka kuin “RNA 
polymerase structure-function studies — 20 years of groundbreaking 
work yields Nobel prize and basis for understanding gene transcrip- 
tion”. Be there or be square! [^Who could resist a talk with a catching title 
like ...”] (Helsingin Sanomat Nyt, March 2006) 


Wedding announcements make use of onventionalized expressions in English or 
Anglo-American cultural scripts by which young couples can evoke appropriate 
images of parallel well-known love stories, as in (5), from the title of a song from a 
well-known romantic film, An Officer and a Gentleman (1982). 


(5) Up where we belong (Helsingin Sanomat, Oct. 2006) 


Similar use of prefabricated life-script phrases can also be found in sources such as 
interviews with celebrities in a range of other print publications, including various 
traditional quality magazines. (6) is an extract from a feature introducing a young 
film actor, and (7) from a portrait of a familiar TV face. 


(6) Se elokuva oli Helmiä ja sikoja ja se muutti kaiken. A Star Was Born. 
[That film was Pearls and Pigs and it changed everything ...’] (Anna, May 
2004) 


(7) Olen edelleen siitä ohjelmasta kauhean ylpeä. Se oli once in a lifetime. 
[T am still terribly proud of that programme. It was ...'] (Helsingin Sanomat 
Nyt, Feb. 2005) 


In general, interviews are a fecund source of examples of English in the print 
media. Given the nature of interviews, the phrases that occur are often typical 
of spoken language (see examples 8-10). Another common type is formed by ex- 
pressions that are in some way related to the interviewee's profession (11). In (12), 
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the interviewee has told a joke that would not even work in Finnish translation — 
the assumption is that Finnish readers are able to understand the implications of the 
original English sentence. 


(8) Turha luulla, että nykyään voisi tavata kaikki äänestäjät face to face. ['It 
is unimaginable that you could meet all voters face to face’] (Politician, 
Helsingin Sanomat, January 2008) 


(9) “Very good, very good", Stubb kiittelee [... ‘Stubb thanks'] (Politician, 
Helsingin Sanomat, April 2008) 


(10) ... jos kerran Marimekko on niin rakas teille, suomalaiset, niin show it [‘... if 
Marimekko is so dear to you, Finns, show it'] (Director, Helsingin Sanomat, 
March 2008) 


(11) NN:n motto “A goal is a dream with a deadline" on vienyt pitkälle [‘NN’s 
slogan ... has taken her far'] (Scientist, Public Health Care, Nov. 2006) 


(12) Viinidieetti toimii hyvin. I have lost three days. [/The wine diet works 
well ...'] (Retired TV news presenter, Helsingin Sanomat Nyt, May 2005) 


Even long quotations in interviews or reports are not necessarily translated. The 
example in (13) is a case in point. It comes from an article on an art exhibition at 
Kiasma, a museum of contemporary art in Helsinki. The article cites written feed- 
back from people who had visited the exhibition, and the English comment of a 
foreign visitor appears without any translation. 


(13) Osa on innoissaan: “Lamppu välähti useammankin kerran.” “It’s amazing 
to be here and to understand a little bit more about your culture and 
your people!” [‘Some are thrilled: “The light flashed several times." ...’] 
(Helsingin Sanomat Nyt, Feb. 2005) 


Similar examples of English words and phrases are also found in newspaper col- 
umns (14 and 15), often written in an informal style resembling spoken language 
and driven by the columnist’s personality. 


(14) Elämä on kiinni susta ittestäs! Hanki itsellesi oma personal trainer! [‘Life is 
up to you! Get yourself a personal trainer" ] (Anna, April 2004) 


(15) Arvoisa tuleva kulttuuriministeri Ilkka Kanerva, Marja Tiura tai whoever 
alueellinen kiintiöhenkilö ... [“Your Excellency, our future minister of cul- 
ture ... or whoever happens to be the person in office by regional quota 
requirements’ ] (Helsingin Sanomat, April 2007) 


At the same time, a new style is developing in newspaper columns, with entire 
English utterances mixed with Finnish to create a colloquial style. Example (16) is 
taken from a free morning paper in the Helsinki region. The column is about flirt- 
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ing, and the author uses English to sum up and provide a climax. Similarly, (17), 
from Helsingin Sanomat Nyt, occurs in a column about metaphors used when talk- 
ing about sex. The paragraphs end with short questions in Finnish and answers in 
English. 


(16) ... katso mihin suupielet osoittavat. Jos ne näyttävät alaspäin, forget it! 
[ook at the corners of their mouth. If they turn downwards, forget it!’] ... 
mikään ei korvaa uusia arkisia ihnmiskontakteja. Thanks Eila! [‘nothing 
replaces new everyday contacts with people. Thanks Eila!’] (Uutislehti 100, 
Sept. 2005) 


(17) Kuumaa? No ei, beibi. [‘Hot? Well no, baby.’] 
Seksikästä? Don't think so. [‘Sexy? Don't think sol 


Kiihottavaa? No-ot. [‘Arousing? No-ot.’] (Helsingin Sanomat Nyt, Nov. 
2006) 


In the above examples, English phrases are positioned at the end of paragraphs. 
They provide a climax, summing up, condensing and typifying, or mark a change 
of footing, indicating a closing or a contrast or adding a jocular tone. But English 
has also found its way into headlines, particularly in tabloids, where examples (18) 
and (19) were found. 


(18) Paavo Lipponen suuttui: SO LONG! [‘PL got angry ...’] (Ilta-Sanomat, 
August 2008) 


(19) Kirjailija Juha Vuorisen neuvo Tero Pitkämäelle: Think big! [‘Novelist JV’s 
advice to TP ..."] (Ilta-Sanomat, August 2008) 


5. Conclusion 


The overview provided in this chapter shows that English is an integral part of 
intranational public discourse in non-Anglophone Europe. It is visible in the 
linguistic landscape, heard and seen in the media, and used in a wide range of com- 
municative contexts in public and private domains for a variety of purposes. The 
attitudes towards it vary. Conflict talk and critical attitudes towards English receive 
frequent exposure in the media, for example in letters-to-the-editor and in expert 
views, particularly concerning the danger of register loss in other languages, such 
as scientific writing. In some countries, measures have been taken to protect the 
local languages against English. One of the most extreme examples of official lan- 
guage planning is France, where the so-called Toubon Law was introduced in 1994 
to restrict the use of English in the media. On the other hand, surveys on ordinary 
European citizens' attitudes towards English indicate that it is in fact seen in a 
positive and pragmatic light. These attitudes are clearly reflected in the Eurobaro- 
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meter survey on Europeans and their Languages (2006), where English was as- 
sessed from a personal point of view as being the most useful language to know by 
an average of 68 per cent of Europeans. In comparison, 25 per cent considered 
French as the most useful language, and 22 per cent German. Future trends for 
English in Europe are perhaps implied in the figures indicating Europeans’ assess- 
ment of languages that children should learn: an average of 77 per cent of Euro- 
peans thought that children should learn English, while 33 per cent favoured 
French and 28 per cent German (p. 64). The national surveys in the seven Nordic 
speech communities (Kristiansen 2005), Denmark (Preisler 1999) and Finland 
(Leppänen et al. forthcoming) also provide evidence of mainly positive or neutral 
attitudes. For example, despite the strong presence of English in Finland, 80 per 
cent of the respondents in the national survey did not consider English as posing a 
threat to Finnish language or culture. 

To date, there is no systematic comparison of the uses and functions of English 
covering all the various regional, social and situational contexts where English is 
being used in intranational communication across the whole of Europe, but, as we 
hope to have shown in this overview, the research that has accumulated already 
provides considerable insight into the matter. For future directions in research, sev- 
eral suggestions have been made (see Berns 2005). Some emphasize the import- 
ance of research on youth language as a means of tapping into the ways in which 
English is used in identity work to negotiate meanings and a sense of belonging. 
New media environments are also seen as an important site for future research, and 
particular emphasis is placed on interactive social media. Some researchers under- 
line the need to study the reception of English and attitudes towards it in the 
diverse speech communities (Gerritsen et al. 2007, Onysko 2007, Leppänen et al. 
forthcoming). The documentation of the new emerging varieties of English is one 
of the priorities in research on English linguistics. At the same time, research on 
the linguistic effects of the spread of English on other languages is a high priority. 
As Kristiansen (2005: 155) reminds us, “maybe the most important of new lin- 
guistic contact zones in the world today develop between ‘local’ languages and the 
*global' English language." The consequences of language contact on such an un- 
precedented massive scale certainly deserve attention. 


See also the following chapters in this volume: 32 by House, 34 by Wodak and 
Krzyzanowski and 44 by Wright. 
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34 Language in political institutions 
of multilingual states and the European Union 


Ruth Wodak and Michal Krzyzanowski 


1. Introduction: Multilingualism in Europe 

2. European Institutions and Languages 

3. Multilingualism in EU Documents 

4. Members of European Parliament (MEPs) and Their Perspectives on Multilingualism 
3. Citizenship and Multilingualism 

6. Concluding Remarks 

1. Introduction: Multilingualism in Europe 


In this chapter, we identify and analyze processes of identity construction within 
Europe and at its boundaries, while focusing primarily on the many (also contra- 
dictory) functions of multilingualism for processes of identification. In the con- 
struction of the “New Europe”, there are many complex issues to be coped with: 
Europe consists of 27 nation-states with their respective histories, different cul- 
tures, different languages, different political, national, regional and local interests 
and traditional ideologies, different economic concepts, and so forth. Among the 
EU member states, some belong to the Schengen zone and to the Euro community, 
others do not; some have supported certain treaties, others have not; some have co- 
lonial pasts, others do not; some have been among the losers of both world wars, 
others have not. 

The central issues for politicians, who have been dealing with these aspects of 
European identity formation, involve the re-organization, the legitimization and 
the representation of this enlarged Europe (Triandafyllidou et al. 2009). National 
identities are defined and constructed mainly by the respective common pasts, 
presents and futures, by narratives and collective memories that are constantly res- 
haped and negotiated (see Wodak et al. 2009; Heer et al. 2008; Krzyzanowski and 
Oberhuber 2007). Hence, the official “Europe” has to find a “new narrative’, a new 
perspective or vision, perhaps some common ideologies, or even a utopia in which 
at least some European citizens would identify with and believe in (Krzyzanowski 
2010, Wodak and Weiss 2007). 

When studying multilingualism the work of the French sociologist Pierre 
Bourdieu, his (metaphorical) concepts market and capital to describe elements of 
linguistic interactions are salient (see Unger 2006). In Bourdieu’s view, a linguistic 
market exists “whenever someone produces an utterance for receivers capable of 
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assessing it, evaluating it and setting a price on it" (Bourdieu 1993: 78). The value 
of an utterance is always dependent on the immediate context. However, someone 
who is very competent at using a feature highly valued in that particular market 
could be said to hold capital in that market (Bourdieu 1993: 80). Bourdieu states 
that those who hold the most capital need only speak to receive linguistic profit, ir- 
respective of what they say (Bourdieu 1993: 80). 

Bourdieu's theory serves well when investigating the different prestige and 
symbolic capital of various languages in EU organisations. This capital has devel- 
oped historically and synchronically, depending on the language ideologies con- 
nected to specific languages (see Wodak and Wright 2006), to the number of speak- 
ers, and to the prestige and position of the country where this language is spoken. 
However, the specific symbolic capital, of course, depends on the specific com- 
municative context. Thus, there is clearly a case to be made for the existence of 
more hegemonic and less hegemonic languages. Taking this argument further, we 
propose to speak of a “hegemonic multilingualism” (Krzyzanowski and Wodak 
2007) where only a few languages in the EU are regarded as salient such as English 
and French, and to a lesser extent German and Spanish. 

The equal status of all national languages as official and as working languages 
within the EU has been repeatedly discussed since the founding treaties. In the 
same way, European multilingualism has been seen as an essential component in 
the future construction of an official European identity. At least the knowledge of 
three languages (tri-lingualism) has frequently been declared an essential compo- 
nent of European politics by various European authorities in declarations of politi- 
cal intent on matters of language, education and pedagogy (e.g. Article 2 of the 
European Cultural Convention of 19* December 1954, “Recommendation 814 on 
Modern Languages in Europe" of the European Council on 5th October 1977, the 
OSCE final document of 1st August 1975, cf. Schróder 1992). Finally, in the 
treaties of Maastricht and Amsterdam, the EU again committed itself to European 
multilingualism, which was echoed by the European Council of Ministers’ “Rec- 
ommendation concernant les langues modernes 98/6". 

Multilingualism in Europe is, however, seen by many as overly elaborate, cum- 
bersome, and as a competitive disadvantage (especially in particular areas such as 
science), in comparison, for example, to the USA (Ammon 1994). Demands have 
been repeatedly voiced for a common, dominant, European language (Zimmer 
1997), or at least a single internal working language within the EU. International 
developments in language policy may support such demands, as there is mean- 
while a tendency towards a degree of hegemony experienced never before. Such 
arguments are counterweighted by recent linguistic differentiation that has led to 
the emergence of “new” languages, as it were. 

Haarmann (1993: 53-55) lists 76 autochthonous languages in Europe. Euro- 
pean languages belong to the most important languages in the world: 350 million 
people speak English, 250 million Spanish, 150 million Russian, 100 million Ger- 
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man and 70 million French; meanwhile over 1 billion speak Chinese (Mandarin 
and/or Cantonese). About 1.6 billion people speak English as second or foreign 
language, 50 million German, 35 million French, and 30 million Spanish (de Cillia 
2003: 232-233). English is the language of bureaucracy in 60 states (French in 30, 
Spanish in 20, German in 8), English is the official language in 16 multinational or- 
ganizations, and French is spoken in 12. From an economic perspective (languages 
spoken in the business world), English ranks first ahead of Japanese, Chinese and 
German. In the sciences, English is clearly leading with 90.7 % of publications in 
the natural sciences written in English (Ammon 1998). This implies a strong he- 
gemony of a few languages, especially English, in our globalised world. This also 
implies that native speakers of English need to know very few foreign languages, if 
at all: 66 % of the British speak only English! 

For the purposes of this chapter, let us briefly review the concept of "language 
policies", which is clearly related to the status of multilingualism in any given so- 
ciety: Following Herbert Christ (1995: 75), we view language policy as: “every 
public influence on the communication radius of languages, the sum of those “top- 
down" and “bottom-up” political initiatives through which a particular language or 
languages is/are supported in their public validity, their functionality and their dis- 
semination. Like all policies it is subject to conflict and must regularly be re-or- 
dered through constant discussion and debate." 


2: European Institutions and Languages 


The language policies within EU institutions derive their foundations from Regu- 
lation No. 1 of the European Council (1958), which stipulates that each EC 
member state brings into the European community its own official language. The 
official languages of the EC/EU (cf. below) are also their working languages which 
means that — unlike for example in the UN (cf. Phillipson 2003; Nic Craith 2006) — 
the EU should not differentiate between official/working languages, and all official 
languages should function equally in the everyday practices of the EU-institutional 
bodies (cf. below). 

Currently, there are 23 official languages of the European Union (cf. Table 1), 
with practically each EU enlargement to date implying new ‘official and working’ 
languages (cf. Table 2) as well as debates about the growing linguistic repertoire of 
the EU institutions. Such debates appeared already in 1973 (with the British, Da- 
nish and Irish accession to the EC) when it was discussed — eventually to the det- 
riment of Irish — whether the latter should not become an official language because 
English is also an official language of Ireland (NB: Irish eventually did become 
an official EU-language in 2008). 1973 there was also a first (so-called ‘Danish’) 
proposal on limiting the number of official languages (cf. van Els 2005). Later 
enlargements, particularly the *grand' one by 10 Eastern- and Southern-European 
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Table 1: 


Ruth Wodak and Michat Krzyzanowski 


Official Languages of the EU (as of 2008) 


The Official Languages of the EU 


IN RD 


Bulgarian 13. Italian 
Czech 14. Latvian 
Danish 15. Lithuanian 
Dutch 16. Maltese 
English 17. Polish 
Estonian 18. Portuguese 
Finnish 19. Romanian 
French 20. Slovak 
German 21. Slovenian 
Greek 22. Spanish 
Hungarian 23. Swedish 
Irish 


countries (2004) triggered debates whether (1) each of the acceding states should 
bring their official languages into the EU - providing huge differences in the 
numbers of their speakers (e.g. Polish vs. Maltese) — and (ii) whether the *new' lan- 
guages should have the same rights as enjoyed by the ‘old’ EU member states. 


Table 2: EU Enlargements and Official Languages (1958-2007) 

Year Acceding States Incoming Official Languages 

1958 Belgium, France, Germany, Italy, Dutch, Italian, German, French 
Luxembourg, the Netherlands 

1973 Denmark, Ireland, UK Danish, English 

1981 Greece Greek 

1986 Portugal, Spain Portuguese, Spanish 

1995 Austria, Finland, Sweden Finnish, Swedish 

2004 Cyprus, Czech Republic, Estonia, Hun- Czech, Estonian, Hungarian, Latvian, 
gary, Latvia, Lithuania, Malta, Poland, Lithuanian, Maltese, Polish, Slovak, 
Slovakia, Slovenia Slovene 

2007 Bulgaria, Romania Bulgarian, Irish, Romanian 


Whereas the EC Regulation No. 1 mentioned above provides a general principle on 
official/working languages for the entire EU-institutional organism, it also allows 
for the institution-specific interpretation of that rule, as stipulated by the Regu- 
lation: 


Article 6: The institutions of the Community may stipulate in their rules of procedure 
which of the languages are to be used in specific cases. 

Article 7: The languages to be used in the proceedings of the Court of Justice shall be 
laid down in its rules of procedure. (European Council, 1958). 
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Thus, depending on their definition of 'specific cases' and their own needs (incl. 
political goals) EU institutions mould their internal rules of everyday functioning 
(‘rules of procedure’ or their implementation instruments such as ‘codes of con- 
duct’) and thus their own linguistic repertoires: 


e The Council of the European Union (EC) - the assembly of representatives of 
EU member states — operates the fullest possible language policy. Based di- 
rectly on EC Regulation No. 1 (above), it uses all official languages of the EU 
both in all official meetings and in all official documents prepared for those 
meetings. It must be taken into consideration that, allowing for its closed- 
door organisational culture, little is officially known about language regu- 
lations as well as de-facto language practices in the smaller-scale meetings of 
the EC — though it is traditional that French and English are used in those 
meetings. 


e The European Parliament (EP) — the only directly elected institutional body of 
the EU - stipulates in its rules of procedure (Rule 138: Languages) that “all 
documents of the Parliament shall be drawn up in the official languages" (Euro- 
pean Parliament 2004: 66) and that, in the plenary, all members of the EP have 
the right to speak and be addressed in all official languages. In smaller-scale 
meetings (committees, delegations) only "official languages used and re- 
quested by the members and substitutes of that committee or delegation" 
(European Parliament 2004: 66) are used. The 2004 Rules of Procedure of the 
EP (specifically Rule 139) also allowed for a transitional period — until the end 
of 2006 - for the enactment of full translation services for the official languages 
of the EU member states acceding in 2004. However, as shown by the Code of 
Conduct on Multilingualism (European Parliament 2006), the recent enlarge- 
ment added up to the complexity of language regime in the EP and forced the 
latter to introduce new measures and even further differentiation between dif- 
ferent types of meetings (according to their importance, their location in-house 
or externally, etc.) in order to cater for the growing number of official lan- 
guages. (Cf. also Gazzola 2006 for a closer look at the language policies in the 
EP). 


e The European Commission (CEC) - the executive branch of the Buropean 
Union - officially operates on a more or less similar basis to that of the EC. 
However, the rather ambiguous term ‘authentic languages’ (European Com- 
mission 2000) - specific for the CEC Rules of Procedure — opens up the way 
for a more or less unclear procedure of selection of different languages used 
in different types of communication (written/oral) addressed to different 
types of audiences. According to the Rules of Procedure “‘authentic language 
or languages' means the official languages of the Communities in the case 
of instruments of general application and the language or languages of those 
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to whom they are addressed in other cases" (European Commission 2000: 
Article 18). Hence, the CEC language regime is rather fuzzy and almost en- 
tirely dependent on the interpretation of what constitutes ‘instruments of gen- 
eral application’ and what does not. Whereas the former definitely applies 
to all official written documents, it is known that, in its oral communication 
(particularly at semi-official level) members and officials in the Commission 
traditionally use English, French and German. Among those languages, 
French used to be traditionally the leading one, though recently (and in par- 
ticular with the arrival of the 2004 Enlargement states), English has clearly 
been gaining in importance. The narrowing of the linguistic scope of the 
CEC could also be observed in other contexts — as was the case with the se- 
lection of English as the official vehicle of communication between the Com- 
mission and applicant states prior to the 2004 EU Enlargement (cf. Nic Craith 
2006). 


Whereas the EU-internal language policies and practices have been characterised 
by multiple tensions in recent years, such as those between ‘old’ and ‘new’ or be- 
tween ‘big’ and ‘small’ EU languages, the main tension to date remains that be- 
tween ‘official’ and ‘working’ languages (cf. Ammon 2006). While the former and 
the latter should not be differentiated at all (as stipulated in EC Regulation No. 1, 
cf. above), the reality illustrates, particularly in recent years, that EU institutions 
have progressively moved towards narrowing their linguistic repertoires and used 
a limited set of ‘traditional working languages’ (mainly English, French or Ger- 
man), in a process which we define as ‘hegemonic multilingualism’ (cf. Krzyza- 
nowski and Wodak 2007). The latter can be viewed as a process in which a set of 
selected working languages of the EU is given preference over other EU working 
languages, thus constituting a set of “core languages’, which are ideologically pre- 
ferred (or de facto more frequently used). This phenomenon implies salient conse- 
quences for democratic participation and representation: the hegemonic multilin- 
gualism prevents the development of a public-wide image of the EU institutions as 
truly multilingual, and hence diverse and inclusive. Thus, there is little identifica- 
tion with the Union’s institutions: they are not perceived as reflecting the growing 
linguistic and cultural diversity of a wider European space (cf. Krzyzanowski 2009 
and 2010; Wodak 2009). 
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3. Multilingualism in EU Documents 


Below, we present a brief analysis of a corpus of Key EU Documents on Multilin- 
gualism 1997—2007 (cf. Appendix L for details). The collected corpus of 22 docu- 
ments adds up to 366 pages in total, with the documents spanning between 2 and 73 
pages, and an average of ca. 8000 words in length. The corpus reflects the vibrant 
policy developments in that period (unlike before 1997). All documents were ob- 
tained in electronic form by means of extensive web-search, mainly in data bases 
of the European Union such as the the Official Journal of the European Commu- 
nities (http://eur-lex.europa.eu). 

Although the present set of documents also relates to the broader context of de- 
velopments in the EU (EU Constitutional and institutional reform, EU Enlarge- 
ment 2004 and 2007, etc.), our context analysis illustrates that the 2000 EU Lisbon 
Strategy constitutes the major factor which triggered the renewed interest of the 
EU in issues of multilingualism!. The 2000 Lisbon Strategy stipulated that the EU 
has “set itself a new strategic goal for the next decade: to become the most com- 
petitive and dynamic knowledge-based economy in the world capable of sustain- 
able economic growth with more and better jobs and greater social cohesion" (Lis- 
bon Presidency Conclusions, 2000, pt. 5, original emphasis). The 2000 Lisbon 
Strategy constitutes a clear follow-up to the economic concerns well known from 
the pre-EU traditions of the European Communities. As the Lisbon Strategy 
clearly listed (foreign) languages among the basic skills which, sustained through 
life-long learning, are to foster the EU's new goals, language learning, multilin- 
gualism and linguistic diversity of Europe/EU became central on the Union's 
agenda. In what followed, we have encountered manifold EU-initiated activities, 
action plans, declarations, resolutions, reports or surveys dealing with Europe's 
multilingualism: for example, the grand-scale set of actions of the ‘2001 European 
Year of Languages’, through the European Commission's Action Plan 2004—2006 
on ‘Promoting Language Learning and Linguistic Diversity’ (2003), or its ‘New 
Framework Strategy for Multilingualism' (2005). 

Summarised in Table 3, the structural analysis of our corpus supports the re- 
sults of our contextual analysis. Firstly, our analysis of different genres (cf. Table 3) 
points to the fact that the majority of the EU documents relating to multilingualism 
and language were programmatic in character and aimed at diverse forms of im- 
plementation of actions governed by the Lisbon strategy. Hence, the most popular 
genre was that of resolutions (4 in total) which, issued mainly by the EU Council, 
always aimed at (1) emphasising the vast number of actions undertaken by the EU 
towards promoting language learning and linguistic diversity, and (ii) stipulating 
further actions to be undertaken (by either the EU institutions — specifically the 
Commission as the executive branch — or by the EU member-states) when imple- 
menting Lisbon. 
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Table 3: Genres of the Key EU-Documents on Multilingualism (1997-2007) 


Genres Institutions2 Number 
Speeches/Statements CEC 3 
Codes of Conduct / Rules of Procedure EP 1 
Survey Results / Summaries Eurobarometer, CEC 2 
Resolutions EC, EC/EP 4 
(with/without Recommendations) 

Presidency Conclusions EC 2 
Decisions EP/EC 1 
Community Action Plans CEC 

(incl. Framework Strategies) 

Recommendations EP/EC 1 
Reports / Interim Reports CEC, EP 

(with/without Recommendations) 

Other? CEC 2 
TOTAL 22 


Other genres included (Community) Action Plans and Framework Strategies (3 in 
total), which not only called for certain actions (like the resolutions) but also stipu- 
lated concrete areas and types of activities. Those practice-related genres were 
supported by diverse (interim) reports (with/without recommendations, 3 in total), 
which were specific analyses either issued in the course of preparing diverse policy 
documents (such as Action Plans) or helped legitimise their implementation and 
prove their effectiveness. Similarly, the genres of speeches/statements (also 3 in 
total) were used to promote EU's language- and multilingualism-related actions 
(although it must be emphasised that speeches/statements become particularly sa- 
lient since 2007, i.e. with the arrival of the Commission's portfolio on Multilin- 
gualism - the actions and results of which are still being developed). 

Like the analysis of genres, our analysis of documents according to the institu- 
tions in which the former originated illustrates their highly action-oriented (or Lis- 
bon-implementation motivated) character: Half of the analysed documents (11 in 
total) were issued by the European Commission as the executive branch of the EU- 
institutions, which is able to directly implement its regulations. This finding proves 
that the political relevance of language-related issues has been growing. Moreover, 
the other two major EU-institutions (i.e. the Council and the European Parliament) 
were also very active in issuing multilingualism-related documents (4 documents 
each, with 2 further documents issued jointly by the EC and EP). 

Finally, our analysis of the scope of the collected EU documents illustrates that 
the documents concern diverse conceptions of, and actions oriented towards, intro- 
ducing, promoting, and fostering multilingualism in the EU-ropean space rather 
than in the EU-institutional realm. Hence, the majority of the analysed documents 
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Table 4: Recurrent Topics in the Key EU Documents on Multilingualism (1997-2007) 


Topic Contents No. Docs 

Topic 1 Teaching and Learning of EU languages 7 
(or of foreign languages in the EU, incl. EU-funded actions) 

Topic2 Lisbon strategy and EU as a a Knowledge Based Economy 2 
(incl. language-related actions) 

Topic 3 2001 European year of languages 1 
(incl. planning / results / implementation) 

Topic 4 ` Europeans (EU) and languages 2 

Topic 5 Minority, regional and lesser-used languages in the EU 3 

Topic 6 Language regulations in EU institutions 1 

Topic 7 Multilingualism in the EU (incl. learning/teaching, multilingual 6 
society and economy) and pro-multilingualism EU-actions 
(incl. life-long learning) 

TOTAL 22 


(9 in total) were ‘External’ in their scope, i.e. they concerned actions to be imple- 
mented by the EU member-states. Equally popular (8 in total) were the “External/ 
Internal’ documents which described actions to be undertaken by the member 
states (in the ‘External’ part of the remit) which were supposed to be monitored)by 
the EU-institutions, i.e. mainly by the Commission (in the ‘Internal’ strand of the 
remit). Those findings prove that the interest in language issues by the EU-institu- 
tions is strongly motivated by the Lisbon agenda which are directed almost solely 
on the member states rather than on the EU-internal institutional areas. 

In Table 4, we summarise the results of our analysis of key language- and 
multilingualism-related macro-topics elaborated in the EU documents in the last 
decade (1997-2007). 

The results of this analysis support our other findings on the predominance of 
the EU Lisbon Strategy over the Union's interest in issues of languages, multilin- 
gualism and linguistic diversity. The four most frequently represented macro- 
topics (Topics 1, 2, 3 and 7 — in 16 of 22 documents) directly or indirectly refer to 
Lisbon. Interestingly, while the latter is explicitly addressed in only one document 
(i.e. the 2001 European Year of Languages), the Lisbon Strategy related to issues 
of language and multilingualism (Topics 1, 7) is discussed in 13 documents. 
Hence, Lisbon seems omnipresent in the analysed texts, while it is rarely inscribed 
explicitly and tends to be operationalised via its instrumental policies and policy- 
areas, such as language learning/teaching, life-long learning, etc. 

Moreover, as the use of Topic 5 suggests, the EU has recently revived its inter- 
est in minority and lesser-used languages, which were rarely in the Union's scope 
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of activity before the early 2000s. This move to 'colonise' the policy areas specific 
for CoE stems from the political motivation of the EU: to collect all language-re- 
lated issues within its areas of interest and activity. The latter may, however, prove 
detrimental to the CoE, which may lose the last key policy areas so far unique for 
that organisation. 

Finally, it must be stressed that, mainly in the documents from 2006-2007, 
Topic 7 points to the activity of the newly-established EU commissioner for multi- 
lingualism. However, so far, the new Commissioner (in the documents originating 
within his portfolio) has done little more than discuss multilingualism in general. 


4. Members of European Parliament (MEPs) 
and Their Perspectives on Multilingualism 


The European Parliament (EP) is the only directly elected parliamentary institu- 
tion of the EU. Together with the Council of the EU, it forms the bicameral legis- 
lative branch of the Union's institutions and has been described as one of the most 
powerful legislatures in the world. The Parliament and Council form the highest 
legislative body within the Union. However their powers as such are limited to the 
competences conferred upon the EC by member states. Hence the institution has 
little control over policy areas held by the states and within the other two of the 
three pillars of the EU. The Parliament is composed of 785 MEPs, who serve the 
second largest democratic electorate in the world (after India) and the largest trans- 
national democratic electorate in the world (342 million eligible voters in 2004). 
The MEPs are elected every 5 years since 1979 by universal adult suffrage and sit 
according to political allegiance; about a third are women. MEPs in Parliament be- 
long to seven different parliamentary groups, including over thirty non-attached 
members known as non-inscrits. The two largest groups are the European People's 
Party — European Democrats (EPP-ED) and the Party of European Socialists 
(PES). 

The method which has slowly become the dominant procedure (about three- 
quarters of policy areas) is the Co-Decision Procedure, where powers are essen- 
tially equal between Parliament and Council. Co-decision provides an equal foot- 
ing between the two bodies. Under the procedure, the Commission presents a pro- 
posal to Parliament and the Council. They then send amendments to the Council, 
which can either adopt the text with those amendments or send back a ‘common 
position’. That proposal may either be approved or further amendments may be 
tabled by the Parliament. If the Council does not approve these, then a ‘Concili- 
ation Committee’ is formed. The Committee is composed of the Council members 
plus an equal number of MEPs who seek to agree a common position. Once a posi- 
tion is agreed, it has to be approved by Parliament, again by an absolute majority 
(see Wodak 2009, for more details). 
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All 14 MEPs which we interviewed in 1997 emphasize that they enjoy their job 
even though it is stressful and that multilingualism does not cause them any diffi- 
culties; quite to the contrary, most MEPs welcome the diversity associated with 
multilingualism (Wodak 2009: 90). This was surprising for us as interviewers be- 
cause we often had the impression that misunderstandings of many kinds (arising 
from linguistic diversity) occurred on a regular basis. One MEP summarizes the 
commonly held view that multilingualism is not an obstacle. Of course when a 
speaker uses many mitigation devices, disclaimers, and explicit assertions that 
there is ‘no real problem’, as in the extract below, this might indicate the opposite 
as subtext; however, this interpretation necessarily remains speculative. 

Text 1 

There is no real problem with linguistics which/of course/I mean there is no/is/is a won- 

derful language of course but — terms of its appreciation across the European Union I 


haven't found a single if you like person who hasn't been able to express themselves 
part in and interrupts me when I am trying to speak French or whatever (MEP 11) 


A second MEP describes some of the most acute issues of MEPs' everyday life: 
problems of clarity due to translation mistakes leading to conflicts over the content 
which are actually caused by mistranslations; this extract thus points to major — 
also negative — implications of multilingualism and related translation services 
(Wodak 2009: 93); in this way, the enacted diversity remains problematic in some 
ways, and positive in others: 


Text 2 

Frequently there are difficulties in the parliament itself, because the foreign language 
versions of the text arrive too late, and then you just stand there and only have English 
and French, and generally the Finns and the Swedes suffer the most. They receive/have 
constant problems with their translations, and others likewise, and when/when I am sup- 
posed to work directly with a text as the correspondent or (xxxx), then I want to hold a 
text that is in my mother tongue. And the others feel the same way, because technical- 
ities are nonetheless frequently vital, not, (xxxx) also in these translations every now 
and then severe mistakes pop up, which in turn have a totally different meaning in a 
foreign language, which is why it's always necessary to work in one's mother tongue 
and then check the other version, (xxxx) and this always results in some mistakes. Fre- 
quently the mistakes are just small, but sometimes they're quite significant. [We] argue 
for an hour in the committee, an hour until a colleague comes along and says: man, 
there's a translation error, look here, and then we will realize: my god, a translation error 
(xxxx), well at that point everything is over. It's been this complicated since [...] 
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5. Citizenship and Multilingualism 


Recent research into European citizenship policies (but not specifically the lan- 
guage aspect of them) argues that governments tend to approach immigration as 
either a threat or an opportunity (Baubóck et al. 2006). With regard to language, a 
general trend identified in a first pilot survey conducted by Dray (2006) implies 
that states which encourage immigration (e.g. Romania, Poland, Hungary) will 
place less emphasis on language and assessment than states that perceive immi- 
gration as a problem (e.g. Austria, Finland, France, Germany). Language (and as- 
sessment procedures) may be employed as an instrument of control, with language 
tests being legally required of some but not others. 

At present there is no consensus in Europe on the minimum level of proficiency 
in an acquired language. States are working independently to establish their own 
thresholds, drawing on a range of alternative solutions. For example, in Austria 
and Finland, where language tests are enforced by law, language assessment crite- 
ria correspond to the Common European Framework of Reference for Languages 
(CEFR) (Council of Europe 2001). However, the baseline standards in the two 
countries are different. Furthermore, in some countries there is no formal language 
test. In Greece, for example, the command of Greek is assessed during the “citi- 
zenship" interview. Some examples illustrating the variety of approaches and so- 
cial contexts involved are summarised in Table 5. 

The current situation in Europe is a patchwork of solutions to an array of per- 
ceived "citizenship" problems which show varying degrees of familiarity and 
awareness of language assessment practices. Dray's pilot survey (2006) found no 
information about how language assessment was developed in any of the countries 
where language tests are implemented. 

Academic awareness and activity surrounding the issue of developing lan- 
guage tests for citizenship is continuing to grow. Amongst this work is research on 
the ideologies of language that underpin citizenship legislation (e.g. Milani 2006), 
and the role of language rights in democratic citizenship (Starkey 2002). Whilst 
there has been some policy response to academic work in the area of language tests 
for identity (i.e. the establishment of citizenship by descent) in the form of a set of 
guidelines (proposed by linguists and adopted by a number of international profes- 
sional organisations; McNamara 2005), this test addresses only those claiming 
rights on the basis of their ethnic heritage. Thus, this is only one dimension of the 
way in which language is used to determine citizenship. In addition, the Council of 
Europe has identified, and is currently addressing, the issue of language regarding 
"adult migrants". 
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Table 5: 


Summary of Six Selected Approaches to Language Assessment for Citizenship 
in Europe 


Austria 


Czech 
Republic 


France 


Germany 


The "integration agreement" (2003) obliges applicants for permanent resi- 
dence (except asylum seekers and refugees whose status is regulated under 
different laws) to complete a “Deutsch-Integrations-Kurs” (German Inte- 
gration Course). Those who can prove their knowledge of German are exempt 
from taking the course, but there are no specifications for proving or evaluat- 
ing linguistic ability (De Cillia and Wodak 2006). Applicants are required to 
reach Level A1 of the 6 levels identified in the CEFR within 100 sessions of 45 
minutes. Applicants must pass the test within 18 months of arriving in order to 
have 50 % of their costs paid for by the Austrian government. If the applicant 
continues to fail, does not take the exam or does not attend the course, they can 
be subject to fines and eventually lose their residence permit. This is the first 
time that the Austrian government has given financial support to immigrants 
for learning German. 


A stipulation of the Czech Citizenship Act is that an applicant can "prove 
his or her Czech language proficiency". A pilot project entitled "Selection of 
Skilled Foreign Workers" was launched in 2001 to test the future of a larger 
scale immigration policy aimed at facilitating immigration. Although, in- 
itially, desirable immigrants were considered to be “linguistically and cul- 
turally akin ethnics”, since 2004 the opportunity for “experts” to immigrate to 
Czech Republic has been increasingly extended to nationals of more coun- 
tries. 


Currently there appears to be no established test despite an intention to imple- 
ment a "new certification for migrants" by the end of 2005, which would test 
competence in French after a French course of 200—500 hours. The test was 
to be obligatory for citizenship and for a "carte de residence". In 2005, how- 
ever, further developments heading towards a US-style points-based system of 
immigration was being investigated, in which knowledge of languages would 
be an element. 


Germany's new immigration law (2005) requires a "satisfactory knowledge" 
of German in order to become a German citizen by naturalisation. In the 
absence of language proficiency certificates, authorities can request that the 
applicant take a language test. The exact test procedure is not clear. “Citizen- 
ship" courses, however, are currently under discussion. These are based on the 
US system, and aim to teach and test applicants on the German constitution 
and culture. They implicitly require knowledge of German. Germany also has 
a “law of return", which grants people of German descent (“Aussiedler”) the 
right to settle in Germany, subject to a language test. The purpose of this test is 
to identify applicants as being legitimately German and knowledge of dialect 
is preferred over "correct" or standard German. This test consists of a brief 
series of oral questions to test understanding, vocabulary and “fluency” 
(McNamara 2005). 
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Netherlands Applicants for Dutch citizenship who are resident foreigners in the Nether- 
lands must have lived there for 5 years and must demonstrate a "sufficient 
command" of Dutch. For those outside of the Netherlands, the “civic 
integration examination" requires a "fundamental knowledge of the Dutch 
language and Dutch society". It is an oral test held in the applicant's home 
country. There are no courses available but there is an “education pack” 
costing 65 Euros. The test costs 350 Euros. 


Poland Resident foreigners may apply for naturalisation after 5 years in Poland. 
A new Repatriation Act became effective in 2001 restoring citizenship for 
those who had been displaced during communist rule and the Second World 
War. To qualify under the repatriation law a person must demonstrate that 
a parent, grandparent or two great-grandparents had Polish citizenship. There 
is no explicit mention of a language test. 


Concluding Remarks 


Recent global, regional and local developments imply huge challenges for the 
European Union and thus also for multilingual communication, both inside the 
institutions as well as among and with the EU member states. On the one hand, 
hegemonic multilingualism ensures that English (and French and German) remain 
the salient modes of communication. On the other, migrant populations and new 
migrant languages have to be integrated into the everyday lives of member states. 
The focus on the Lisbon Strategy and thus, on economic issues, supports multi- 
lingualism in some ways; in other ways, lesser-used languages might be neglected. 
Thus, in sum, we are confronted with paradox and contradictory tendencies whose 
effects on multilingualism are difficult to predict. 


See also the following chapters in this volume: 29 by Darquennes, 32 by House and 
33 by Pahta and Taavitsainen. 
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Notes 


1 In this context, we also view the Bologna Strategy (cf. Bologna Declaration 1999), aim- 
ing to create a European Higher Education Area by 2010, as one of the key elements 
of operationalisation of the Lisbon provisions. Bologna and Lisbon are overtly linked on, 
e.g. web-page of the European Commission's Education and Culture portfolio, where it is 
emphasised that, within implementation of Bologna, “all across Europe, countries and 
universities are engaged in a process of modernisation. From an EU perspective, these re- 
forms are part of the Lisbon Strategy for Growth and Jobs" (Source: http://ec.europa.eu/ 
education/policies/educ/bologna/bologna en.html, last accessed on May 1%, 2008). For 
the extensive analysis of Bologna strategy, cf. Jessop, Fairclough and Wodak (2008). 

2 Abbreviations used in the tables: EC = Council of the EU, EP = European Parliament, 
CEC = the European Commission, ESC = Economic and Social Committee, CoR = Com- 
mittee of the Regions. 

3 This category includes: European Indicator of Language Competence (2005, Doc. No. 13) 
and Framework for the European Survey of Language Competences (2007, Doc. No. 21). 
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35 Language and the church: 
European perspectives 


John Myhill 

1. Introduction 

2: The Armenian church 

3. The Catholic and Orthodox churches 

4. The Reformation, the decline of Latin, and the development of national languages 
in Western Europe 

5. The decline of Church Slavonic and the development of national languages in 
Eastern Europe 

6. The influence of sacred languages on the development of national languages 

7. Language, religion, and national identity 

1. Introduction 


From the perspective of the linguistic influence of the Church, it is convenient to 
divide Europe into four areas, (1) the Catholic southwest, (2) the Protestant north- 
west, particularly from the 16th century, (3) the Orthodox east, particularly from 
the 9th century, and (4) Armenia, that is, Eastern Anatolia and neighboring areas of 
the Caucasus.* There has been a radical ideological difference between the Cath- 
olic areas, where Latin was understood to be the only ‘universal’ language which 
was sanctioned for religious usage until the 1960s, and the other three areas, where 
national languages could be used in religious functions from much earlier times. 

Prior to the advent of the idea of mass literacy in modern times, sacred writings 
were read by an incomparably greater number of people than was secular literature, 
and thus the influence of the Church on the development of written language was 
far greater than it is today. Secular writings such as The Inferno, The Canterbury 
Tales, and El Cid were only read by a tiny elite, and there was no ideological moti- 
vation supporting the usage of written language outside of this elite. Sacred writ- 
ings, on the other hand, were in principle associated with all Christians, so that the 
usage of written languages in sacred contexts such as Bible translations and lit- 
urgies was in principle to be extended to everyone (although Catholics restricted 
the reading of the Bible in particular). In fact, among Protestants it was naturally 
assumed that people would learn to read by reading the Bible and above all in order 
to read the Bible. This was less true among the Orthodox and the Armenians, be- 
cause although vernacular Bible translations existed, they were not read by a large 
proportion of the people, but even for most of these groups there was a very strong 
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emphasis on the masses participating several times a week in reciting and singing 
vernacular liturgies together in church. Sacred writings thus necessarily played an 
enormous role in the development of European languages. 


2. The Armenian church 


The first nationality in Europe to adopt Christianity — and indeed the first Euro- 
peans who could be called a nation at all — was the Armenians, and the written Ar- 
menian language developed together with the establishment of Christianity as the 
state religion of Armenia at the beginning of the 5th century (see Garsoian 1997). 
This happened particularly in the context of a struggle against the Persian Empire, 
which controlled about 80 % of Armenia at the time. The Persians had traditionally 
given the Armenians a considerable degree of autonomy, but this changed in the 
early part of the 3rd century with the advent of the Sasanian Dynasty, fanatical pro- 
ponents of Zoroastrianism who attempted to force this religion on the Armenians, 
who responded by enthusiastically turning to the new religion Christianity. In this 
particular context, St. Mesrop devised the Armenian alphabet and large numbers of 
religious works — including the Bible and a vernacular liturgy — were translated 
into Armenian, and this written language soon came to be put to all religious 
usages, and indeed all usages in general, in Armenia, replacing Syriac and Greek 
as the main local written language. It is significant to note in this context that the 
Armenian Church follows a religious doctrine, known as monophysitism (associ- 
ated with the idea that Jesus had only one nature), which is completely different 
from that of all other European Christians, and it has therefore been entirely inde- 
pendent of any European Church throughout its history; this meant that the adop- 
tion of a version of Christianity associated with vernacular language usage by the 
Armenians did not really have any institutional implications for other European 
churches. 


3. The Catholic and Orthodox churches 


Aside from the Armenians, Christianity in Europe was at the time associated ex- 
clusively with the church of the Roman/Byzantine Empire (that is, the ancestor of 
the Catholic and Orthodox churches, which split in the 11th century), and prior to 
the 9th century Latin and Greek were the only sanctioned languages of this church, 
notwithstanding sporadic religious usages of other languages (e.g. Ulfila's Gothic 
translation of the Bible in the 4th century). Over the course of time, a number of 
differences in doctrine and policy developed between the Patriarchate of Rome on 
the one hand and the Patriarchates of Constantinople, Antioch, Jerusalem, and 
Alexandria on the other hand, in terms of issues such as clerical marriage, the use 
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of leavened or unleavened bread in the Eucharist, the wording of the liturgy, etc., 
and one of these differences involved the language of religion: Those areas within 
the purview of the see of Rome used Latin, while all other areas used Greek. Ulti- 
mately, because of these differences, these two branches would divide into the 
Catholic and Orthodox Churches, and disputes over language would play a role in 
this; in fact, in the run-up to the final schism of 1054, one of the central points of 
contention was attempts by Rome to enforce the use of a Latin liturgy on the Greek 
community of what is today southern Italy and corresponding attempts by Con- 
stantinople to require the Roman Christians in Constantinople to use Greek in their 
religious observance (Congar 1959). 

Greek and Latin had of course been written, developed, and propagated long 
before they came to be used in specifically Christian contexts, and their recognized 
status as ‘real languages’ was therefore independent of the role which they came to 
have in Christian institutions. But the development of Armenian began a new phase 
of European linguistic history. In the following years, particularly between the 
9th century and the 17th century, the development, acceptance, and propagation of 
additional European languages would to a large extent be tied to the Church. The 
connection between Christianity and the development of written languages, 
through Bible translations and vernacular liturgies, has been most clear for the 
written languages developed during this time whose speakers have remained Prot- 
estant or Orthodox since then, such as English and Danish. But this was also true to 
a lesser extent for some languages whose speakers are mostly or entirely Catholic 
today (e.g. Czech, Slovenian, and Lithuanian), because the Reformation resulted in 
an influential Bible translation, done before the Counterreformation recatholicized 
the population, which was ‘dug up’ in the 19th century and helped to provide a 
basis for legitimizing and constructing the modern language. Even for languages 
whose speakers have been entirely or almost entirely Catholic throughout history 
(e.g. Irish, Spanish, etc.), the Church played an important role in their development, 
in that many of the earlier written usages were devotional in character, even if the 
language was clearly understood to be theologically subservient to Latin. 

The eastern branch of the Roman/Byzantine Church became multilingual even 
before the schism of 1054. This first took the form of the de facto acceptance of 
usage of Georgian alongside Greek for sacred purposes in Georgia dating from the 
5th century, although the Georgian Church remained under the formal authority of 
the Greek Church until the early 11th century; like Armenian, Georgian was devel- 
oped as a written language, with its own alphabet, specifically in the context of the 
process of Christianization. A more far-reaching step, however, was the develop- 
ment and propagation of Church Slavonic in the 9th century. The materials for 
writing Church Slavonic began to be developed in the 860s by a mission to Mora- 
via, led by the monks Cyril and Methodius, which was intended to convince the 
Slavic speakers there to accept the religious authority of Constantinople rather than 
Rome; it is the members of this mission who are popularly credited with inventing 
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the Cyrillic script currently in use in writing the Slavic Orthodox languages and 
some other ex-Soviet languages, although contemporary scholars have concluded 
that what they invented was actually a precursor to Cyrillic script, known today as 
Glagolitic script (see Cubberley 1996). The mission failed to prevent the Mora- 
vians from coming under Roman religious authority, but beginning in the 880s, 
King Boris of Bulgaria, who had recently converted himself and his kingdom to 
Christianity and provisionally accepted Byzantine as opposed to Roman tutelage in 
the religious sphere, instituted a program to use the materials developed by the 
mission to Moravia to propagate a written Slavonic language, in particular a Bible 
translation and a liturgy. Boris' purpose was to establish an autocephalous Bulgar- 
ian National Church which would be Orthodox in terms of general practice and 
ideology but independent of the Byzantine Church, and which would use Slavonic 
rather than Greek as its liturgical language. In the 920s, after repeated military 
clashes between the Bulgarians and the Byzantines, the Byzantines yielded and ac- 
cepted the Bulgarian Church as independent and its patriarch as equal in status to 
its own. Slavonic, like Armenian and Georgian, was thus initially developed as a 
written language in the context of Christianization, although it immediately began 
to be used in non-sacred contexts as well as serving as the official language of the 
Bulgarian state (see Browning 1975). It is important to note that no development 
parallel to the linguistic/political/religious division between the Bulgarians and the 
Byzantines took place in the western branch of the Church — the Holy Roman Em- 
pire, established in 936, gave political preeminence to Germanic speakers in the 
north but religious authority to Romance speakers in the south until its dissolution 
at the beginning of the 19th century. 

The fact that the eastern branch of the Great Church developed a multilingual 
policy so early while the western branch remained monolingual much longer re- 
flected contrasting conceptions of religious authority among eastern and western 
Christians. In the east, it was understood that the five existing patriarchates — of 
Alexandria, Jerusalem, Antioch, Constantinople, and Rome — were equal in stature, 
while in the west, that is, in Rome, it was understood that the Roman Patriarch (who 
came to be known as the Pope) had supreme and undivided authority. Thus for east- 
erners, granting the Georgians and the Bulgarians independent patriarchates with a 
status equal to that of the others could be interpreted as simply applying the tradi- 
tional organizational principle to a new case, and accepting the religious legitimacy 
of Georgian and Slavonic Bibles and liturgies merely added two more officially- 
sanctioned languages to Latin and Greek. This is not to say that the Byzantines did 
not resist these developments for a while, but conceding religious and linguistic in- 
dependence to the Georgians and the Bulgarians did not really represent a radical 
break with the past for them. For the Patriarchs of Rome, however, such a step was 
totally unacceptable — the very idea of a legitimate Christian leader who was com- 
pletely independent of their religious authority was rejected as a matter of prin- 
ciple, and they would only tolerate the historically-motivated union with the East- 
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ern Patriarchates and the associated use of Greek until the mid-11th century, after 
which they accepted only one Patriarch and one language. Thus the doctrinal dis- 
tinction between Rome and the other Patriarchates regarding the relative authority 
of the different sees came to motivate a difference in linguistic ideology — for more 
than 900 years, the Western Church adopted the position that only one language, 
Latin, was appropriate for religious matters, while the Eastern Church accepted the 
idea that different languages could legitimately be used for this purpose, although 
in practice only Greek, Georgian, and Slavonic were used until the 18th century. 

Ultimately four Orthodox national churches formally separated from the By- 
zantine Church before modern times: The Bulgarians in the 10th century, the Geor- 
gians in the 11th century, the Serbs in the 13th century, and the Russians in the 16th 
century (see Myhill 2006: 37-49). It is important to note, however, that although 
differences in the written forms of Church Slavonic used in different places devel- 
oped before long, there was no idea that there were separate written languages as- 
sociated with the Bulgarians, the Serbs, and the Russians — they were understood to 
share the written Slavonic language. A written specifically Russian language only 
developed in the 18th century, and for written Bulgarian and Serbian this did not 
happen until the 19th century. The idea of a distinction between the Orthodox 
Slavic languages therefore did not develop until several hundred years after this 
process had taken place in the Romance languages (turning Latin into Spanish, 
Portuguese, French, Italian, Catalan, etc.). Before modern times, then, Bulgarian, 
Serbian, and Russian nationalities were based not on the idea that these groups 
spoke or wrote different languages but rather upon each group's national churches 
and their political institutions, when they were independent. This does not mean, 
however, that language was only a minor component of the national identities of 
these groups before modern times, because for all three the national struggle for 
institutional religious independence in practice involved overthrowing Greek and 
replacing it with Slavonic rather than competition between the different Slavic- 
speaking groups. This was a particularly important issue for the Bulgarians and 
Serbs, because for a long time each of them were under the control of the Ottoman 
Empire, which placed them under the religious and administrative control of the 
Greeks, and consequently the conflict between Slavonic and Greek as religious 
languages went on until the 19th century. 

In Slavic Orthodox societies, religious texts, including the Bible, were essen- 
tially understood to be the property of the whole society rather than the Church 
alone, and so — unlike in the Catholic West — the masses were allowed and even in 
principle encouraged to read the Bible. This was easier for them because, while 
Church Slavonic did not change as much as the spoken language did and thus 
eventually became a 'frozen' language like Latin, it was nevertheless *frozen' per- 
haps 700—800 years later than Latin was, so that the difference between the spoken 
and the written language was not as great as in the case of Latin and the emerging 
Romance languages. The connection between the religious texts and the people in 
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general in the Orthodox Slavic-speaking areas was further strengthened by the fact 
that almost everyone who used a Slavonic liturgy and Bible spoke Slavic lan- 
guages (the only exception being that Romanians sometimes used it for religious 
purposes as well, although this was much less common than Greek), while in the 
west many Catholics spoke Germanic, Finno-Ugric, or Celtic languages, or 
Basque, which were completely different from Latin. For these reasons, the use of 
Slavonic for religious purposes in the East served to unite the societies in which 
it was used, while the reverse was true of Latin in the West, which served rather 
to unite the institutions of the Church and to separate it from the secular world. It 
should be stressed, though, that there was no particular emphasis upon widespread 
literacy in Slavic Orthodox society comparable to what would appear during the 
Protestant Reformation, and it is therefore not necessarily clear that the literacy 
rate was much higher there than it was in the West at the time. What was deemed to 
be important for the Slavic-speaking masses was not necessarily reading the Bible 
but rather participating in reciting and singing the liturgy in something resembling 
their native language which they could readily understand. 


4. The Reformation, the decline of Latin, 
and the development of national languages in Western Europe 


The Reformation brought a much more radical change in linguistic ideology to 
those areas which it affected. Whereas the ideology of Orthodoxy was inherently 
‘top-down’ and oriented towards the state, Protestantism was in theory ‘bot- 
tom-up' and aimed at the individual, and in terms of linguistic ideology this par- 
ticularly meant that individuals were supposed to read the Bible. The consequence 
of this was that Protestantism supported the development of more distinctive lan- 
guages than did Orthodoxy, particularly those languages whose speakers were 
politically dominated - that is, they did not have their own states — but which were 
clearly distinct from the language of those who dominated them (the practical dif- 
ferences between Orthodoxy and Protestantism in terms of their different effects 
upon language development will be elaborated upon in the course of the following 
discussion). On the other hand, this difference should not be overestimated: while 
it was understood by Protestants that individuals should read the Bible in ‘their 
own language,’ in practice the written language which was used in an area was in 
some cases quite different from the spoken language of the area, particularly in 
what is today northern Germany, Switzerland, Norway, and Scotland — but at least 
it was not Latin and it was in the same general branch of the same family as the 
local spoken language. 

The most obvious short-term linguistic effect of the Reformation was to 
decrease the role of Latin in the states which officially adopted Protestantism — the 
United Provinces of the Netherlands, England, Scotland, Denmark, Sweden, and 
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the Protestant states of the Holy Roman Empire and Switzerland — and to corre- 
spondingly increase the roles of the dominant written languages of these states so 
that they came to be able to function in essentially all functions. Furthermore, *bot- 
tom-up' Protestant ideology meant that this trend extended even to some minority 
languages which were not normally used for administrative functions until modern 
times, provided that (1) they were not directly related to the local politically domi- 
nant language and (2) their speakers had some interest in Protestantism. So for 
example within the territory which Sweden controlled in the 17th century, it was 
legitimate to have Bible translations, liturgy, and sometimes sermons in Finnish, 
Estonian, and Latvian, because these are not Scandinavian languages and thus not 
related to Swedish (although vernacularization took much longer for Estonian and 
Latvian than for Swedish) (see e.g. Bilmanis 1951). The same status was similarly 
accorded to Welsh in Great Britain, and presumably would have been to Irish and 
Scottish Gaelic as well, if their speakers had displayed a more positive attitude to 
Protestantism. The situation of non-Germanic languages associated with Protes- 
tant groups, such as Finnish and Welsh, was thus at the time in principle exactly the 
opposite of that of politically dominant languages of Catholic countries such as 
France and Spain: Finnish and Welsh were understood to be appropriate for relig- 
ious worship but not for administration, while the reverse was true of French and 
Spanish. This is summarized in Table 1: 


Table 1. | Religious and administrative usages of some European languages ca. 16th—18th 
centuries 


Religious usages Administrative usages 


English 
Dutch 
Danish 
Swedish 
Finnish 
Welsh 
Estonian 
Latvian 
French 
Spanish 
Portuguese 
Italian = 


++ ++++++ 


| 
+++ + 


Here it is interesting to contrast Protestant and Orthodox thinking and policy. To 
the Orthodox mindset, languages such as Finnish, Welsh, Estonian, and Latvian 
would have been accorded no status at all in the 16th-18th centuries, because the 
groups speaking these languages did not have their own states at the time. Con- 
versely, Protestant ideology would have called for Bible translations in Romanian 
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and Albanian, two languages whose speakers have been largely or partially Ortho- 
dox but which were basically ignored as religious languages until relatively recently 
because their speakers were always politically dominated by others. This difference 
in ideology is strikingly demonstrated by the fact that the first Romanian Bible 
translation and liturgy were developed by Transylvanian Protestants at the end of 
the 17th century, even though Protestants have never constituted more than a de- 
mographically insignificant minority of Romanians - it was only later that the Ro- 
manian Orthodox tried to put these materials to use in religious contexts, because 
after all the Orthodox Church in principle allowed multilingualism (Mazilu 1998). 

Even in areas in which Protestantism was ultimately limited to a small section 
of the population or eliminated entirely by the Counterreformation, its brief and/or 
limited flowering nevertheless sometimes ultimately had a significant effect. This 
was because Protestantism supported the development of written materials — par- 
ticularly but by no means exclusively Bible translations — which gave the language 
prestige and a historical basis, and these would prove important when language- 
based nationalism spread around Europe in the 19th century, as for example with 
Czech, Slovene, Croatian, and Lithuanian. 

It is important to emphasize, however, that Protestantism did not encourage 
the development of written languages which were in the same branch of the same 
family as the local politically dominant language, and in some cases Protestants 
even borrowed standard languages based upon related dialects spoken in neighbor- 
ing states from which they were politically independent. We can consider, for 
example, the case of written versions of Low German dialects, which had been 
commonly used in the territory which is today northern Germany before the Ref- 
ormation. In the first generation or two of the Reformation these written languages 
were used for religious writings in these areas — but beginning in the mid-16th cen- 
tury, they were replaced by the written version of the radically different Upper 
Saxon dialect of Luther's Bible, and this written form was even adopted in the 17th 
century in Switzerland, which was politically independent (van der Plank 1988). 
In the same way, the written languages corresponding to Scots and Norwegian 
declined precipitously during the Reformation in the face of English and Danish 
respectively — the former in spite of the fact that the crowns of Scotland and Eng- 
land were not united until 1603 and the countries were not politically joined until 
1707. In fact, among the Germanic languages, whose speakers were politically 
dominant in northern Europe, and setting aside the increase in functional domains 
associated with the replacement of Latin in religious functions, it can be said that 
the Reformation was actually of decisive help only for Swedish and Dutch, specifi- 
cally because these two groups established independent states during this time 
period by virtue of successfully rebelling against foreign rule (against the Danes 
and the Spanish respectively). On the other hand, the effect of the Reformation 
upon the development of non-Germanic languages — Celtic, Finno-Ugric, Baltic, 
Slavic, and Romanian — was enormous. 
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While there is no question of the importance of the Reformation to the devel- 
opment of the linguistic landscape of Europe, it is important not to exaggerate this. 
In some respects, Protestant states and Catholic states went through similar devel- 
opments around the same time. In the time around the Reformation, written lan- 
guages associated with Catholic peoples, such as Spanish, French, and Italian, were 
legitimized by the establishment of officially-sanctioned language academies, 
whereas languages associated primarily with Protestants — English, German, 
Dutch, Danish, and Swedish — were similarly legitimized by Bible translations 
which served as the models for linguistic correctness (see Joseph 2004). Fur- 
thermore, during this time, usage of Latin decreased in administrative functions in 
Catholic areas even though the language retained its dominance in the religious do- 
main. And while it is true that Dutch and Swedish survived among Protestant lan- 
guages by virtue of successful revolts and the establishment of political indepen- 
dence, the same was true of Portuguese and to some extent Catalan among Catholic 
languages. The difference between Protestantism and Catholicism was most 
marked in the cases of the languages of out-of-power groups who spoke neither 
Romance nor Germanic languages, as may be seen by comparing the fates of e.g. 
Finnish, Estonian, Latvian, and Welsh with those of e.g. Irish, Breton, and Basque. 

Itis also worth noting in this context that it is certainly not a coincidence from a 
linguistic point of view that the Reformation was incomparably more successful in 
Germanic-speaking territories than in Romance-speaking territories. The same 
proto-nationalist impulses which led the states of Western Europe to begin to de- 
velop and legitimize national languages naturally encouraged Germanic speakers 
to view Latin as a foreign language, while on the other hand Romance speakers 
could understand it to be ancient/sacred/outdated but not foreign. This was doubt- 
less a factor in the radical difference in the attitudes of the Austrian and Spanish 
Habsburgs towards the Reformation; even to Austrian Catholics, Luther's ideas 
did serve the useful purpose of emphasizing their independence from Rome, 
whereas the Spanish Habsburgs preferred to focus their energies on literally trying 
to be more Catholic than the Pope (see Winter 1989). 


5. The decline of Church Slavonic 
and the development of national languages in Eastern Europe 


Particularly because Orthodoxy was associated with the top-down imposition of 
language and because most of the Orthodox peoples were under Ottoman occu- 
pation until the 19th century, Church Slavonic (with minimal local variation) re- 
mained for some time as the only written language for the Orthodox Slavs. New and 
nationally specific written Orthodox Slavic languages — parallel to the Romance 
languages in the West — did not develop until modern times. As the Russians were 
basically the only Orthodox people to escape Ottoman domination, this process nat- 
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urally started with Russian, which began to be developed based upon the spoken 
language of Russia, and thus distinct from Church Slavonic, from the mid-18th cen- 
tury. This was followed by Serbian and Bulgarian in the course of the 19th century, 
as the Serbs and the Bulgarians struggled for independence from the Ottoman Em- 
pire. Thus Church Slavonic, like Latin before it, was progressively restricted to the 
religious sphere. The same ultimately happened in Georgian, Greek, and Arme- 
nian — in the 19th and 20th centuries the written versions of these languages based 
upon ancient spoken forms which had been dominant in all usages for more than a 
millennium came to be replaced in non-religious functions by new written forms 
based upon contemporary vernaculars. 


6. The influence of sacred languages 
on the development of national languages 


It is commonly understood that the influence of religion upon society in general, 
including language, has declined substantially in modern times, and there is un- 
doubtedly a good deal of truth to this. But this point should not be stated too 
strongly, for two reasons in particular. First, even as languages particularly associ- 
ated with religious usages — Latin, Byzantine Greek, Church Slavonic, Classical 
Armenian, and Classical Georgian — disappeared from regular usage, they had a 
powerful influence on the construction of modern national languages. Most ob- 
viously, in almost all cases (Romanian being the only exception among Christian 
groups), people developing modern national languages have written them with the 
same alphabet which was used to write the traditional sacred language of the 
group, with the use of the Latin alphabet in Protestant countries reflecting the his- 
torical fact that Protestantism developed in areas which had been historically Cath- 
olic; thus French and English are written in the Latin alphabet because France and 
English were Catholic at the time these languages began to be written, Bulgarian 
and Russian are written in the Cyrillic alphabet because these group are Orthodox, 
etc. But the influence of traditional sacred languages has been even more extensive 
than this. Hary and Wein 2009 argue that modern national written languages are es- 
sentially the product of a fusion between vernacular usage and traditional sacred 
languages, which have contributed words, expressions, metaphors, and syntactic 
constructions to the modern languages. For example, they identify a number of 
examples of this type in English, of words (absolution, amen, apocalypse, baptism, 
confession, firmament, gospel, grace, Halleluiah, etc.), idioms (a man after his 
own heart, broken-hearted, a cross to bear, fire and brimstone, lion's den, etc.), and 
metaphors (relating apples and knowledge, Jesus to a lamb, wine to Jesus' blood, 
etc.), and they show that the same thing has happened in other modern European 
languages. Similarly, Kortmann 1997 demonstrates that there are different patterns 
of usage of adverbial subordinators in the modern languages of Catholic/Protes- 
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tant/ Western Europe and Orthodox/Eastern Europe and that this is the result of the 
modern languages having inherited the distinct usage patterns of the classical and 
traditional sacred languages of these two areas — Latin in the West and Classical 
Greek and Church Slavonic in the East. By far the greatest single source of this 
type of influence has been Bible translations, and in this respect it is important to 
keep in mind that these translations were incomparably more important in premod- 
ern times, during the formative period of European written languages, than they are 
today. They constituted the staple reading material of those societies which per- 
mitted them and they were understood to be the usage par excellence of the written 
language, so that the material taken from sacred languages and included in Bible 
translations was routinely disseminated to writings in modern languages in general 
(see Hotchkiss and Price 1996, Metzger 2001, Hary and Wein 2009). 


7. Language, religion, and national identity 


The other reason that the decline of the influence of religion in modern society 
should not be stated too strongly is that in some cases it has been clear that even in 
ostensibly secular modern Europe, religious identity has continued to be more im- 
portant than linguistic identity. If we consider the case of countries with popu- 
lations divided between Catholics and Orthodox, we find that in most cases (Bela- 
rusian, Ukrainian, Romanian), people sharing the same spoken language but 
having different religious affiliations have been able to merge into a single lan- 
guage-based nationality. By contrast, in the case of Orthodox and Catholic speak- 
ers of Serbo-Croatian, the political program of creating a Yugoslav state based 
around the Serbo-Croatian-speaking people failed catastrophically and as a conse- 
quence Serbian and Croatian are now considered to be distinct languages, associ- 
ated respectively with Orthodox and Catholics, though they do not differ signifi- 
cantly on the spoken level. Similar variation is found in cases in which speakers of 
the same language are divided between Catholics and Protestants: A common lan- 
guage-based nationality could be constituted for the Hungarians, the Czechs, and 
the Germans, but this was not the case for speakers of Dutch in the territories which 
are today Holland and Belgium, which were combined as the United Netherlands 
in 1815 but divided into the mainly Protestant north and the entirely Catholic south 
in 1830, or for speakers of English, who were unified as the United Kingdom in 
1800 but divided into Protestant Great Britain and Catholic Ireland in 1922. And 
regarding combinations of Muslims and Christians who speak the same language, 
we see that a common language-based nationality could be created between Mus- 
lim and Catholic speakers of Serbo-Croatian in Bosnia, and also for Muslim, Cath- 
olic, and Orthodox speakers of Albanian in Albania — but not for Muslim and 
Orthodox speakers of Serbo-Croatian in Bosnia, as demonstrated by the collapse of 
Yugoslavia and the effective secession of the Serbian Republic from Bosnia. Nor 
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did this combination work for Muslim and Christian speakers of Turkish in Anat- 
olia in the 1910s and the 1920s, where the Turks killed Armenian Christians in the 
genocide of 1915 whether they spoke Turkish or Armenian, and where the Turkish- 
Greek population exchange of 1923 sent 1,300,000 Greek Orthodox out of Anat- 
olia to Greece (or retroactively ratified their expulsion), whether they spoke Greek 
or Turkish, in exchange for 350,000 Muslims who were sent out of Greece to Tur- 
key, whether they spoke Greek or Turkish. These facts are summarized in table 2: 


Table 2. | Language or religion, which is more important to national identity? 


Separate Don't separate 
(that is, religion is more important (that is, language is more important 
than language) than religion) 
Catholic + e United Kingdom divides into Hungarians, Czechs, Germans 
Protestant Ireland (Catholic) and Great 


Britain (Protestant) (1921) 

* United Netherlands divides into 
Belgium (Catholic) and Holland 
(Protestant) in 1830 


Catholic + Croats and Serbs (Yugoslavia in Belarusians, Ukrainians, 

Orthodox 1990s) (both speak Serbo-Croatian) Romanians 

Muslim + e Turks and (Turkish-speaking) Albanians, Bosnian Muslims 

Christian Armenians (Armenian Genocide and (Catholic) Croats (both speak 
of 1915) Serbo-Croatian) 


e Turks and (Turkish-speaking) 
Greeks (population exchange of 
1923) 

* (Orthodox) Serbs and Bosnians 
Muslims (Yugoslavia in 1990s) 
(both speak Serbo-Croatian) 


Many observers have responded to those particular cases in which religious iden- 
tity has proven to be important in modern times by suggesting that religion remains 
a powerful factor in determining political identity in general. This was articulated 
most famously in the “Clash of Civilizations’ thesis, according to which Europe 
and the Middle East are inherently divided into ‘civilizations’ which are essentially 
historically religious, “Western’ (Protestant+Catholic), Orthodox, and Muslim 
(Huntington 1996). However, a more nuanced analysis, which seriously considers 
historical factors, suggests a different situation. In particular, as argued at length in 
Myhill 2006, certain groups in premodern times defined their national identities in 
terms of what can be called a national church (see also Hastings 1997), and in the 
19th and 20th centuries, when language came to be an important marker of politi- 
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cal identity and many new nationalities were constituted and mobilized on the 
basic of language, these premodern national church groups were not interested in 
forming a new national identity based upon language and independent of religious 
affiliation. It has been specifically efforts to combine these premodern national 
church groups with other groups sharing the same spoken language but having a 
different and non-national religious affiliation which have failed; on the other 
hand, when neither group sharing a spoken language had developed a national 
identity based upon a national church in premodern times, then they could be com- 
bined into a single language-based nationality. 

This analysis depends crucially upon the concept of a ‘premodern national 
church.’ I am using this term in this context particularly to refer to a religious af- 
filiation which developed in the context of a struggle for independence from exter- 
nal authority, that is, political independence and religious independence (this is 
discussed in more detail in Myhill 2006, particularly pp. 12-17). This therefore in- 
cludes (1) the Armenians, whose church developed as part of a successful struggle 
for political autonomy within the Persian Empire in the 5th century, (2) the Bul- 
garians, the Georgians, the Serbs, and the Russians, who each established their re- 
ligious independence from the Greek Church at a time when they were politically 
independent of the Byzantine Empire, (3) the Greek-speaking Greek Orthodox 
from the 13th to 15th century, after the Bulgarians, the Georgians, and the Serbs 
had established independent national churches and the Greeks were left alone 
fighting against the Ottoman Turks, and (4) the English, the Dutch, the Swedes, 
and the Danish, all of whom established Protestant state churches independent of 
the Catholic Church in the 16th century. On the other hand, the category ‘premod- 
ern national church' excludes religions which are universal (e.g. Catholicism or 
Islam) or purely doctrinal (that is, without implying any kind of political indepen- 
dence, e.g. German Lutheranism, Hungarian Calvinism, etc.), as well as autocep- 
halous Orthodox Churches established in the 19th and 20th centuries (e.g. the Uk- 
rainian Orthodox Church, the Romanian Orthodox Church, etc.). 

Because the Armenians, the Greeks, the Serbs, the English, and the Dutch had 
established national churches in premodern times, these nationalities could not be 
defined in terms of spoken language to the exclusion of religious affiliation when 
this ideology became popular in modern times. Thus the Turks recognized that the 
loyalties of Turkish-speaking members of the Armenian and Greek churches was 
clearly to their religious group, not their linguistic group, and so these peoples 
were respectively murdered and expelled from Anatolia in the 1910s and the 
1920s; the Serbs recognized that Serbian nationality was inherently Orthodox and 
could not be combined with Catholic and Muslim speakers of Serbo-Croatian in a 
Yugoslav state supposedly tied together by language; and the Dutch and the Eng- 
lish, as a consequence of their struggles to establish and maintain their indepen- 
dence from Catholic powers in the 16th and 17th centuries, viewed their own 
nationalities as inherently Protestant and were thus unable to combine with Cath- 
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olic Belgium and Ireland to make unified states in modern times (all of these de- 
velopments were of course considerably more complicated than can be described 
in the context of this brief review and are recounted in detail in Myhill 2006, es- 
pecially in chapters 4 and 5). On the other hand, when neither of the religious 
groups to be combined had formed a national church in premodern times, then 
when the ideology relating language to national identity was developed in modern 
times, it could effectively tie the different religious groups together. We see this 
with combinations of Catholics and Muslims (e.g. Albanians and Bosnians, and for 
that matter Arabs), combinations of Catholics and Orthodox who had only made an 
autocephalous church in modern times (e.g. Belarusians, Ukrainians, Romanians, 
and Albanians), or combinations of Catholics and Protestants who had not fought 
for political independence (e.g. Germans, Czechs, and Hungarians). All of these 
developments therefore show that religion has continued to play a significant role 
in interacting with linguistic identity even in modern times, although only in spe- 
cific cases which are predictable and explicable in terms of history. 


*I thank Bernd Kortmann and Johan van der Auwera for their helpful comments on 
an earlier draft of this paper. 


See also the following chapters in this volume: 39 by Aschenberg, 43 by Voss and 
44 by Wright. 
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36 Language and the school: 
European perspectives 


Joachim Grzega 


1. The place of languages in European schools 

1.1. Instruction of the national official language(s) of a country 
1:2. Instruction of subnational official and non-official languages 
1.3. Instruction of foreign languages 

2. Teaching concepts at European schools 

2:1; Concepts for teaching native languages 

2.2. Concepts for teaching a foreign language 

2:3. Concepts for teaching a lingua franca 


The teaching of language(s) has been part and parcel of schools in Europe since 
their birth. Apart from the mother tongue, foreign languages have played a central 
role in European school history. However, the languages taught, the concepts used 
and the educational policies pursued have changed over times. The present chapter 
gives an overview of this development (based on various in-depth contributions) 
and characterizes present-day Europe's model of foreign languages at school 
(largely based on reports by the Buropean Commission's information network on 
education in Europe, called Eurydice, and the Council of Europe). Note that in 
this chapter Europe is not used in the geographical or political definition, but in the 
cultural-anthropological definition, which basically covers the central and western 
countries of the European subcontinent. 


1. The place of languages in European schools 


1.1. Instruction of the national official language(s) of a country 


Formal instruction in children’s native language starts to spread in the 11th century 
(for the sake of acquiring competence in reading and writing) and becomes gen- 
erally institutionalized after the mid-18th century (cf. Grzega 2006a: 32). The first 
European school for deaf-mute people is established in Leipzig (Germany) in 
1778. Official languages are, of course, creations of the 19th and 20th centuries. In 
the early 21st century, children are trained in the country’s official language or lan- 
guages from the beginning of formal school education onward. 

Not infrequently, a European country has more than one official language. 
There are different ways of coping with this. In Luxembourg, German and French 
are used as languages of instruction in addition to Luxembourgish: German in pri- 
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mary and lower secondary education, and French in upper secondary education (cf. 
Eurydice 2008: 43). In Ireland, Finland and Malta, schools generally use the re- 
spective official state languages as languages of instruction (cf. Eurydice 2008: 
43). 

For the majority of EU citizens their mother tongue is also the national tongue 
(in 2005 for nearly 100 % of Poles, Portuguese, Greeks, Cypriots, Czechs, Croa- 
tians and Hungarians, at the lower end of the scale, for still over 70 % of Latvians 
[TNS Opinion & Social 2005: 1-2, TNS Opinion & Social 2006: 7]). Neverthe- 
less, due to the high mobility today, some European schools have considerable 
proportions of students whose mother tongue is not the national tongue. European 
nations offer immigrant children special instruction concepts. These measures can 
be classified into two basic models: (1) an integration model, in which immigrant 
children immediately join classes consisting of children of the same age in regular 
education and in which they receive language support on an individual basis dur- 
ing regular school hours; (2) a separation model, in which immigrant children are 
separated from other children for a limited period so that they can obtain special 
linguistic tutoring (sometimes accompanied by the attendance of some regular 
lessons). In 2006/2007 the first type was promoted by Denmark, Ireland, Italy, 
Malta, Cyprus, Austria, Hungary, Portugal, Poland and the United Kingdom 
(Scotland); the second type was promoted by Germany and Romania; the other 
countries used mixed systems (cf. Eurydice 2008: 104, Extra/Gorter 2008). 


1.2. Instruction of subnational official and non-official languages 


The Ordonnance de Villers-Cottérets from 1539, by the French king Francis I, 
probably represents Europe's first language law. Its command to use only (Paris- 
ian) French in all official documents did not only accelerate the death of Latin as an 
official language, but also led to a decline of French dialects, Breton, Occitan and 
other vernaculars in the French kingdom. In other European countries, it was — at 
the latest — the definition of national languages as languages of the state which put 
minority languages at threat. 

The 1992 European Charter for Regional or Minority Languages (initiated by 
the Council of Europe and, by 2010, ratified by Armenia, Austria, Bosnia and 
Herzegovina, Croatia, Cyprus, the Czech Republic, Denmark, Finland, Germany, 
Hungary, Liechtenstein, Luxembourg, Montenegro, the Netherlands, Norway, Po- 
land, Romania, Serbia, Slovakia, Slovenia, Spain, Sweden, Switzerland, Ukraine 
and the UK) strengthened the place of regional languages also in various school 
systems. In 2007 the following regional languages in the sense of the Charter (i.e. 
languages that are official, but not officially supraregional languages in a country) 
were taught as subjects and/or were used as language of instruction in the follow- 
ing countries (cf. Eurydice 2008: 117-120, 51, and the contributions in Extra/ 
Gorter 2008): 
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Basque in Spain 

Catalan in Spain 

Croatian in Austria and Romania 
Czech in Austria and Romania 
Danish in Germany 

French in Italy 

Frisian in the Netherlands 

Galician in Spain 

German in Italy and Romania 
Hungarian in Austria, Romania and Slovenia 
Irish in the UK (Northern Ireland) 
Italian in Slovenia 

Sami in Finland, Sweden and Norway 
Scottish Gaelic in the UK (Scotland) 
Serbian in Romania 

Slovak in Austria and Romania 
Slovene in Austria 

Sorbian in Germany 

Ukrainian in Romania 

Valencian in Spain 

Welsh in the UK (Wales) 


The European Charter for Regional or Minority Languages has also made some 
autochtonous/indigenous minority/regional languages without official status (in 
the respective area) subjects and/or languages of instruction in European school 
systems (cf. Eurydice 2008: 117—120, 51, and the contributions in Extra and Gorter 
2008; data valid for 2007): 


Alsatian in France 

Armenian in Romania 

Basque in France 

Belorussian in Latvia and Poland 
Breton in France 

Bulgarian in Hungary and Romania 
Catalan in France 

Corsican in France 

Creole in France 

Croatian in Hungary and Slovenia 
Estonian in Latvia 

Finnish in Sweden and Norway 
German in Hungary, Poland and Lithuania 
Greek in Hungary and Romania 
Lithuanian in Poland and Latvia 
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Kashubian in Poland 

Occitan in France 

Polish in Hungary, Romania, Latvia, and Lithuania 
Romanian in Hungary 

Romany in Latvia, Hungary, Austria, Poland, Sweden, Slovenia, and Romania 
Russian in Estonia, Latvia, Lithuania, and Romania 
Ruthenian in Poland and Hungary 

Serbian in Hungary and Slovenia 

Slovak in Hungary and Poland 

Slovene in Hungary 

Tahitian in France 

Turkish in Romania 

Ukranian in Latvia, Poland and Romania 

Valencian in France 

Yiddish in Sweden and Latvia 


1.3. Instruction of foreign languages 


In Europe, teaching a foreign language was synonymous with teaching Latin for 
many centuries. In the late Middle Ages, modern European vernaculars started to 
compete with Latin and to gain attraction in foreign language learning. With the 
growing popularity of modern languages, the classical languages eclipsed more 
and more. But still today Latin is offered as an optional language in many Euro- 
pean countries. Many of them also offer Ancient Greek as an option in their cur- 
ricula; Ancient Hebrew is listed as an option in Germany's and Ireland’s curricula 
(cf. Eurydice 2008: 53; data valid for 2006/07). 

The first vernacular languages to enjoy international popularity were French 
and Occitan in the 10th and 11th centuries, but rather on a quite private basis, not in 
institutionalized settings (schools or institutes) (cf. Grzega 2006a: 26); here Latin 
remained the major language for international exchanges. In the 15th and 16th cen- 
turies Italian, which mainly means Tuscan, enjoyed general attraction also for 
group instruction, particularly in Germany and Spain, to a lesser degree in France, 
England and Croatia (cf. Marazzini 2000). French became a widely studied lan- 
guage in the 16th century, predominantly in England, the Netherlands, and Ger- 
many, to some degree also in Sweden and Poland, to a much lesser degree in other 
European countries; Italians rather learnt German or Spanish at that time (cf. Kaltz 
2000). Another language attracting a considerable number of learners in the 16th 
century was Spanish (starting with the reign of Charles V) in those countries that 
Spain kept strong bonds with, i.e. the Low Countries, Italy, France, England and 
Germany (Breva-Claramonte 2000: 717). Hebrew was taught in Protestant schools 
from the 16th century to the mid-18th century (Bobzin 2000). The teaching of Eng- 
lish in the late 16th century was basically restricted to coastal trading towns; this 
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changed gradually in the second half of the 17th century, with the writings of Prot- 
estant theologians and the work of the order of the Loreto Sisters (also known as 
Institute of the Blessed Virgin Mary) in the Catholic areas of Europe (cf. Schróder 
2000: 727). In the 19th and 20th centuries the modern languages continued to 
spread as subjects in schools, with different speeds in different countries and re- 
gions. 

By the 1980s, English was already compulsory in most countries of the Euro- 
pean Union (European Community). Today this is the case in Belgium (in some 
areas), Cyprus, Denmark, Germany, Greece, Italy, Luxembourg, Malta, the Neth- 
erlands, and Sweden (cf. Eurydice 2008: 45-46). Countries east of the EU where 
Russian was a mandatory language in the 1980s (which, by the way, was not seen 
as a foreign language in the Baltic countries) gave up this policy of compulsory 
Russian in the early 1990s (cf. Eurydice 2008: 34). After 1984, policies in Europe 
have lead to an increase in the number of years during which teaching of at least 
one foreign language is mandatory, and a decrease in the age at which this starts 
(cf. Eurydice 2008: 34-35). From the 1990s on, the Common European Frame- 
work of Reference for Languages has served as a grid for standardized language 
learning and assessing across Europe (leading to the standardized certificate 
UNIcert). The European Commission's 1995 White Paper Teaching and Learning — 
Towards the Learning Society stated that at the end of compulsory school training 
every European should be proficient in two foreign European languages. At the 
Lisbon summit of 2000 language competence was defined as one of five key skills. 

What is the situation at the beginning of the 21st century? Half of the EU's 
citizens state that they can speak at least one foreign language at the level of being 
able to have a conversation (from 29 % of Hungarians to over around 90 % of Es- 
tonian, Swedish, Danish, Slovenian, Lithuanian, Dutch, Maltese, Slovak and Lat- 
vian citizens, to 99% of the people from Luxembourg [TNS Opinion & Social 
2005: 3]). About 30 % of the EU citizens are at least bilingual (the EU’s aim of hav- 
ing citizens who are at least bilingual is reached by the majority of the population 
in Luxembourg, the Netherlands, Slovenia, Malta, Belgium, Denmark, Estonia, 
Latvia and Lithuania, in other words: smaller countries). About one tenth are at 
least trilingual (TNS Opinion & Social 2006: 8) — countries with percentages of bi- 
and multilingual citizens under 40 % are Ireland and the UK (TNS Opinion & So- 
cial 2006: 8). According to a 2005 study, the “multilingual European" can be char- 
acterized as a young, well-educated person, with a multilingual background (who 
is born or whose parents are born in another country), in a managerial position or a 
student, and motivated to learn (TNS Opinion & Social 2006: 10). 
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1.3.1. What does language education at school look like? 


How many languages do children (have to) study at school? In all European coun- 
tries save Ireland, Italy and the United Kingdom (Scotland), students have to learn 
at least one foreign language until the end of compulsory education (cf. Eurydice 
2008: 27). Most countries enable all children to learn two foreign languages during 
compulsory education; this is even compulsory in the Baltic countries, parts of 
Belgium, Bulgaria, the Czech Republic, France, Greece, Hungary, Italy, Luxem- 
bourg, Malta, the Netherlands, Portugal, Romania, Slovakia and the Scandinavian 
countries (except for Norway) (cf. Eurydice 2008: 28-31). The most common con- 
cept is to have all students study at least two foreign languages for at least one year 
during compulsory school education (cf. Eurydice 2008: 28-31). 

When do children start to study a foreign language? In almost all European 
countries children start to learn a compulsory foreign language at primary school, 
and the general tendency is to lower the age at which learners first come into con- 
tact with a foreign language (cf. Eurydice 2008: 27—29). Starting a second foreign 
language is compulsory in general secondary education in all countries of central 
and eastern Europe, Iceland, Norway, some western European countries; these lan- 
guages are started at primary school in Estonia, Luxembourg, Sweden and Iceland, 
in Luxembourg from the second year of elementary school onward (French and 
German, although both have official language status, are treated as foreign lan- 
guages in the Luxembourgish curriculum) (cf. Eurydice 2008: 31). On the level of 
compulsory general secondary education the annual time reserved for foreign lan- 
guage teaching varies from 0 hours (in countries and school-types where a foreign 
language is not a compulsory subject) to 203 hours (in Luxembourg), mostly be- 
tween 40 and 80 hours (cf. Eurydice 2008: 97-98). 

Which languages do children study at school? The most widely taught and 
known foreign language in Europe today is English (cf. Eurydice 2008: 12, TNS 
Opinion & Social 2005: 4). Except for Belgium and Luxembourg, it is the most 
prominent foreign language at European primary schools (cf. Eurydice 2008: 62). 
In most countries, it is also the most widely studied tongue in secondary education; 
the only exceptions are Belgium, Luxembourg and Romania (in the latter, though, 
it ranks only slightly lower than French). German and French share the rank of sec- 
ond most studied language, the former in the Nordic countries, the Netherlands and 
the countries of central and eastern Europe, the latter in the countries of southern 
Europe (cf. Eurydice 2008: 11). In the three Baltic countries and Bulgaria this posi- 
tion is held by Russian (which in other countries is hardly taught at all); and in 
merely four countries the second most widely studied tongue is yet another one, 
viz. Spanish in France and Sweden, Italian in Malta, Swedish in Finland and Dan- 
ish in Iceland (cf. Eurydice 2008: 67). According to polls (Eurydice 2008: 48), in 
EU member countries, most languages that are included in curricula as theoretical 
offers stem from the Germanic group (English and German) and the Romance 
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group (predominantly French, followed by Spanish and Italian). Official docu- 
ments in several countries also encompass Russian (France, the Netherlands, 
Germany, Austria, Malta, the Czech Republic and Slovakia, the Baltic countries, 
Romania and the UK), Latin (Ireland, the Netherlands, Belgium, France, Germany, 
Austria, Malta, Slovenia, and the UK) and Ancient Greek (Ireland, the Nether- 
lands, Belgium, France, Germany, Austria, and the UK). In some countries, cur- 
ricula allow for a rather broad range of languages; in Germany and France this 
includes Chinese and Japanese — both languages, however, are rarely offered in 
schools due to a lack of financial and human resources. Mention must also be made 
of the fact that just a limited number of the many languages theoretically offered 
are actually taught to a considerable portion of children at school: in secondary 
education, English, French, German, Spanish and Russian make up over 95% 
of all languages learned in the majority of European countries (cf. Eurydice 2008: 
11, 41). Despite its prominence at school, English is the most widely known 
foreign tongue in fewer than 20 BU member states, but it occupies the second or 
third rank in other EU countries, except for Slovakia (where it ranks fourth); in the 
UK and Luxembourg, French is the most widely known foreign language, in the 
Czech Republic and Hungary it is German (TNS Opinion & Social 2005: 5). 


2. Teaching concepts at European schools 


2.1. Concepts for teaching native languages 


Instruction of and in the mother tongue means "instruction of and in its standard 
variety" (the rise of dialects in status is a rather recent phenomenon, and in some 
countries only). It is not possible to draw a clear Buropean picture of concepts 
for teaching a mother tongue. At the basic level reading and writing are important, 
but the ways of teaching these skills also differ because of the different sound- 
letter combinations of different languages. Likewise, the types of teaching native 
grammar differ considerably due to the diversity of morphological and syntactic 
structures. Apart from these skills also those of text interpretation and text produc- 
tion can be found as objectives all over Europe. However, there is no agreement — 
sometimes not even within a country — as to the relative importance of formal vs. 
semantic skills. An attempt of bringing together discussions on policies of teaching 
native languages is the International Mother Tongue Education Network (IMEN), 
launched in 1982 (a state of the art and proposals for concerted mother tongue edu- 
cation are gathered in Herrlitz, Ongstad and van de Ven 2007). 
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2:2: Concepts for teaching a foreign language 


Historical outlines of the development of teaching approaches and methods are 
provided in several monographs, e.g. those by Titone (1968) and Kelly (1969, cf. 
particularly the synoptical table on p. 394). Polls revealing people's view on how 
they master languages and on the way foreign languages should nowadays be 
taught are presented in TNS Opinion & Social (2006). 

In the Middle Ages, ancient languages, predominantly Latin, were taught to (fu- 
ture) clerics for literary and scholarly aims. Learning Latin in the Middle Ages was 
still based on the works by the classical authors Aelius Donatus (Ars grammatica) 
and Priscian (Institutiones grammaticae). Beyond these, the first conversational 
books (colloquies) were already in use, which were used both at school and for self- 
teaching. Their target audiences were (future) merchants, travelers, students (cf. 
Rossebastiano 2000). After 1200, schools also used the Doctrinale puerorum by 
Alexander of Villedieu, written in verses and thus efficient for memorizing. Since 
the 12th century modern languages began to be taught — also to a broader audience, 
namely (future) physicians, scientists and jurists. The main method to teach Latin 
was presenting grammar from a book — in Latin, so that Latin was both medium and 
object of learning. The two motivations for learning a foreign language that we can 
observe since the Late Middle Ages are those of commercial and cultural interest. 
In the Renaissance era the methods for Latin and Greek continued the medieval pat- 
tern. In modern language classes, translation techniques were used for literary pur- 
poses; oral practice was introduced for social aims, viz. opening learners of French 
the fashionable aristocratic world and learners of English the worlds of philosophy 
and commerce. The moral formation through learning a foreign language (also a 
classical one) was also discovered and valued at that time. Scholarly aims did not 
play a role in the teaching of modern languages. Conversation books were the pre- 
ferred textbook genres in the Renaissance, since the 16th century most of them 
were even polyglot. The two most important textbook "families" are (1) Introito e 
porta, the first book for teaching two vernacular languages (Southern German and 
Venetian), whose derivates were mainly referred to as Vocabulista, and (2) Noel de 
Berlaimont's Colloquia et dictionariolum, which was based on the 1340 colloquy 
Livre des mestiers. While studying Latin was based on the usage of classical auth- 
ors, texts used for the teaching of vernaculars comprised the language of artisans, 
traders, and travelers. Language classes in the 16th century consisted of dialogue 
practice, reading and translations exercises, word memorizations, pronunciation 
and grammar exercises (Breva-Claramonte 2000: 717—718). In the 17th through 
19th centuries, social aims were secondary also in learning modern languages; 
scholarly aims dominated, literary purposes were the most important subsidiary 
goal. Grammar is approached in a logic-oriented way, which leads to the grammar- 
translation method, i.e. students translate texts word for word and learn grammati- 
cal rules and long lists of words. Oral practice plays a marginal role. 
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The development in the 20th century up until today can be sketched with the 
following three methods (cf. Richards and Rodgers 2001): 


(1) the formal approach: reading method and grammar-translation method 

(2) the direct method, established in Germany and France (based on the idea that 
second language acquisition must be an imitation of first language acquisi- 
tion; strong focus on orality, also leading to the first language laboratories in 
the late 1960's) 

(3) communicative language teaching 


After the first years of the communicative approach, more authenticity-oriented 
and real-life driven refinements have been undertaken, of which “task-based lan- 
guage teaching” is one of the outcomes. The communicative approach has also 
led to a number of concepts and ideas which try to connect communication to lin- 
guistic and non-linguistic content-learning. Theories and models of bilingual 
education in Europe ultimately go back to research in Canada and, to a lesser de- 
gree, the US; Titone (1972) and Siguán (1975) are considered the European pion- 
eers in this respect (cf. Baetens Beardsmore 1996: 465). Another model is Jean- 
Pol Martin's LdL (2 Lernen durch Lehren/Learning by Teaching), whose core 
idea is that learners are given as many teaching tasks as possible, but which is a 
pluralistic model integrating nearly all methods and nearly all social forms cur- 
rently used in Europe (cf., e.g., Grzega 2006b). Intercultural competence also 
evolved as a vital notion of European education in the 1990's. At the beginning of 
the 3rd millennium, European countries focus especially on CLIL (2 content- 
and-language-integrated learning). CLIL models exist in most countries, but only 
a minority of children can participate in CLIL programs; although the status and 
position of the type of CLIL varies depending on the country, it is frequently an 
integral part of the education system (cf. Eurydice 2008: 11). Other important de- 
velopments can be noticed in the field of computer-assisted language learning 
(CALL): this includes the Internet, where we can find two basic types of language 
learning offers: (a) language exchange websites and (b) portals that provide lan- 
guage content. The official curricula for foreign languages in European countries 
name the skills of listening, speaking, reading and writing as the four major aims 
(though not always in equal ranking — in elementary schools the focus is often 
laid on speaking and listening) (cf. Eurydice 2008: 87). Furthermore, the EU has 
established specific language promotion programs: the COMENIUS objectives, 
for example, are the improved mobility of students and educational staff, the en- 
hancement of international partnerships, the promotion of foreign language 
learning, innovative ICT-based content, services and better teaching techniques 
and practice. The LINGUA program was established to encourage and abet lin- 
guistic diversity in the EU, to help improve the quality of language teaching and 
learning, and to enable lifelong language learning opportunities appropriate to 
each individual's needs. 
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During the Middle Ages and up until around 1800, foreign languages were 
taught by native speakers (cf. Titone 1968: 9). The professionalized training of lan- 
guage teachers is an invention of modern times. In all countries, trainings, or study 
programs, as semi-specialists and specialists in teaching a foreign language are 
first given at universities (cf. Eurydice 2008: 14). In most European countries such 
programs include compulsory theoretical courses on foreign language teaching. In- 
school placements and practical language training are frequent as well. But only 
7 countries officially recommend stays abroad in a target language country as an 
activity (Belgium, Ireland, France, Austria, Norway, Sweden) (cf. Eurydice 2008: 
83-84). 

At elementary schools today, foreign languages are taught by general teachers 
qualified to teach all (or almost all) subjects in the curriculum. Exceptions are 
Malta, Greece, France, Spain, Portugal, Poland, Slovakia, Bulgaria, Romania and 
the UK (just in Scotland, however), where we find specialist or semi-specialist 
teachers (cf. Eurydice 2008: 77). On the level of secondary education, we find 
specialist or semi-specialist teachers in the majority of European countries (cf. 
Eurydice 2008: 79). (For information on current developments in, and polls on, 
foreign language teaching in the EU, visit the websites of the European Commis- 
sion (http://ec.europa.eu) and the Council of Europe (www.coe.int) and the Euro- 
pean Journal of Language Policy, founded in 2009). 


2:3. Concepts for teaching a lingua franca 


While Latin was the lingua franca in central and western Europe during the Middle 
Ages and into Early Modern Times, this had no specific effect — as far as we can 
tell — on the concepts of its teaching. The aspects of cross-cultural communicative 
behavior and the use of the lingua franca as an opener to other languages was not 
taken into account (except for the fact that the grammar terminology for Latin was 
also used to classify and explain the grammar of the modern languages). 

The desire to have a neutral lingua franca is expressed in the creation of sev- 
eral articifial languages. One of them, Esperanto (or Internacia Lingvo/ILo), has to 
a limited extent gained international attention in school education. In 1975 it was 
introduced as a project subject in elementary schools, first in Germany, then in 
other European countries (France, Italy, Croatia, Austria, Hungary and Slovenia). 
In this project ILo was conceived as a first tool for reflecting on language and 
communication and as an instrument for facilitating the acquisition of other 
foreign languages at later stages (cf. Frank and Lobin 1998). The concept entered 
Italy's official curriculum for foreign language teaching at elementary schools in 
1993. Since English is naturally the most widespread language world-wide, there 
have been many calls for respecting this fact also in the teaching of English as a 
foreign language; specific vocabularies for lingua-franca teaching have been sug- 
gested since the 1930's (starting with Charles Ogden's BASIC English) and the 
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number of empirical studies on phonetic, grammatical, pragmatic and social as- 
pects of lingua franca English has been increasing since the 1990's (cf. Seidlhofer 
2007 and Grzega 2008 for brief overviews). The idea of transferring empirical 
findings into concrete new models of teaching English as a lingua franca has both 
advocators and objectors and will undoubtedly continue to be hotly debated (cf. 
Grzega 2010). 


See also the following chapters in this volume: 29 by Darquennes, 32 by House, 39 
by Aschenberg, 42 by Stern, 43 by Voss and 44 by Wright. 
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1. Introduction 


This chapter deals with the prehistory of the European languages from the earliest 
arrival of modern humans in the late Stone Age until the dawn of history, marked by 
the attestation of written records in Europe from the second millennium BC on- 
wards (see Appendix for a timeline). Its aim is to give an overview of the main de- 
velopments and to point out the main results of recent research as well as to report 
on current discussions. One central topic will be language contact during the period 
of the Neolithic until the beginning of the Iron Age, in particular the impact of sub- 
strate languages on the Indo-European languages and the Indo-European super- 
strate in the Uralic languages. This chapter is structured as follows: Section two 
deals with Europe before the Neolithic period. In section three the Neolithic popu- 
lation movements are addressed in connection with the process of agriculturaliz- 
ation and the languages that entered Europe during this time. Section four discusses 
the linguistic effects of the Neolithic developments, focusing in particular on the 
Vasconic Theory by Theo Vennemann. The final section presents the linguistic map 
of Europe before the arrival of the Iron Age and sums up the most important results. 


2. Prehistoric Europe until the start of the Neolithic period 
(until 7000 BC) 


According to general opinion, modern humans (homo sapiens sapiens) entered Eu- 
rope during the Upper Paleolithic, around 33000 BC, living alongside the Nean- 
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derthal man (homo sapiens neanderthalensis), whose last traces in southernmost 
Spain are dated around 24000 BC. Climatically, this time was characterized by the 
last glacial (about 110000 to 10000 BC), in which periods of low temperature and 
repeated glaciation alternated with shorter periods of higher temperature, during 
which humans could live in central Europe, having to retreat to more hospitable 
southern areas during glaciations. The end of the Paeolithic in Europe coincides 
with the end of the last glaciation, when the whole continent became suitable for 
human survival again. 


2:1; Humans in Paleolithic Europe (33000-9000 BC) 


Throughout the Paleolithic period hunter-and-gatherer groups lived in Europe, 
though significant cultural traces during the glacial maximum (around 18000 BC) 
have only been found south of a line stretching from southern France in the west to 
southeastern Ukraine in the east, excluding the Alps, which were covered with a 
thick ice sheet during the entire time. In the south green plains existed, which pro- 
vided enough subsistence for small groups of humans to live, whereas this was not 
the case in the treeless tundra further north. During the glacial maxium the ice in 
central Europe reached as far as just south of Berlin. As a result, Germany was vir- 
tually uninhabited during the time of the glacial maximum until the arrival of 
people from the warmer south from about 12000 BC onwards when Northern Ger- 
many was ice-free (Probst 1999: 89). In Britain only the south and the southeast 
were free from ice as was the western part of the European mainland until as far 
north as Norway. In Eastern Europe, too, only the south was habitable. As a result, 
until the ice began to recede, humans survived only in the southwest (Southern 
France, Iberian Peninsula), the south and the (south-) east of Europe. However, the 
eastern and the western group were largely isolated from each other until the Me- 
solithic era, which has also linguistic ramifications, in the sense that the languages 
of both groups were probably only distantly related (Kallio 2003: 228). 

As far as can be told from the archeological finds, the widespread pre-glacial- 
maximum Gravettian Culture! (about 26000 to 21000 BC) was superseded by the 
Magdalenian (the main western culture from 18000 to 10000 BC) and Solutrean 
Cultures in the west and by the Sviderian, Ahrensburg and Hamburg Cultures in 
the east until the final stages of the Paleolithic period. The traces left by these 
groups of hunters and gatherers are impressive, including tools made of stone and 
bone, art and small settlements, yet nothing is known about the languages they 
spoke. People lived in groups of 25 to 100 during this time. From 9500 BC onward 
the temperature increased rapidly and so the ice sheet, which had been retreating 
since about 15000 BC, left Europe to be re-populated. 
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2.2. Mesolithic Europe: repopulation from the south (9000-7000 BC) 


The trend towards complete occupation of the European continent had already 
started in the last stages of the last ice age, and continued in the Mesolithic period, 
which began in the 9'^ millennium BC, when the temperature increased rapidly. 
From the south-west and, to a lesser degree from the south-east, people moved 
northward as northern Europe became more hospitable. Both movements cannot 
only be traced archaeologically, but also genetically (see Villar and Prosper 2005: 
397—419 for an overview of the genetic research). Hunter-and-gatherer groups en- 
tered England in the 9th and Ireland in the 8th millennium BC, leaving some of the 
most important cultural traces of the European Mesolithic era. In central Europe, 
the relics of the Magelmosian Culture in Denmark bear witness to the northward 
expansion, as do the remnants of the Kunda Culture (Estonia) in eastern Europe. 

In the wake of these migrations the eastern Paleolithic group probably came 
into contact with the western group, so that, for example, the human pioneers in the 
area east of the Baltic Sea originally belonged to both groups and were therefore 
not likely to be a linguistically homogenous group (Kallio 2003: 229). With regard 
to southeastern Europe, however, archaeologists generally do not speak of a Me- 
solithic period: firstly, because the area was only very sparsely populated, and sec- 
ondly because the first farmers, and with them the Neolithic period, became estab- 
lished in Greece already around 7000 BC (van Andel and Runnels 1995: 481). 
During the Mesolithic period people lived as nomadic hunter-and-gatherer groups, 
but in contrast to their Paleolithic ancestors they established comparatively large 
settlements for up to 100 people, consisting of wooden huts in which they lived 
over several months (Probst 1999: 170). 

Similar to the preceding period, though the archaeological finds clearly permit 
the inference that quite a number of hunter-gatherer groups existed in Mesolithic Eu- 
rope, virtually nothing can be said about the languages that were spoken then. It 
seems clear, though, that the people living in Europe at this time were exclusively 
native in the sense that they were descendents of the Paleolithic population of Eu- 
rope. However, all inferences and speculations about their languages can only be 
based on what can be reconstructed from the attested situation several millennia later. 


3. Native populations and immigrants: 
Europe until the Bronze Age (7000—2000 BC) 


The characteristic developments of the Neolithic period in Europe are the spread of 
agriculture, stockbreeding, pottery and, later, metallurgy. This era also witnessed 
the arrival of immigrant people and languages. The various kinds of pottery found 
from this time are frequently used as technical terms to characterize a particular 
culture, such as Linear Ware, Combed Ware, Corded Ware, etc. The Neolithic is the 
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first period for which inferences can be made about languages that were spoken in 
Europe, though the divergent opinions on this matter bear witness to the consider- 
able amount of speculation that is involved in this. At this point it is necessary 
to briefly reflect on the methodology used to investigate prehistoric languages. Un- 
less, of course, there are written records of a culture that permit a linguistic identi- 
fication, the language of a particular culture cannot always be determined in a 
straightforward way. Basically, the following methods have been employed 
(Zimmer 1990: 11-12): 


a) The lexicon of a language is used to draw conclusions about the speakers’ cul- 
ture and their habitat (linguistic paleontology). 

b) Loanwords are used as evidence for contact between languages and their lo- 
calization. 

c) Due to their stability, toponyms can provide reliable evidence about prehis- 
toric stages, provided that their language(s) can be identified. 

d) Based on the known locations of languages and their speakers in historical 
times, inferences are made about the relative position of these languages in pre- 
historic times. 


This overview confines itself to a brief treatment of linguistic paleontology, be- 
cause it is probably the most controversial method. The main point of criticism di- 
rected against this approach is that the existence of a particular term in the lexicon 
of a language is a necessary but not a sufficient condition to allowing us to infer the 
existence of the concept or object denoted by the relevant term within the culture 
that is supposed to have spoken this particular language (Zimmer 1990: 14). How- 
ever, while this is obvious, it does not automatically follow from this that linguistic 
paleontology is a useless tool. It just means that it has to be applied with caution 
and that every case has to be examined carefully. But the obviously close connec- 
tion between language and culture cannot be denied, and this can clearly be used to 
examine the cultural reality of a language. 


3.1. The Northeast: The arrival of Uralic languages 


Recent Uralicist research dates Proto-Uralic around the turn of the fifth and fourth 
millennia BC, and locates its homeland in the Middle and Upper Volga region.” 
Kallio (2003: 228) argues that the area of the Pit-Combed Ware Culture (about 
4200-2000 BC) fits “the area of the Uralic languages in the beginning of the his- 
torical age better than the area of any other prehistoric cultural complex does.” 
Based on archaeological and linguistic evidence, Kallio (2003: 231—232) argues 
that the speakers of Saamic did not originally speak Uralic, and that they geneti- 
cally belong to the Western Paleolithic group that originally settled on the western 
coast of Norway. Accordingly, the ancestors of the Saamic speakers entered Fin- 
land only during the Bronze Age, when their language finally became Uralic. How- 
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ever, Koivulehto (2001 et passim) makes a case for early Indo-European loanwords 
in Saami, dating from the fourth and third millennia, which is problematic for the 
theory of a late Uralicization of Saami. It has to be noted that Kallio’s theory would 
seem to predict far more radical substrate changes in the resulting Uralic Saami 
than the few phonological changes he mentions (2003: 232). By contrast, Koivu- 
lehto’s hypothesis would have to explain how exactly speakers of a Northwestern 
Indo-European protolanguage connected to the Corded Ware Culture should come 
in contact with speakers of Saami, who presumably lived in Finland at the time 
(cf. Kallio 2003). This issue will not be pursued any further here. However, beside 
the question of Saamic, it seems clear that the Uralic languages spread during the 
Neolithic period and settled in northeastern Europe. 


3:2. Agriculturalization and the Indo-Europeans in Europe 


The beginnings of agriculture in Europe, and hence of the Neolithic period, are 
commonly dated somewhere in the seventh millennium BC. From its beginnings in 
Greece, agriculture spread to the Balkan area and then within only 500 years 
through central and western Europe, manifesting itself culturally in the shape of 
the Linear Ware Culture, which is found all over this area. There were two main 
paths along which agriculture spread throughout Europe, one to the northwest and 
one to the west of the Balkans, both joining up about half a millennium later in 
western Germany.? The archaeological and genetic facts point to an addition of 
new people to the Mesolithic population of central and western Europe, a more 
Mediterranean type. During this period the total population of Europe increased 
significantly (Probst 1999: 228, 252). However, in contrast to the demic diffusion 
model (spread of agriculture and population) and the cultural diffusion model 
(spread of ideas only), recent genetic research indicates that probably people as 
well as ideas diffused into Mesolithic Europe, but that neither model taken to its 
extremes is likely to be correct (Haak et al. 2005). 

The arrival of agriculture has frequently been connected with the arrival of 
speakers of Indo-European languages in Europe and the question of the Indo-Euro- 
pean homeland. Notwithstanding the criticism the concept of an Indo-European 
culture and an Indo-European homeland has attracted from both archaeologists and 
linguistics (see Häusler 2003 with references), from a linguistic point of view it is 
clear that there must be an ancestral language connecting all Indo-European lan- 
guages as a top node, and it is highly likely that there were speakers of this language 
at some stage. Whether these speakers possessed a more or less uniform culture is a 
different matter altogether, but the assumption of one, even though idealized, 
protolanguage is a methodological necessity from a linguistic point of view. 

Where the Indo-European protolanguage was spoken, however, has been de- 
bated in the literature. Since the modification of Colin Renfrew's theory (see Ren- 
frew 2003), the exact location of the Indo-European Urheimat is no longer an issue 
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of tremendously high significance. The close contact between Proto-Indo-Euro- 
pean and Uralic, which is evidenced by early Indo-European loanwords in Uralic 
(see Kallio 2003 and Koivulehto 2001 et passim, both with references), strongly 
suggests that these two protolanguages were spoken adjacent to each other.* This, 
in turn, argues in favour of a stage during which Proto-Indo-European was spoken 
in a relatively small area south of the Uralic homeland. Whether or not the speakers 
originally immigrated from Anatolia, as Renfrew (2003) claims, or whether the 
Anatolian languages arrived in Anatolia after a breakaway move of individual 
branches is secondary for the purpose of this chapter. 

Whether the speakers of Proto-Indo-European were also responsible for the 
spread of agriculture, which is associated with the Impressed Ware and Linear 
Ware cultures, is however, doubtful. The reason for this is the vast area this cultural 
complex covers. As Kallio (2003: 232) points out, the common Indo-European ter- 
minology for wheeled vehicles is too uniform to assume an early extension of 
Proto-Indo-European across such a vast territory without some linguistic differ- 
entiation. Moreover, the fact that most of the Indo-European agricultural vocabu- 
lary seems to be restricted to Northwestern Indo-European, with a good deal being 
non-Indo-European loanwords, implies that Europe was already agricultural when 
the Indo-Europeans settled there (Kallio 2003: 233). 

This means that a common Indo-European protolanguage was probably spoken 
roughly south of and adjacent to the Uralic homeland before it split up. As wheeled 
vehicles were invented around 3500 BC (Probst 1999: 239), this suggests that Proto- 
Indo-European was still spoken around this time (see also Kallio 2003: 232). Con- 
sequently, the Indo-Europeanization of Europe is a more recent phenomenon than 
the rise of agriculture. It started after 3500 BC, and is connected to different cultural 
complexes (cf. Mallory and Adams 2006: 452), as has been suspected by archae- 
ologists skeptical of a uniform Indo-European culture (see e.g. Háusler 2003). 


4. Effects of the Neolithic expansions on the linguistic map of Europe 


4.1: A Mesolithic substrate in Burope: Vasconic 


In a series of papers (see Vennemann 2003a et passim), Theo Vennemann has ad- 
vanced a theory of the linguistic prehistory of Neolithic Europe. He proposes that 
the Mesolithic population of Europe spoke languages of which modern Basque is 
the only survivor, which he therefore named Vasconic. Basically, this assumption 
follows from what is known from archaeology, namely that Europe was repopu- 
lated mainly from the southwest after the last glacial maximum (cf. 2.2 above) and 
the fact that Basque was spoken in southern France and northern Spain in antiquity 
(as, in fact, today). The genetic data linking the speakers of modern Basque to the 
Mesolithic process of repopulation renders material support to the theory. Accord- 
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ing to Vennemann’s theory, the Vasconic languages formed a substrate layer for the 
languages in (Sub-)Neolithic times, leading to the following main effects: 


a) The remarkably uniform and extensive system of the Old European (i.e. Pre- 
Neolithic) hydronymy (see e.g. Krahe 1963), and more generally, the Old 
European toponymy are Vasconic. This follows directly from the assumption 
that the majority of Mesolithic Europeans spoke Vasconic languages. During 
the process of repopulation they named every feature of the landscape they 
came across and every location they stayed at. It is well known that later popu- 
lations generally take over the old names and refrain from changing their sub- 
stance material (though modifications, especially additions are common). The 
uniformity and the extension of the toponymic system also make it unlikely 
that we are dealing with a coincidence. Moreover, Vennemann (2003a, chapter 
6) shows that the language of the Old European hydronymy is unlikely to be 
Proto-Indo-European or a direct descendent, and instead points out structural 
and lexical correspondences to (Proto-)Basque. 

b) A lexical substrate exists in several Indo-European languages. So far a number 
words in several Western Indo-European languages without convincing Indo- 
European etymologies have been etymologized as going back to the Vasconic 
substrate. 

c) Several structural properties of Western Indo-European languages are due to 
Vasconic influence. Examples are the vigesimal way of counting (basis 20 in- 
stead of 10, cf. F quatre-vingt ‘80’), which is autochthonic in Basque and the 
fixed dynamic stress on the initial syllable, found only in Germanic, Italic and 
Celtic, which, according to Vennemann (2003a: 178), can perhaps also be as- 
sumed for an earlier stage of Basque. 


The criticism directed against this theory has neither been able to falsify it nor been 
able to present a plausible alternative. Though it is correct that the Mesolithic re- 
population of Europe was not achieved by one genetically homogenous group of 
people (Villar and Prosper 2005: 411), which may perhaps imply that the linguistic 
situation was not homogenous either, this does not rule out the possibility that at 
least the western and southwestern languages belonged “to the same linguistic 
stock" (Vennemann 2003a: 181).5 The archaeological and genetic evidence at least 
clearly suggest that the western group left more substantial marks, which may well 
argue for their numerical and technological supremacy. 

Likewise, the objections raised by some Bascologists against correspondences 
between Basque and the Old European language reconstructed by Vennemann 
show that Old European was not identical to either modern or historical Basque 
(Trask's 1995 Pre-Basque dated about 2000 years ago). Atthe same time they can- 
not rule out that Vennemann's Vasconic languages are related to Modern Basque 
over a distance of more than ten millennia. As far as alternative theories are con- 
cerned, the idea that the Old European hydronymy is Indo-European is implausible 
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for linguistic and extralinguistic reasons and theories that operate with mysterious 
languages that may have existed and subsequently disappeared are no scientific al- 
ternative. As far as properties b) and c) above are concerned, they have either not 
been convincingly explained (b) or virtually completely ignored (c). 

To conclude, it is conceivable that a good deal of the Mesolithic population in- 
cluded speakers of languages belonging to a group that is distantly related to Mod- 
ern Basque and that left its traces on the linguistic map of Europe in the shape of 
the oldest place names in Europe and various other substratal features in Western 
Indo-European languages. The immense time depth and the paucity of data, how- 
ever, remain a considerable challenge for any hypothesis on the linguistic situation 
at that time. 


4.2. More on language contact in Neolithic Europe 


Apart from the Mesolithic substrate termed Vasconic and the one detectable in the 
Uralic languages, there are other situation of language contact in Neolithic Burope 
with different stratifications that are worth mentioning. 

Starting in the northwestern edge of Europe, there is the case of Pictish, a 
sparsely attested language spoken in Scotland in historical times. It has been ident- 
ified as a non-Indo-European language, though heavily Celticized (Forsyth 1997). 
Hypotheses about the origin of Pictish have either suggested an ancient Mesolithic 
connection to the continent (e.g Kallio 2003: 232 fn 3) or a more recent Neolithic 
migration, as Vennemann's theory on the prehistory of Europe proposes. Accord- 
ing to the second hypothesis, the ancestors of the Picts were speakers of a Semitic 
language — termed Atlantic — who arrived in Western Europe across the sea and 
settled in the British Isles, forming a substrate for the Celts, who arrived many mil- 
lennia later. Insular Celtic — Irish in particular — diverges in many structural re- 
spects from other Celtic languages, which points to a situation of language shift 
(Schrijver 2004), and these deviances find close parallels in Semitic languages as a 
number of studies have shown (cf. Vennemann 2003b: 327). 

However, cases have also been made for other lost contact languages in pre- 
Indo-European Europe, which may have arrived together with agriculture at the 
beginning of the Neolithic period, and which may have left their traces in some 
Indo-European languages (see Schrijver 2001). In some cases these may even have 
been Indo-European languages, as Kallio (2003: 231) suggests for various loan- 
word strata in Uralic languages, and as has been suggested in the literature for 
Greek (see Strunk 2003 with references).° Other cases in point are the non-Indo- 
European languages on the Iberian Peninsula. Basque was addressed in the pre- 
vious section; however, not much is known about the others, Tartessian and Ibe- 
rian. The Iberians left written records, which have not yet been understood. What 
seems clear, is that Iberian is not directly related to Basque (Trask 1995), but the 
relations of Tartessian are almost completely unclear. It is also unknown when 
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these languages arrived on the Iberian Peninsula, though there is some indication 
that they arrived during Neolithic times. At any rate, both languages were already 
there when the Indo-Europeans (Celts and later the Greeks) and Semites (Phoen- 
cians and Carthagenians) arrived. The language commonly referred to as Celtibe- 
rian bears testimony of the Iberian substratal influence on Celtic. 

It has been suspected that Tartessian and Iberian originally belonged to the so- 
called Aegean language complex, originating in Asia Minor. For other extinct lan- 
guages of Europe this has been proven, namely for Etruscan and its relatives Rae- 
tian and Lemnian (cf. Steinbauer 1999, Schumacher 2004). However, one of the 
greatest linguistic puzzle of the Neolithic period is that of the Minoan language, 
which has remained unaffiliated so far. 

The two biggest language families in Europe, Uralic and Indo-European, have 
been in contact from the beginning. The continuing contacts between northern 
Indo-European languages, Germanic and Baltic, and Uralic languages, Finnic and 
Saamic, manifested themselves in shared structural features and a layer of super- 
strate loanwords in some Uralic languages as well as in substrate features in Balto- 
Slavic (cf. Kallio 2003: 230 with references). 


5. Languages in Europe in the Bronze Age and beyond (2000-500 BC) 


At the end of the Neolithic period the Indo-European languages were spreading 
across the continent. By the end of the Bronze Age in the middle of the last mil- 
lennium BC, we find all languages in their historical locations. As mentioned in 3.2 
above, the Indo-European languages are probably represented archaelogically by 
more than one cultural complex. Though their speakers did not bring agriculture to 
Europe, they continued to shape this continent decisively. However, we must not 
forget that language contact situations continued to occur after the Neolithic period. 
Investigating these can shed some light on hitherto unexplained phenomena. 

One case in point is the influence Semitic languages have exercised since the 
Bronze Age, in particular the Phoenecians and the Carthagenians (often aptly 
called Orientalization). There is clear indication for this influence in the archae- 
ological remains from the Mediterranean area, the Iberian Peninsula and also the 
British Isles. Consequently, linguistic ramifications should be expected, some of 
which Theo Vennemann has explored (see Vennemann 2003b) in an expansion of 
his Vasconic Theory (cf. 4.1 above). The results show the considerable explanatory 
potential of this line of research. Such studies demonstrate how difficult the lin- 
guistic investigation of the prehistoric era can be, but also how powerful linguistic 
tools are in this area of research. 


See also the following chapters in this volume: 6 by Daniel and Lander, 8 by 
Laakso, 10 by Haase, 16 by Tomić, 24 by Tróster-Mutz and 38 by Ramat. 
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Appendix: Timeline 


Upper Paleolithic Mesolithic 

33,000 BC 24,000 18,000 9,000 

Homo Sapiens Neanderthal last glacial end of Ice Age/ 

enters Europe man extinct maximum beginning of 
Mesolithic 

Neolithic Bronze Age 

7,000 4500 3500 2000 500 

agriculture agriculture development IE daughter end of 

spreads to in Central of the wheel; languages Bronze Age 

Greece Europe Proto-Indo- 


Proto-Uralic European 


Notes 


* | would like to thank the editors, Stephen Laker (Leiden) and Theo Vennemann (Munich) 
for reading over drafts of this chapter. All remaining errors are my own. 

1 From an archaeological point of view, there is a terminological difference between the 
terms culture and group. The former refers to a complex phenomenon consisting of tools, 
economy, art, settlements, burial rites and anthropological characteristics, whereas for 
groups not all of these elements can be described. In this narrow sense, the pre-Neolithic 
cultures are rather like groups, and it is only from the Neolithic onwards that archaeol- 
ogists speak of cultures (Probst 1999: 227). 

2 Fora survey of other theories see Kallio (2003). 

3 For arguments against an autonomous development of agriculture in Europe, at least not 
to the extent of the Linear Ware Culture see Probst (1999: 250). Also the genetic facts re- 
ported in the main text are counter-arguments to such a view. 

4 Note that these lexical correspondences clearly point to language contact and not to a 
genetic relationship in the sense of the Nostratic Theory. As Koivulehto (2001: 257) 
points out, the phonetic matches are far too exact to be due to a genetic relationship. 

5 It is apparent that genetic and cultural homogeneity do not permit the inference of lin- 
guistic homogeneity. The genetic study by Calderon et al. (1998) concludes that the 
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speakers of Basque arrived on the Iberian Peninsula only in Neolithic times. However, 
given the poor linguistic relations between Caucasian languages and Basque (cf. Trask 
1995: 81-86) it has to be assumed that Basque is the last survivor of the Mesolithic lan- 
guages of Europe (see Trask 1995: 91 and 1997: 8). Note also that the eastern and the 
western group ultimately go back to the same wave of Paleolithic immigration from Af- 
rica. Hence, their languages are actually likely to be ultimately related. 

6 Lusitian has been shown to have been an Indo-European language (Mallory and Adams 
2006: 37). This has also been demonstrated for Ligurian, which is related to Continen- 
tal Celtic Lepontic, the earliest attested Celtic language (cf. Mallory and Adams 1997: 
233). 
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1. Preliminary remarks 


Since this chapter will deal with migrations into the European continent from a 
(pre)historic point of view, there will be many contact points with other chapters of 
the volume, especially those dealing with areal typology (Part II.1) and language 
contact (Part II.2) in specific sub-areas of the continent (the Circum-Baltic area, 
the Mediterranean area, etc.). Contact-induced changes have shaped the linguistic 
history of Europe through the millennia. Only the Basques in the Pyrenees cannot 
be considered as an output of migrations (at least in the millennia whose (pre-)his- 
tory we can reasonably recover). 

Therefore it is necessary to begin with some general reflections in order to give 
a theoretical frame for concepts such as ‘linguistic area’, ‘linguistic contact’ or 
linguistic influence (adstrate, superstrate, substrate). For a general discussion see 
Heine and Kuteva (2006), Muysken (2008), Siemund and Kintana (2008). It goes 
without saying that just the main lines of the history of language contacts in Burope 
will be sketched on the following pages. Each area, indeed each sub-area would de- 
serve an in-depth description on its own. References to other chapters of the book 
will help to deepen particular aspects of the general story. 

In order to give an exhaustive picture it is necessary to speak not only of mi- 
grations but also of invasions. Just to give an example of how multi-faceted and 
complex the contacts in a specific area can be, consider the Salentum peninsula in 
Southern Italy (Brindisi and Lecce provinces). The peninsula was inhabited by the 
Messapii, an Italic population that has left written documents since the 5th century 
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BC. They used an alphabet from Tarentum (a Greek colony founded in the 7th cen- 
tury BC) and were strongly influenced by the Ionic dialect of the Magna Graecia 
colonies. The Romans conquered the peninsula in the 4th century BC, and for a 
short period after the Romans it were the Goths and Lombards who ruled the 
country. From the 6th until the 11th century the Byzantines incorporated it into 
their Empire. Then the Normans came. After the fall of the Hohenstaufen emperors 
(Frederick II, 1 1250) there was an Aragonese domination, while the coasts were at 
times occupied by Turks and Venetians. Now, the documents from the 10th to the 
13th century (mainly scriptae of notaries) testify to a complex mixture or coexist- 
ence of Greek, Latin, and vulgar Romance, according to different sociolinguistic 
and cultural layers: Byzantine-Lombard features are attested along with “semi-vul- 
gar’ Romance traits. Albanian dialects are still spoken in some villages of Salen- 
tum and finally, we have to mention the Grecia Salentina, a yet surviving linguistic 
island, which is now protected by the European legislation for linguistic minor- 
ities. It is not clear whether the ‘grecanico’ (or ‘griko’) spoken in Salento is a direct 
continuation of ancient Greek or the outcome of a new wave of immigrants who, 
like the Albanians, wanted to escape the Turkish domination in the Balkans. 

Another preliminary point deserves our attention. We have to distinguish be- 
tween cross-linguistic similarities (isoglosses) due to common genetic origin, on 
the one hand, and similarities which originate in a linguistic area (‘Sprachbund’) 
among neighbouring languages, on the other hand. According to the pioneering 
paper by Trubetzkoy (1928), the latter are characterized by morphological and syn- 
tactic similarities, similar phonological inventories (but not systematic sound cor- 
respondences), and shared culture words, while the former are characterized by a 
common basic vocabulary, agreement in the phonetic expression of shared gram- 
matical categories, and consistent sound correspondences (sound laws). Practi- 
cally, it is often difficult to make a clear distinction between the two. It may be use- 
ful to illustrate the problem by means of a concrete example. 

A widespread phenomenon around the Mediterranean is the rise of the definite 
article. Mycenean Greek does not appear to have had one. It appears that the definite 
article took shape in the writings of Homer. It became normal especially in prose 
from the late-archaic period onwards. The Greek colonization extended the (re-)dif- 
fusion of the Greek language not only to the Anatolian coast but also to the coasts of 
the Western Mediterranean, introducing this grammatical category there. But even 
before the Greek documents we can follow the development of the definite article 
through texts which begin in Ancient Egypt and extend to the Semitic languages of 
the 3rd and 2nd millennium BC, as well as Ugaritic and North-Western Semitic. 
The kinds of texts where we find first forms that may be interpreted as definite ar- 
ticles (letters and other informal scripts — along with official documents without 
these forms) indicate that the definite article was generated, on the one hand, by in- 
ternal structural factors in the various languages, due to the weakening of nominal 
inflection, and, on the other hand, by the general pragmatic requirements of discur- 
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sive efficacy (deixis): cf. Modern Hebrew kol ha-?anasim (ADJ+DET-man;; ur) 
‘all the men’ versus kol is (ADJ+man) 'every/each/any man’; Spanish todos los 
hombres (ADJ+DET+N) "all the men’ versus todo hombre (ADJ+N) ‘every man’. 
The parallel manner in which the development appears to have occurred in the Near 
East suggests that the appearance of the definite article is due to spontaneous pro- 
cesses of “natural morphology” in the different linguistic traditions. For instance, 
Germanic languages saw the same development of a definite article from deictic 
pronouns (e.g. German der, English the) (see Putzu and Ramat 2001). 

There can be little doubt that the (postposed) definite article we find in Mace- 
donian and Bulgarian is attributable to language contact in the Balkan area, given 
that the other Slavonic languages do not have it or are at most on the way of devel- 
oping some form of it (Czech, Slovenian, Sorbian-Lusatian under the influence of 
German). Since Bulgarian and Macedonian belong to the Balkan linguistic area but 
are at the same time members of the Slavonic language family, we have to con- 
clude that the rise of the definite article was due to a twofold cause: a process of 
‘natural morphology’ and the influence of the Balkan Sprachbund reinforcing this 
morphosyntactic model.! 


2. The Indo-Europeans 


There is general agreement that the Indo-Europeans (IEs), who nowadays consti- 
tute by far the largest language family of Europe, at some point migrated to this 
continent from somewhere. Opinions about their original homeland and the time 
of their migrations diverge strongly. A short summary of the different opinions 
is presented below. The only pre-IE population and language that still survives in 
Europe is Basque (Hungarians arrived in Europe only in the 9th century AD, see 
below; and Malta, where an originally Semitic language is spoken, is at the outer 
edge of Europe). The East Mediterranean area is the one whose (pre)history may 
be best reconstructed. 

The archaeologist Colin Renfrew suggested (1987) that Proto-Indo-Europeans 
(PIEs) lived in Anatolia, during the Neolithic era, later diffusing throughout the 
Mediterranean and into Central and Northern Europe. According to Renfrew, the 
PIE diffusion towards the West may be proven by following the archaeological 
traces of the diffusion of agriculture: the demic diffusion of farming would corre- 
spond to the progressive Indo-Europeanization of Europe. Several peaceful waves 
of colonization from ca. 6500 BC on would have introduced the agricultural rev- 
olution more and more into the (north)western parts of Europe. This diffusion 
would have been accompanied by the splitting of the PIE language into different 
branches, such as Northwestern IE, Balkan PIE, Early Steppe PIE, and after 3000 
BC we would have had the formation of the languages we historically know, e.g. 
Italic, Celtic, and Germanic from the Northwestern IE (see Renfrew 2003). 
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James P. Mallory has been strongly critical of Renfrew's theory, maintaining 
that linguistic palaeontology is the most valid tool for solving the IE homeland 
problem and arguing that this discipline does not offer any support to Renfrew's 
reconstruction. The vocabulary we may reconstruct via cross-linguistic compari- 
son indicates a culture of the Early Bronze Age, with knowledge of the wheel, 
metal working, and domestication of the horse. But this stands in stark contrast to 
the early Neolithic dating originally suggested by Renfrew (8th millennium). Ren- 
frew's revised hypothesis (2003) of an “Indo-Hittite” scenario dating back to 5000 
BC approaches the Kurgan model (see next paragraph), at least in terms of time 
depth. 

In contrast with Renfrew's thesis there also exists the just mentioned Kurgan 
hypothesis by Marija Gimbutas (the kurgans are a type of burial tumuli diffused in 
Eastern Europe, dating from the Bronze Age onwards). She postulated (1956) that 
Proto-Indo-Europeans originated in the period between the 5th and 3rd millennium 
BC in the steppes near the Black Sea and Caucasus, west of the Urals, where kur- 
gans are found. 

Hans Krahe thought that the situation of Europe in the 2nd millennium BC was 
that of a territory inhabited by more or less small migrant tribes speaking some var- 
iety of IE. He named these tribes *teutds since this word is known with the meaning 
‘tribe, people, ciuitas' in the Celtic, Baltic, Italic, Illyrian, and also Germanic lan- 
guages (Gothic Piuda, etc.). According to Krahe (1964), these IE varieties would 
have left their traces in a more or less homogeneous hydronymy. From these *teu- 
tds originated — via a merging process (“Verschmelzungsprozess’) — the Celts, the 
Balts, the Slavs, and also the Germans (‘Germanic tribes’). Paradoxically, we can 
say that the ‘Germanen’ did not exist before they settled in their ancient settlements 
in Jütland and South Scandinavia, about 1500 BC. As ‘Germanen’ they came from 
nowhere; they became ‘Germanen’ with their specific cultural and linguistic fea- 
tures in the territories previously mentioned and spread south- and westwards later. 


3. Ancient Greece and the East Mediterranean 


In Homeric and classical Greek, there are many loanwords (mainly referring to the 
ecological environment, e.g. Myken. ku-pa-ri-se-ja, CIGr. kypárissos ‘cypress’) 
that testify to contacts with pre-Greek languages. The same word can also be found 
in Latin: cupressus (and Hebrew gofer!). Like many other words of the same kind 
(see, e.g., CIGr. rhódon, Lat. rosa — and Aram. warda, Arab. ward! — ‘rose’) it is a 
word that was adopted from an unknown Mediterranean substrate. As the English 
translations show, these words spread across Europe via Latin. 

In fact, the Roman conquest of (Western) Europe from the 2nd century BC on 
made Latin the international language, which has left a lasting imprint that not only 
concerns loanwords but also morphology and syntactic constructions. Pre-Roman 
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Italy, part of Switzerland (Rhaetia), Gallia, and Spain were almost completely 
latinized. Even the Balkans, including Dalmatia, were under Latin influence for a 
long time, although a prestigious language like Greek offered a stronger resistance 
to complete latinization than the languages in other parts of Europe, such as in 
Spain or Gallia. However, Romanian is beyond any doubt a neo-Latin language, 
even if strongly influenced by the neighbouring Slavs, who arrived in the Balkans 
in the 6th century. 

In the declining centuries of the Byzantine Empire, South Slavs expanded into 
the Balkans in two streams. Between them there was a large non-Slavic population 
of Romance speaking Vlachs so that there was no areal continuity of South Slavic 
people. 


4. Slav(on)ic populations 


Slavic peoples expanded over a large territory, even further than the Balkans. 
Written documents of the 9th, 10th, and 11th centuries prove that Slavonic varieties 
were spoken even in central Buropean territories that no longer belong to Slavonic- 
speaking populations. The arrival of the Hungarians, who speak a Finno-Ugric lan- 
guage, in Pannonia in the 9th century separated South Slavs from the other Slav 
speakers. The connection between West Slavs in present-day Poland and present- 
day East Germany (Pomerania) with South Slavs (Croats, Serbians, Macedonians) 
was broken. Finally, the conquests in Central Europe by Charlemagne (f 814) and 
then by the Emperors of the Holy Roman Empire of the German Nation (Saxon, 
Swabian, Luxembourg dynasties, from the 10th century on) completed the geo- 
graphical separation between these two groups, also severing the connection 
between Slavs in Moravia from those who had settled earlier in present-day Styria, 
Carinthia and Tyrol, as well as in modern Slovenia. 

The common basis for all South and East Slavonic languages is Old Church 
Slavonic, the first literary Slavonic language, which developed from the Southern 
Slavonic dialect of Thessalonika. In the 9th century the missionaries Cyril and 
Methodius translated the Bible and other texts from Greek into that Slavonic var- 
iety and were sent by Byzantium to Moravia to evangelize the Slavs. But a clash 
in Great Moravia (present-day Slovakia, the Czech Republic, parts of Hungary and 
Austria, Slovenia, and Croatia) between the Byzantine Empire and church, on the 
one hand, and the expanding Carolingian Empire with its German bishops, on the 
other hand, stopped the mission of Cyril and Methodius, and in 885 the use of Old 
Church Slavonic in Great Moravia was prohibited in favour of Latin. Thereupon, 
the language known as Church Slavonic maintained a high prestige status in Russia 
and among the other Eastern Slavs. 

As a result of all these splits we have a Western part of the Slavia using the 
Latin alphabet and a Southern/Eastern part that uses the Cyrillic alphabet. In con- 
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clusion, the approximate present-day distribution of Slavonic languages in Europe 
is as follows: 1. South Slavonic languages (Bulgarian, Macedonian, Serbo-Croat, 
Slovene); 2. West Slavonic languages (Czech, Slovak, Sorbian, Polish, Cassubian, 
Polabian); 3. East Slavonic languages (Russian, Belorussian, Ukrainian). 

Speaking of Bulgarian and Macedonian we come back to the Balkans. We have 
to mention the important role Venice played along the coasts of the Balkans and 
beyond. Many Venetian words are still alive in the individual languages of the area, 
up to Crete and Cyprus, given that Venetian was the trade language of the Eastern 
part of the Mediterranean for many centuries. 


5. The Balkans 


The history of the Balkan Peninsula has been heavily marked by the Ottoman con- 
quest, beginning in the 14th century (cf. also the chapters by Hill and Johanson, 
this volume). Contacts between the Ottomans and the Balkan peoples are reflected 
in numerous "turkisms": cf. Turk. asker ‘soldier, troops’ > Bulg. and Serbo-Croat 
asker, Alb. asger, Rum. ascheriu, Gk. askéri; Turk. firtina ‘storm’ (« Ital. fortu- 
nale!) > Serbo-Croat frtuna, Bulg. fortuna. However, there are not only lexical 
borrowings; even calques and derivational morphemes entered the Balkan lan- 
guages, thus demonstrating a deep influence of the Turkish language: the -c/ suffix 
of agent nouns (e.g. Turk. av ‘hunt’ — avcı ‘hunter’) spread to Bulg. -dži (lov 
‘hunt’ — lovdzi ‘hunter’), Serbo-Croat (govor ‘word’ > govordzi ‘chatterbox’, 
Gk. taxi > taxitzés, Rum. lapte ‘milk’ — laptagiu ‘milkman’ (see Banfi 1985: 
§ 3.7.5). After the fall of the Ottoman domination a puristic reaction spread in all 
new national states of the region which generally adopted archaic, educated forms 
(e.g. the katharévousa in Greece). 

Up to now we have spoken of Serbo-Croat because, linguistically speaking, 
there is no major difference between Serbian and Croatian. But we have already 
alluded to the clash between the Carolingian and the Byzantine Empire in Central 
Europe (Moravia) with the consequent adoption of two different alphabets. The 
srpski jezik is the most important representative of the southern and central var- 
ieties of Slav, spoken in Serbia, Montenegro, and Bosnia-Herzegovina, but all cul- 
tivated Serbs are able to use both alphabets. It is indeed difficult to draw a sharp 
dividing line between the Balkan Peninsula and the adjacent areas to the north of it. 
During the crisis of the Ottoman Empire at the end of the 16th century, the Croats, 
being located in the most northern part of the Balkans, became part of the German- 
speaking Central-European Habsburg world. There it was Latin which was held in 
high esteem as the cultivated official language until it was replaced by German, 
which at the same time came to serve as the most important trade language. 
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6. North-East Europe 


Passing now to the oldest (pre)historic language contacts between Russians and the 
Finno-Ugric populations in the northeastern part of Europe, we refer to Stern’s and 
Wälchli’s chapters in the present volume. 

Along with the military and political expansion of the Roman Empire of Ger- 
man Nation, the Hanseatic league played a very important economic and even lin- 
guistic role from the 12th century onwards, especially in Northern Europe and the 
Baltic Sea. The Hanseatic merchants and the Teutonic Knights of the Empire came 
into contact not only with Slavs up to Novgorod (near St. Petersburg) but, even ear- 
lier, with Baltic speaking peoples in Prussia and what are nowadays Latvia, Lithua- 
nia, and Estonia (in the latter country a Finno-Ugric language is spoken). Low Ger- 
man (Nieder- or Plattdeutsch) became the international commerce language: in 
Finnish, kansa (< Hansa!) means ‘people’, ‘Volk’. Even the Scandinavian languages 
exhibit plenty of Low German borrowings. 

We cannot speak of Low German as a lingua franca of the Baltic Sea and even 
less of a koiné based on Low German but certainly the linguistic influence of the 
Hanseatic league was of paramount importance between 1100 and 1500. The fol- 
lowing centuries experienced the economic and political dominance of many dif- 
ferent circum-Baltic (CB) states (Denmark, Poland-Lithuania, Sweden, Prussia, 
Russia) so that the CB area is a real linguistic melting pot. Though every language 
maintains its own profile, linguistic contacts have always played an important role. 
Table 24 of the synthesis article by Koptjevskaja Tamm and Walchli on the CB lan- 
guages (2001: 729—731) gives clear evidence of how relevant features are wide- 
spread among the CB languages and how these similarities (isoglosses) cover 
every language domain, from phonology and morphology to syntax and lexicon. 
Koptjevskaja Tamm and Wälchli speak of contact superposition zones. Thus, for 
instance, the overlengthening of the initial syllable, due to the dominance of initial 
stress, probably of Finnic origin, extended to Estonian, Latvian, Livonian, and 
some Low German dialects. On the other hand, the ‘pluralia tantum’ (nouns that 
lack or have only a very unusual secondary singular form) are probably of IE origin 
but in the CB area they extended to non-IE languages like Finnic. As for syntax, 
Finnic, Estonian, Komi, Veps, Mordvin show a more or less flexible SVO order, 
accompanied by a rigid Genitive + Noun sequence, like the IE languages of the 
area (Russian, Lithuanian, Baltic, Continental Scandinavian). Fairly infrequent 
phenomena are indicative of strong linguistic contacts: case alternation in non-ver- 
bal (i.e. nominal or adjectival) predication of the type ‘Peter is a student’ or “Peter 
is young’ is found in Finnic, Sami, Mordvin, and Komi, but also in Baltic, Eastern 
Slavonic, and Polish (more on this in chapter 17 by Wälchli). 

A different kind of language contact is represented by the so-called Russe- 
norsk, a Norwegian/Russian based pidgin, spoken in the Arctic Ocean by Russian 
traders and Norwegian fishermen during the 18th century (see chapter 20 by Stern). 
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7. West-Central Europe 


West-Central Europe witnessed a series of invasions from the last century of the 
Roman Empire onwards. First arrived the Visigoths (who in 410 plundered Rome) 
and then Attila’s Huns, who, coming from Siberia, invaded European Russia, 
Poland, Germany, the Byzantine Empire, and even the Western Roman Empire. 
But they did not settle in the huge territories they crossed and consequently there 
are no traces of their Mongolian language. The Western Roman Empire ended in 
476. Theodoric’s Ostrogoths invaded Italy in 488 and founded the first ‘Roman- 
Barbaric’ kingdom in Western Europe. Then came the Lombards, who, after a long 
migration from the Elbe river through Noricum (Austria), Pannonia (the region be- 
tween the Danube and the Sava rivers), and Bohemia, entered Italy in 568 and 
ruled (part of) the country until 774 when they were defeated by Charlemagne. 
There was some linguistic impact of the Germanic Goths and Lombards on the 
Latin speaking populations. Culturally and linguistically, they were rapidly assimi- 
lated to the indigenous populations but in standard Italian and the (Northern) Ita- 
lian dialects there are plenty of Germanic words. The It. adjective bellico ‘at war’ 
goes back to Lat. bellum ‘war’ but the It. word for ‘war’ is guerra which clearly 
derives from Germanic *werra (Eng. war, Grm. Wehr etc.). Lat. albus ‘white’ has 
been substituted by bianco from Germanic *blankaz (Grm. blank ‘bright’, Old 
Eng. blanca, Old Norse blakkr ‘pale’ ), a term originally bound to horse breeding. 
The same holds true for Fr. guerre, Span. and Prt. guerra, Fr. blanc, Span. blanco, 
Prt. branco, whereas Rom. räzboi is a Slavonic word (cf. Russian razboj) and 
Rom. alb continues Latin albus. In fact, Gallia and Iberia underwent the same des- 
tiny as Italy. Germanic populations crossed the Roman limes and settled in these 
territories. The Franks, who were to give their name to France, were West Germans 
and they had crossed the Rhine as foederati of the Romans as early as in the late 
Roman Empire. Part of them remained in Germany (see the toponym Franconia, 
Grm. Franken): The original nucleus of Germany and France was formed. The 
Franks who crossed the Rhine were rather quickly latinized whilst those who 
didn’t continued to speak a German variety. The Franks were not the only Germans 
to settle in France; relevant other German tribes include the Burgundians (whose 
name continues in Bourgogne) and the Visigoths. 

We can say that the Germanic element represents a kind of linguistic super- 
strate and/or adstrate in the Romance western countries, especially in France. 
Goths, Lombards, Franks, Burgundians, etc. represented the ruling military class. 
Particularly in France we can speak of Frankish-Romance bilingualism during the 
first centuries of the Franks’ presence. In fact, French is that Romance language 
which has undergone the largest transformations from Latin. 

The famous Strasburg Oaths (842), which are inserted in a Latin narration but 
report the words pronounced by Ludwig the German in the vulgar ‘romana lingua’ 
and, respectively, by Charles of France in ‘teudisca lingua’, are the first official 
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document of a bilingual situation in the core of Europe: “romana lingua [...]: Pro 
deo amur et pro christian poblo ... |...] Teudisca lingua [...]: In godes minna ind 
thes christanes folches ...". The Strasburg Oaths are the first manifestation of the 
*Charlemagne area' and what will become the so-called *Standard Average Euro- 
pean’ (see section 9, below). 

As for the Iberian Peninsula, Vandals crossed the Pyrenees along with Swa- 
bians, Alans, and other tribes at the beginning of the 5th century (i.e. when the 
Roman Empire still existed: the same story as for the Franks). After a period of rav- 
ages and destructions they passed to North Africa, starting out of course from the 
southern part of the peninsula. This part was later called al-Andalus by the Arabs, 
which might well mean 'Vandals' land'. Under the pressure of the Franks the al- 
ready mentioned Visigoths arrived in Iberia (beginning of the 6th century). They 
founded a kingdom which lasted until the invasion of the Arabs (711). The latter 
ruled over large parts of Portugal and Spain for eight centuries, up to the Recon- 
quista of Granada (1492). The Romance variety spoken in the Arab-ruled terri- 
tories is called Mozarabic, a rather conservative form of Romance. The linguistic 
history of Spain is characterized by the strong presence of arabisms, mainly of lexi- 
cal nature. Many "culture words" like alcohol, algebra, alchemy became inter- 
national words via the highly developed kingdom of Granada. Other lexemes, such 
as azúcar ‘sugar’, azafrán ‘saffron’, were also diffused through Europe by the 
Arabs. One can say that along with the Germanic superstrate, Western European 
languages show the presence of an Arabic adstrate, which in Spain rather played 
the role of a superstrate. 


8. The Mediterranean 


When speaking of Arabic, our horizon necessarily includes the Mediterranean. 
In spite of the millennia of trading contacts, it is not possible to consider the Medi- 
terranean a more or less compact linguistic area in the same sense as we can speak 
of a Balkan or Meso-American ‘Sprachbund’. We find a situation much similar to 
the Baltic one. No isogloss embraces all the languages around the sea but there are 
various isoglosses that delimit sub-areas connecting two or three languages. The 
term contact superposition zones introduced by Koptjevskaja Tamm and Wälchli 
for the Baltic region applies to the Mediterranean area, too. Recall the example of 
the definite article in the preliminary remarks (further details in chapter 18 by 
Sansó, this volume). 

The crisis of Rome, resulting in the creation of the *Roman-Barbaric' king- 
doms, and later also the crisis of Byzantium, with the invasion of the Balkans by 
Slavic peoples and the arrival of the Arabs to the African coasts of the Mediterra- 
nean, put an end to the linguistic unity of the area. Before, Latin and the Greek 
koiné had been the official, cultivated languages on all the Mediterranean shores 
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while the indigenous languages (Egyptian, Berber, Gallic, Illyrian, Thracian, etc.) 
either became substrate languages or even disappeared (as was the case of Gallic in 
Iberia and France). 

Malta is a special case in the Mediterranean panorama. Malti (i.e. Maltese), now- 
adays recognised as one of the official languages of the European Union, is the only 
Semitic language written in Latin alphabet and probably derives from a Sicilian var- 
iety of Arabic. It has a Semitic grammatical structure but has substantial borrowings 
from Sicilian and Italian. The British period (1814—1964) and the role of English as 
international language have caused a diffused ‘code mixing’ between English and 
Maltese. Interference of Maltese on Malta English is shown by some constructions 
which are not Standard English: Don't stay walking on the grass, Don't stand in the 
middle instead of Don't walk on the grass and Don't stay in the way (see Brincat 
2003: 373). As can be seen, the superstrate influences on Maltese concern not only 
the lexicon. Words of Romance origin are usually pluralized in two manners: ad- 
dition of Romance -i or Semitic -jiet (for example lingwa, lingwi ‘languages’, from 
Sicilian lingua, lingui and pubblikazzjoni > pubblikazzjonijiet). Even nouns deriv- 
ing from Arabic may have an -i plural along with the Semitic -ijiet. Words of English 
origin are pluralized by adding either -s or -jiet; tojlit (« Engl. toilet), plur. tojlits. 
But, just as for Romance loanwords (see pubblikazzjonijiet), the two suffixes can 
amalgamate: brikksa (« Engl. brick), plural bricks (collective meaning) or brikksiet. 

Most "learned" words are borrowed while those pertaining to everyday life are 
Semitic: cf. Malt. skola, Sicil. scola, Ital. scuola, Engl. school, versus Arab mad- 
rassah; Malt. gvern, Sicil. cuvernu, Ital. governo, Engl. government versus Arabic 
Pukumah; but Malt. ragel ‘man’, mara ‘woman’, dar ‘house’, xemx ‘sun’, etc. No- 
tice the already quoted word for 'saffron': Malt. zaffran. It might have arrived in 
Malta from Sicily (zaffarana) but we are dealing with an Arabic word that had gen- 
eral diffusion all over Europe, via the Arab trade with many European countries. 


9, ‘Standard Average European’ 


We have seen (sections 5 and 6) that the influence of the Germanic superstrate is 
mainly of lexical nature: many Germanic words such as *werra and *blagkaz were 
introduced but the morphosyntactic structure of the neo-Latin languages largely 
remained the same as in late Latin. On the contrary, the influence of the Latin/Ro- 
mance tradition on Germanic languages is not limited to lexical borrowings but con- 
cerns deeper layers of the linguistic structures, namely morphology and syntax. This 
does not mean that some Germanic syntactic features cannot percolate in Romance: 
Pro Deo amur et pro christian poblo follows the Germanic order Determinans + De- 
terminatum, exactly as in the Frankish version, In godes minna ind thes christanes 
folches ...; this word order will be later substituted by the reverse Romance order: 
Pour l'amour de Dieu et du peuple chrétien (cf. Vidos 1959: 240—243). 
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The question of the influence of Greek on Vulgar Latin is a problem not yet 
solved and the discussion of which would go far beyond the scope of this chapter. 
However, we may notice that both koiné Greek and (late) Latin know periphrastic 
verb forms such as dicere habeo, and écho légein (lit. ‘I have to say, ich habe zu 
sagen’ > TIl say, ich werde sagen’; habeo scriptum, écho gegramménon ‘I have 
written, ich habe geschrieben’) whereas Classical Latin and Greek preferred syn- 
thetic future and past forms (e.g. Latin dicam and, respectively, dixi). Compare 
also dico quod/quia and legö höti ‘I say that’ in contrast with the Accusative + In- 
finitive (Ach construction in Classical Latin and Greek (cf. Coseriu 1972). 

We have seen that neither of the periphrastic constructions belonged to the clas- 
sical languages, or if so, only marginally. But both are typical Standard Average 
European constructions (more on this in chapter 15 by van der Auwera). This 
means that the linguistic area where it makes sense to speak of SAE was formed 
after the breakdown of the Latin unity under the pressure of the German invasions 
during the first centuries of the Middle Ages and the Carolingian Empire. Its mod- 
els were basically of Latinate origin. Think of the participial passive construction, 
for instance: Lat. audistis quia dictum est (Matth. 5,21) *You have heard that it has 
been said’;? Germ. kawihit si namo din (Freis. Pater Noster,1) = Old Engl. si bin 
nama gehalgod - Ital. sia benedetto il nome tuo = Fr. que ton nom soit bénit; Old 
Sax. them is that euuiga riki [...] fargeben (Hel. 1303-5] ‘to them the eternal king- 
dom is promised', Dutch ... so wart gesendt dingel Gabriel van gode (Het Leven 
van Jesus) 'the angel Gabriel was sent by God’). The translations of the Latin Holy 
Scripts have moulded many morphosyntactic aspects of the Germanic languages in 
Central and Western Europe. In the Eastern parts of the continent Greek played a 
similar, but not as influential role. 


10. The British Isles 


When the Romans withdrew from Britain (England and Wales) in 410, they left be- 
hind the Gaulish populations who had been partially romanized after the emperor 
Claudius had conquered England and Wales (43 AD). Later (122 and 142 AD) the 
Romans built two fortified valla (> Engl. walls!) against the Picts and Scots, who 
lived in the Northern parts of the island (Scotland) and in Ireland. The Germanic 
Anglo-Saxons coming from Denmark invaded the southeastern part of England 
(the ‘land of the Angli") in the 5th century, while the Celtic Britanni escaped to 
French Armorica, which thereafter took the name of Bretagne (Engl. Britanny). 
Goidelic and Brittonic are the Celtic minority languages in the United Kingdom 
which have survived up to today. Celtic people were gradually pushed towards the 
most western parts of the island (Wales, Cornwall) and the upper North (Scotland) 
whilst Irish (Gaeilge, i.e. Goidelic) is the official language of the Irish republic and 
therefore also an official language of the Buropean Union. 
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The Anglo-Saxon kingdoms were repeatedly attacked by Normans (a collec- 
tive name for Vikings, Danes, Norwegians), who settled in North and East England 
(Northumbria), the so-called Danelaw area. In 1066 Normans arriving from North 
France defeated the last Anglo-Saxon king, Harold Godwinson - which is certainly 
not an Anglo-Saxon name! This was a decisive event in the history of England, 
also from a linguistic point of view. 

The Norman noblemen arriving with their army from the Duchy of Normandy 
in Northern France had a strong impact on the way of life and also introduced the 
medieval French language (langue d'oil). As a result English is the Germanic lan- 
guage with the highest number of Romance loanwords, which were either adopted 
directly from Latin or via French. Anglo-Norman was the spoken language of the 
Norman nobility and was also used to compile official documents and for commer- 
cial purposes. But English survived and eventually eclipsed Anglo-Norman, which 
left almost no trace in English grammatically. 

English toponyms reflect the many contact layers present in this language: a 
Celtic substrate (e.g. Avon ‘river’, Aber ‘river mouth’ e.g. in Aberdeen, hamps ‘dry 
stream in summer' e.g. in Hampshire), a Roman superstrate (chester « castrum, in 
Chesterfield etc., street « (via) strata ‘paved way’, etc.), and a Viking adstrate (by 
‘farm, village’ in Grimsby, Tenby etc., firth in Firth of Forth, Burrafirth etc., ay, ey 
‘island’ in Ramsay, Orkney etc.), along with the proper Anglo-Saxon terms such as 
bourn(e) in Melbourne etc., field as in Chesterfield (a good example of a Latin- 
Germanic compound), wood as in Hollywood. As these examples show, many Eng- 
lish toponyms are compounds formed by simple words referring to the environ- 
ment but there are also many other words which reflect the multi-layered contact 
nature of the English vocabulary. 

Thus English, the leading international language worldwide, has arisen, with 
a rather simple morphosyntactic structure, from a blending of Romance (Latin > 
French) and Germanic elements. The influence of Celtic has to be underlined as 
well, not only in the lexicon: zero marking of the relative clause (the man I saw), 
which is rather unusual among IE languages, is typical of Old and Middle Irish, 
and Old and Middle Welsh (druc digon-it ‘the evil [that] you do’). Preposition 
stranding in relative clauses (the girl I am in love with) is probably derived from 
Celtic, too (Middle Irish y wlat yd hanwyf oheni ‘the land that (yd) you are from 
(oheni)’ (see Roma 2007: 260-261).3 


11. Conclusions 


We have seen that since prehistoric times the linguistic history of Europe has been 
characterized by many migrations of populations, mainly coming from Asia or 
North Africa. Undoubtedly, the Indo-Europeans arrived in Europe from the East 
(or Southeast) and IE languages spread all over Western Europe, replacing the in- 
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digenous languages (with the notable exception of Basque). Traces of the indigen- 
ous languages survived in the form of (mainly) lexical items in the Mediterranean 
area as well as in northern parts of the continent, where Germanic, Slavonic, and 
Finno-Ugric languages are spoken. As far as Central and Western Europe are con- 
cerned, the crucial point was the 9th century (cf. section 3), where the dividing 
lines of an East-West division of the continent are laid. 

Later migrations did not change these lines much: the Magyars, a Ugric popu- 
lation, arrived in Hungary in the 9th century. Hungarian has borrowed many words 
and even some syntactic elements from the German adstrate. In the 11th century 
the Turks conquered Rüm (i.e. Rumenia) and in 1453 the Osman Turks conquered 
Byzantium. Turkish has borrowed plenty of Arabic words via Persian (Farsi). At 
the same time, the Turkish language also had a great impact on the Balkans during 
the long period of the Ottoman Empire. 

Summing up we may say that Europe does not represent an exception in the 
general frame of human history: all parts of the world have experienced migrations 
and contacts of different populations — be they genetically related or not. Really 
isolated languages are the exception. The ‘unmarked’ case in the history of man- 
kind is that of language contacts and mutual influences, which represent a substan- 
tial input for cultural development and enrichment. 

This holds true not only for the past, which has been the subject of the present 
chapter, but also for modern and contemporary times, where Europe is the target of 
many migrating people coming from all over the world. These arrivals are not 
without impact on the European languages. It is fruitless to judge such an impact 
‘good’ or ‘bad’: itis simply what happens and has happened many times in the his- 
tory of our continent. 


Notes 


1 However, balkanisms in Bulgarian and Macedonian are younger than the inherited Sla- 
vonic features, so that a chronological criterion may be important in distinguishing the 
two layers of Bulgarian. More on the Balkan linguistic area, the prototypical example of 
‘Sprachbund’, in chapter 16 by Tomič. 

2 The Greek text has still a synthetic form: bot (‘that’) erréthe, but the Gothic translation 
has already switched to the participial analytic construction: Datei giDan ist. 

3 Preposition stranding is known to Scandinavian languages, too, so that it might be due to 
Viking influence. Alternatively, Viking influence might have reinforced a feature already 
present in Old English (see Roma 2007: 281). On the Celtic influence on the English lan- 
guage see Filppula, Klemola and Pitkänen (2002). 
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3: Conclusion: language consciousness and reflection on language 


1. The Renaissance as a notion of cultural and linguistic historiography 


Since the 19th century the concept of Renaissance, originally restricted to art his- 
tory, has given rise to a great number of heterogeneous interpretations in different 
domains. 

The Renaissance began as a cultural movement in Italy in the late 14th century 
and gradually spread across Europe. Historically, it has to be understood against 
the backdrop of novel developments in society, politics and religion: (i) far-reach- 
ing societal changes after the “demographic and cultural shock of the plague” 
(Marino 2007: 140); Gi) a reconstruction of social relations based on different 
forms of capitalism; (iii) a strong belief in progress; (iv) the “rise of the New Mon- 
archies and the increased national competition by their ruling elites for power and 
resources” (Marino 2007: 141); (v) the schism of the Roman Catholic Church 
caused by the Reformation; (vi) the invention of the printing press, which pro- 
moted the extension of literacy, the expansion of Renaissance ideas and the publi- 
cation of Bible translations. 

The idea of Renaissance as an imitatio of Antiquity was shaped in the age of the 
Renaissance itself (Burke 1966: 2). The term rinascita was employed by Vasari 
(1511-1574), who applied the conception of a threefold movement in history to the 
development of art: maniera antica, maniera vecchia and maniera moderna, the 
latter being preceded in the 13th century by a rinascita of the arts. In this context, 
rinascita encompasses both an imitation of nature and an imitation of the objects of 
art of Antiquity, in which the artist finds the models and rules for his or her own 
creations (Buck 1969: 10-11). 
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The view of the Renaissance as an autonomous period and as an inauguration 
of the modern world is emphasized by the French historian Michelet and continued 
by the Swiss historian Burckhardt (Die Kultur der Renaissance in Italien, 1860). 
Burckhardt’s topics — the Discovery of the World, the Discovery of Man, the De- 
velopment of the Individual, the State as a Work of Art, the Individual as a Work of 
Art, and, finally, the Revival of Antiquity (Burke 1966: 10—15) — were reconsidered 
and modified by other authors. In opposition to Burckhardt, later theories stressed 
a continuous line of thinking from the Middle Ages to the Renaissance, opting for a 
plurality of parameters instead of focusing on individuality (Hausmann 2000: 
11-13). 

A term traditionally associated with the Renaissance is Humanism. The hu- 
manist return to Antiquity was motivated by a return to the ancient conception of 
the human being, with a special emphasis on the dignity of man (Burke 1966: 50). 
The ideal of the uomo universale inspired a revival of learning: The study of 
ancient texts and the imitatio and aemulatio (emulation) of ancient models, not 
only in the ideational realm but also in practical life, were believed to contribute to 
the perfection of the human being. Early treatises on education, ethics, theories of 
government, military science, and history, to name but a few, were studied, evalu- 
ated, and put into practise in everyday life. Language played an important role in 
humanist thought. In the programme of the studia humanitatis, grammatica and 
rhetorica formed the linguistic basis for the other disciplines. The relation between 
Latin and the vernaculars was an important issue in philological reflection. Renais- 
sance scholars held that vernaculars were as suitable for varied and versatile ex- 
pression as the ancient languages, provided that they underwent a process of cul- 
tivation and refinement. As a consequence, a multitude of books dedicated to 
language cultivation was produced: apologetic writings, treatises on orthography, 
dictionaries, grammars, and poetics. In short, Humanism is a lay culture, combin- 
ing the vita contemplativa and the vita activa (Hausmann 2000: 14). 

A more precise notion of the relation between Renaissance and Humanism 
views Renaissance as the superordinate concept encompassing all cultural phe- 
nomena. Humanism, by contrast, can be understood as an intellectual ideology that 
provides the philological foundation for the appropriation of ancient knowledge 
via the study of texts (Hausmann 2000: 14). 

In the framework of linguistic historiography, the notion of Renaissance is 
employed in two distinct ways: On the one hand, it designates a historical period 
whose temporal extension depends on the respective language (e.g. Baugh and 
Cable 2002: 200-252); on the other hand, it is used to refer to a complex cultural 
movement, in which the cultivation of vernaculars is of major concern. 
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2. The ancient languages: models for orientation 


After the Carolingian Reform the Renaissance was the second period of cultural 
revival which turned to Antiquity for inspiration. A new factor in this process 
was the directness of contact with the culture of ancient Greece. The victory of the 
Turks over Constantinople in 1453 exiled a considerable number of Greek 
scholars, who brought with them their knowledge of the Greek language and 
ancient literature. For the first time, the original versions of the texts of Homer, 
Plato, and Aristotle became accessible to scholars in Central and Western Europe. 
By reference to these authors, rhetoric, metaphysics, and ethics were re-concep- 
tualised and re-founded as fields of study in their own right (Hausmann 2000: 16). 

What was even more important for the linguistic development of European 
communities was the influence of Roman culture, mediated by the Latin language 
and Latin texts. Latin and the vernaculars were related by a situation of diglossia: 
While Latin as the idiom of higher prestige was used in official domains, the ver- 
naculars were restricted to informal communication. Nevertheless, the latter grad- 
ually gained more influence in other domains. In Early Modern Europe, Latin was 
used predominantly by four social groups: lawyers, officials, diplomats, and 
travellers (Burke 2004: 44—48). Furthermore, Latin assured the “cohesion of two 
international communities in particular”, namely the Catholic Church and the Re- 
public of Letters (Burke 2004: 44). As a general rule, Latin was employed in the 
Catholic liturgy while the Protestant movement was opposed to its use. However, 
there were also some supporters of the vernaculars on the Catholic side. In any 
event, the introduction of the vernaculars to the domain of the church signified “the 
loss of the sense of universality and continuity” (Burke 2004: 49). In the Republic 
of Letters, Latin was the first vehicle of international communication and opened up 
the possibility of a certain degree of mobility for the scholars. The ability of speak- 
ing and writing Latin gradually spread eastwards. But there were critical debates 
about the use of Latin in the domain of learning as well. Erasmus, Luther, Galilei, 
Descartes, and other important scholars chose the language of their writings ac- 
cording to the respective topic and the envisaged readership (Burke 2004: 52-60). 

In spite of the fact that we can see a certain functional continuity between the 
Latin of the Middle Ages and the Latin of the Renaissance, these two periods still 
represent two different registers of Latin, due to the different textual traditions on 
which they are based. The examination of classical authors is a central element of 
the humanist programme, as it not only signifies appropriation of ancient knowl- 
edge but also of stylistic elegance (Ijsewijn 1981:74—82). A central work to be 
mentioned in this context are the Elegantiarum Latinae linguae libri VI (1471), 
written in Rome and Venice by Lorenzo Valla. 

An important issue in the humanist conception of study is the discussion of the 
concept of imitatio as it was developed by Quintilian (35—ca.96) in his Institutionis 
oratoriae libri XII. Following Aristotle, the author emphasized imitation as an an- 
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thropological notion and as a basic principle of learning. In his view, learners 
should not imitate their models blindly but rather contribute to the progress of cul- 
ture by selecting and improving them. Thus, there are two facets of imitation: in the 
spirit of imitation as a servile copy, close attention is paid to the ancient models; 
in Quintilian's terms, by contrast, emancipation from the model enriches and pro- 
motes the vernacular cultures and languages. This argument can also be found 
in contemporary reflections on translation and on vernacular languages (cf. also 
section 5). 


3. The role of translation 


Translation played a fundamental role in the humanist programme: Not only was it 
considered a didactic tool “in the apprehension of vernacular and Latin grammar 
structures” but also one of the most powerful instruments in the process of appro- 
priating the ancient cultures (Norton 2007: 1374). According to Folena, we can 
distinguish between two types of translation: the “vertical” translation (“un tra- 
durre ‘verticale’”), i.e. the translation from a prestigious language (generally 
Latin) into a vernacular language, and the “horizontal” translation (“un tradurre 
‘orrizontale’”), i.e. the translation from one vernacular into another. Contrary to 
the former type, the latter type lacked prestige in the period of Renaissance (cf. Al- 
brecht 1998: 143-147). 

What are the major points of interest concerning translations in the Renais- 
sance period? First of all, there was, as a new development in the literary landscape 
of Europe: the humanist revival of Greek literature. Texts by Plato, Aristotle, 
Demosthenes, Aeschines, Plutarch, and others were translated into Latin. Latin 
authors translated into the vernaculars were Cicero, Caesar, Tacitus, Titus Livius, 
Sueton, etc. Examples for horizontal translation are Dante, Petrarch, Castiglione, 
Tasso, Montemayor, Alemán, and others. 

The translations of the Bible into the vernaculars are especially interesting, not 
only because of their religious and political consequences but also on account of 
their linguistic impact on the respective speech communities. The important Euro- 
pean Bible translators at the time of the Reformation, such as Martin Luther, Wil- 
liam Tyndale, Jacques Lefévre d’Etaples, and Pierre R. Olivétan, based their trans- 
lations on the Hebrew and Greek Bible according to the philological requirements 
of Humanism, also consulting the Vulgata (Albrecht 1998: 110—138). The diffu- 
sion of their translations (especially in the case of Luther, cf. section 4.5) and the 
large number of texts written in the vernacular languages as reactions on the side of 
the Counter-Reformation contributed to the consolidation of the vernaculars and to 
their introduction into domains which were exclusively reserved for Latin before. 
Translations, both horizontal and vertical, also played an important role in the 
domain of scientific discourse in Early Modern Europe. 


The Renaissance and its impact on the languages of Europe 701 


In the context of the new value that was attributed to the vernacular languages 
during the Renaissance period, authors began to contemplate the nature of trans- 
lation (e.g. Bruni, Vives, Dolet, Peletier du Mans, Du Bellay). Subjects of their writ- 
ings were the diversity of languages, the choice between literal and free translation 
according to the respective text genre, and the enrichment of the vernaculars by 
borrowing and imitation of the ancient texts. A prominent contemporary example 
of an early theory of translation was Martin Luther's Open Letter on Translating 
(Sendbrief vom Dolmetschen, 1530), in which he argued for a free, comprehensible 
translation of the Bible, tailored to the needs of the addressees and the idiomatic 
(i.e. structural, lexical, phraseological, etc.) nature of the target language. 

The outcomes of the extensive translation activities during the Renaissance are, 
amongst others, the following: 


OI The consolidation of the vernacular languages in many speech communities; 

(i) As in any situation of language contact, the mutual influence of the lan- 
guages involved; 

(ii) The renewal or change of historical text forms or discourse traditions 
(cf. Aschenberg and Wilhelm 2003) in accordance with ancient genres as, for 
instance, the dialogue, the ode, the epigram, and — with special respect to Ita- 
lian influence - the Italian sonnet. 


4. The vernacular languages — variation and standardization 


In the wake of the fundamental structural changes of European societies, the inven- 
tion of printing and the expansion of vernaculars into new domains, some initial 
conscious activities concerning language planning and standardization could be 
observed in several communities. According to the sociolinguistic conception of 
Haugen (1972), the process of standardization involves four phases: 


1l. “Selection of norm": selection of a variety serving as the basis for standard- 
ization, the selected variety generally being the language of the most influen- 
tial social group; 


2. “Codification of form": establishment of orthographical, grammatical, and 
stylistic norms; 

3. “Elaboration of functions": enrichment of a language in order to make it 
available for a wider range of functions; 

4. “Acceptance by the community”: efficient use and consolidation of the se- 


lected variety, especially in official domains (Haugen 1972: 110). 


In the following, these aspects of the standardization process will be illustrated 
with examples from different European countries. Given that the impact of the 
Renaissance is considerably more substantial in the Central and Western European 
languages than in the Eastern European languages, the focus will be on the former. 
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4.1. Italy 


The Renaissance in Italy began in the late 14th century and ended around 1600. 
Although Italy was the cultural centre of the European Renaissance, the country 
itself had no political centre. Consequently, it also lacked a standardized written 
language covering the multitude of languages and regional or local dialects spoken 
on the Italian peninsula at that time. Northern Italy, where the Renaissance move- 
ment originated, was divided into a number of city-states and principalities, whose 
courts strongly promoted arts and literature. Except for a period of fourty years 
in the second half of the 15th century, when the five most potent political powers — 
Milan, Venice, Florence, the Papal States, and Naples — coexisted in "relative 
peace and increasing prosperity", the different powers of the Italian peninsula were 
almost constantly involved in wars (Campbell 2003: 412). 

Nevertheless, Italian vernacular literature reached a first peak in as early as 
the 14th century with the texts of Dante, Petrarch, and Boccaccio (the tre corone). 
The rise of the varieties of the volgare was, however, temporarily interrupted by 
the Humanists, who tried to restore the position of Latin in the written language. 
The 15th century (Quattrocento) can be viewed as a period of "creative experi- 
menting” with the volgare (“secolo delle libere esperienze del volgare"), while the 
16th century (Cinquecento) can be regarded as a century of codification (“secolo 
della codificazione") (Tesi 2007: 147). The main options discussed in the debate 
about the variety to be chosen as a basis for the standard language (Questione della 
lingua) were as follows: 


l. La lingua cortigiana, the latinized language of the Northern signorias, associ- 
ated with the koiné padana (a position defended by Trissino and by Castiglione); 

2. The spoken contemporary language of Florence (the option preferred by Ma- 
chiavelli); 

3. The archaic Tuscan language as it had been created and used in the works 
of the tre corone, especially by Petrarch and Boccaccio (the position fa- 
voured by Bembo in the Prose della volgar lingua (1525) and underlying 
his contributions to the codification of Italian morphology in the second 
book of this work). 


Bembo's conception was later adopted by the work of the Accademia della Crusca, 
which was established in 1582/83. The foundation of language academies was an 
important step in the process of standardization in general, as it implements official 
language politics via a public institution, often associated with the rise of a nation. 
In the case of Italy, the process of linguistic unification took a long time. It was 
efficiently continued but not fully completed in the context of the political unifi- 
cation of the peninsula in 1861. The codification of the written language was based 
on the writings of Manzoni. To the present day, the spoken language is marked by 
regional features. The gradual development of a spoken Italian language (italiano 
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parlato) without particular regional (or: diatopic) markers could be observed in the 
second half of the 20th century and is to be considered a new, dynamic factor in the 
continuum of Italian varieties (Koch and Oesterreicher 2007: 313—368; Tesi 2007: 
115-261). 


4.2. France 


The impact of the Renaissance on the other European countries did not only de- 
pend on their contact with Italy but was also contingent upon their respective 
political situations. In France, it is the 16th century that is deemed the epoch of 
the Renaissance. The first half of this period was marked by the reign of Francois I 
(1515-1547), whose monarchy was, to some extent, a precursor of Absolutism. 
A crucial contribution to the linguistic unification of France were the Ordonnances 
de Villers-Cotteréts (1539), in which the king prescribed the use of the “langaige 
maternel francois” in official documents. Francois I, who led a lifelong struggle 
against the Habsburgs in the Wars of Italy, promoted Humanists and employed Ita- 
lian artists at his court (Leonardo da Vinci, among others). The Italian influence on 
the French court was further intensified by the marriage of Henri II (King of France 
from 1547 to 1559) with Catherine de Medici. 

Contrary to Italy, there was no debate in France about the variety to be chosen 
as the standard language. Since the Middle Ages, the Francien, a dialect of the Isle 
de France used at the royal and judicial courts, enjoyed particular prestige among 
the other dialects of Northern France. The Italian impact on French is especially 
evident in the large quantity of loanwords, mainly from the sphere of Italian culture 
(e.g. balcon, arcade, sonnet, madrigal, etc.). In the second half of the century, 
the Italian influence was criticized, particularly the exceedingly italianized way of 
speaking of the courtiers. 

In the course of the 16th century, French gradually replaced Latin as the written 
language by spreading to new domains and disciplines. Religion was of special 
importance in this context. French translations of the Bible were published in 1528 
(Lefévre d'Etaples) and in 1535 (Pierre R. Olivétan). On the Catholic side, a large 
number of texts was published in French as a reaction to Calvin’s pamphlet Insti- 
tution de la religion chrestienne (1541). In the domain of literary language, poets 
(namely the members of the Pleiade) recommended the enrichment of language 
and style by means of translating from Latin and borrowing words from French 
dialects or other languages (Greek, Latin, Italian). Another domain to be men- 
tioned is the medical sector: Ambroise Paré, for instance, who was the founder of 
modern surgery, wrote all his books in French. 

The Renaissance was the first period in the history of French in which scholars 
seriously contributed to the codification of the language at the levels of ortho- 
graphy (Geoffroy Tory; Louis Meigret), grammar (Jacques Dubois; Louis Mei- 
gret), and vocabulary (Robert Estienne). In the 17th century, their work was con- 
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tinued by the Académie Francaise. Founded in 1634 and based on the model of the 
Accademia della Crusca, the French Academy was the all-important institution for 
the codification of Modern French (Lodge 1993; Chaurand 1999: 187—224). 


4.3. Spain and Portugal 


The union of the kingdoms of Castile and Aragon in 1469 marked the point of 
departure for the development of modern Spain and the Spanish Empire. The 
Reconquista of the last Moorish kingdom (Granada) was completed in 1492 - the 
very same year, in which Columbus first set sail for America and Spain began to es- 
tablish its overseas possessions. In the 16th century, Spain intervened in the Italian 
Wars. The most important sovereign of Early Modern Europe was Charles V, 
who bore the titles of “King of Spain" (as Carlos I) and “Holy Roman Emperor" 
(1519-1556) (Campbell 2003: 718-719). 

Since the Middle Ages Castilian had been the predominant dialect in Central 
and, as a consequence of the Reconquista, also in Southern Spain. With the union 
of Castile and Aragon, it also consolidated its position as a koiné language in 
official domains in those regions of Spain in which other Romance languages were 
spoken (Catalonia, Galicia, cf. Lapesa 1980: 285). The political hegemony of 
Spain in Europe contributed to the prestige of its language in other European coun- 
tries. Castilian in its function as the language of government and culture is called 
Spanish. The first grammars for learners of Castilian were published in the first 
decades of the 16th century. 

The ideas of the Italian Renaissance entered Spain from the kingdom of Naples 
(at that time in the possession of Aragon) via Catalonia. The studia humanitatis 
were established in the newly founded universities between the 15th and the 16th 
century. The study and translation of Latin texts affected Spanish texts and the 
Spanish language and led to the frequent use of hyperbata and the present parti- 
ciple as well as the imitation of Ciceronian syntax. However, relatinization had its 
most visible effect on the vocabulary. In the period of Humanism, a great number 
of palabras cultas were introduced into the Spanish lexicon. 

As in France, scholars contributed to the codification of the vernacular by com- 
piling dictionaries and grammars. Antonio de Nebrija, one of the most important 
representatives of Spanish Humanism, wrote the first complete grammar of Span- 
ish according to humanist principles (Gramatica castellana, 1492). In the famous 
prologue to this grammar, he argued for the cultivation of the vernacular in accord- 
ance with the cultivation of the Greek and Latin language. With the idea of “la len- 
gua compafiera del imperio" he emphasized the close connection between the rise 
of a nascent state and its linguistic identity (Gauger 2005). The modern Spanish 
language received authoritative codification by virtue of the work of the Real Aca- 
demia Espajiola, which, following the model of the French Academy, was founded 
in 1713. 
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Similar trends were observable in Portugal, where the impact of Humanism in 
the Erasmian tradition reached its peak during the first half of the 16th century. 
This resulted primarily in a relatinization of the Portuguese vocabulary but also of 
Portuguese syntax and morphology. The most prominent authors of that time (Joao 
de Barros, Luis de Camóes, and António Ferreira) were influenced by Humanism 
(Teyssier 1994: 462—464). 


4.4. England 


The beginning of the Modern English period can roughly be dated to 1500, and the 
English Renaissance to the time between 1500 and 1650 (Baugh and Cable 2002: 
200). In historical terms, the reign of Elizabeth I (1558-1603) is considered to be 
the most important period of the English Renaissance given that it was the period 
during which England emerged as a world power. Three major events marked the 
new political status of England: the defeat of the Spanish Armada in 1588, the es- 
tablishment of the East India Company in 1600, and the foundation of Jamestown 
in 1607. At that time, England excelled not only in literature (with authors like 
Spenser, Marlowe, Shakespeare, and Ben Jonson) but also in music and painting 
(Górlach 1990: 96). The development of the English standard language during that 
period was closely interwoven with the political situation and general cultural de- 
velopments. 

Just as in other European countries, the introduction of printing - the first press 
was set up by William Caxton in Westminster in 1476 — played a pivotal role in the 
process of standardization. The regularization of orthography constituted an issue 
of major interest. An important contribution to this process in the 16th century was 
the treatise Elementarie (1582) by Richard Mulcaster. In it, the author suggested 
spellings for 7,000 frequently used words, basing his reform on “custom or usage” 
(Baugh and Cable 2002: 212). Whereas in the first half of the 16th century there 
was still great variability in spelling, the task of regulating and unifying English or- 
thography was essentially accomplished in the middle of the 17th century (Baugh 
and Cable 2002: 214). 

The most important phonological changes, known as The Great Vowel Shift, 
provoked a fundamental qualitative transformation of all long vowels, which were 
either raised or diphthongized. These developments began in the 15th century and 
ended by the beginning of the 17th century. The discrepancy between the quality of 
the English vowels and their graphemic representation, i.e. the lack of phonetic 
spelling, is due to the fact that the English orthography had been fixed before the 
Great Vowel Shift was completed (Baugh and Cable 2002: 238-239). 

The change and elaboration of the English language during the Renaissance 
pertains mainly to vocabulary. Whereas English grammar “[was] marked more by 
the survival of certain forms and usages that have since disappeared than by any 
fundamental developments" (Baugh and Cable 2002: 240), the extension of the 
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vocabulary was a primary concern of contemporary writers. Both translations from 
Latin and Greek and the use of English in new domains required an extension of 
the lexicon, which came to be realized with great rapidity. Leith and Graddol 
(1996: 142) mention the number of 30.000 new words which were added to the 
English vocabulary in the time between 1500 and 1700. New lexemes entered the 
language from three major sources: borrowing, new coinages, and word formation 
(Górlach 1990: 105). A large part of the new words stemmed from the classical 
languages, especially Latin. With respect to Latinisms, it is often difficult to decide 
whether they have been borrowed directly or "indirectly through French" (Baugh 
and Cable 2002: 226). In any case, the excessive use of Latin words and their lack 
of integration and semantic transparence for the user provoked critical reactions: 
The attacks against the so called “inkhorn terms” culminated around 1550. Subse- 
quently, the opposition to Latin loanwords decreased gradually (Górlach 1990: 
106; Baugh and Cable 2002: 222). Other sources of borrowing were the Romance 
languages (French, Italian, and Spanish) and the “native sources", i.e. the revival 
of archaisms or the borrowing from dialects, mainly in poetical texts (Baugh and 
Cable 2002: 230-232). 

Just as in other European countries, the linguistic codification in England was 
guided by grammars and dictionaries. The first grammar of English, which was 
still completely based on Latin models, was William Lily’s Shorte introduction of 
grammar. Published in 1523, it constituted the beginning of the English grammar 
tradition and was followed in 1586 by the Bref Grammar for English by William 
Bullokar. The latter had a clear “educational purpose" and was conceived as an in- 
strument for the "art of writing", which was oriented towards the design of ancient 
Greek grammar (Leith and Graddol 1996: 149). The first dictionaries compiled 
in Renaissance England were dedicated to the explanation of *hard words" (Baugh 
and Cable 2002: 232). 

The standardization process of the English language continued in the 18th cen- 
tury, most importantly with Samuel Johnson's A Dictionary of the English lan- 
guage (1755), whose main focus was on the regularization of spelling and word 
use. As Baugh and Cable (2002: 249—251) point out, a lot of this work had already 
been done in the Renaissance period. 


4.5. Germany 


The Renaissance in Germany was primarily dominated by the Reformation. 
Though it had long been considered an abrupt, revolutionary phase, situated be- 
tween the Middle Ages and the Modern Period, the Renaissance is now viewed as a 
period of fluid transition from the late Middle Ages to the Early Modern Period 
(von Polenz 2000: 103). Although new institutions were introduced under the 
Emperor Maximilian (1493-1519) in order to reform the organisation of the em- 
pire, Germany remained “a polyfocal realm" (Keller 1978: 340), in which a large 
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number of principalities and independent towns benefitted from self-government. 
The new universities came to be the centres of Humanism, representing an import- 
ant force in contemporary intellectual and religious life. The Reformation and the 
subsequent Peasants' Wars dominated the first half of the 16th century (Campbell 
2003: 327). 

In terms of the history of the German language, Renaissance and Humanism 
were situated in the period of Early New High German, which spanned from 1350 
to 1650. As a consequence of the political fragmentation of Germany, the German 
language of that time was polycentric and its history varied. While dialects domi- 
nated the domain of spoken language, the written language was characterized by a 
coexistence of different chancery languages. Unifying tendencies could be per- 
ceived in the 14th century already. Due to the introduction of the printing press in 
Germany in 1445, the different regional written varieties were increasingly fixed 
by the printers. In the middle of the 15th century, two dialects were competing for 
the status of Common German: While East Central German was expanding, an- 
other, Bavarian-based variety was developing in the Southwest of Germany, claim- 
ing to represent gemeines Deutsch ‘common German’. This was, in very rough 
terms, the linguistic situation at the beginning of the Reformation. 

The Reformation is generally deemed a movement of great impact on the further 
development of the German language. While the writings of its most important 
representative, Martin Luther, did not constitute a completely new beginning, they 
are considered to be an influential continuation and culmination of traditions upon 
which he drew with extraordinarily creative power (Besch 2000: 1717; von Polenz 
2000: 229). The theologian's most important works include a translation of the 
Bible (finished in 1534), which, linguistically, can be considered a fusion of 
written German based on the language of the Saxon chancery and popular style, 
several treatises (amongst others, An den christlichen Adel deutscher Nation and 
Von der Freiheit eines Christenmenschen, 1520) and a catechism. Luther's writ- 
ings were propagated all over Germany and provoked a large number of theologi- 
cal publications in German (tracts, pamphlets, missives, etc.). At the same time, 
Lutheran language reached "every corner of the German speaking countries" 
(Keller 1978: 355). Linguists ascribe his influence on the formation of German to 
his integrative capacities, which enabled him to find well-balanced solutions with 
respect to the different German varieties. This is primarily true for vocabulary and 
phraseology (von Polenz 2000: 234) but also for orthography and morphology 
(Besch 2000). Thus, it is the East Central German Dialect which spread with the 
Reformation as a koiné language in the process of unifying German and which 
gradually replaced Low German in the domain of written language. This process, 
which was already perceivable in the 14th century, was going to intensify in the 
course of the following centuries. 

Just as in the sphere of religion, the humanist elaboration of the German lan- 
guage in different domains of intellectual and practical life predominantly con- 
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cerned the lexicon. As in other European languages, a large number of technical 
terms was borrowed from Latin, especially in the fields of education and scholar- 
ship, law and administration, general science, mathematics, and the arts. Words 
adopted from Italian were military terms and loanwords from French belonged to 
both the domain of lifestyle and military affairs (Keller 1978: 443-463). Some of 
the Spanish loanwords had their origin in the indigenous languages of the Spanish 
colonies in America. 

With respect to literary language, Germany lacked "literary excellence" when 
compared with Italy, Spain, and France (Keller 1978: 353; von Polenz 2000: 215). 
During the Renaissance, German functioned as the language of popular culture, 
while Latin was reserved for high culture (Keller 1978: 352). The period of Early 
Modern German came to an end with two outstanding works in poetry and gram- 
mar, both of which contributed to the standardization of German: Martin Opitz’ 
Buch von der deutschen Poeterey (1624) and Justus Georg Schottelius’ Teutsche 
Sprachkunst (1641). 


5. Conclusion: language consciousness and reflection on language 


The Renaissance as the concept of a cultural movement which began in Italy in the 
late 14th century and spread gradually across Europe had two distinct facets: On 
the one hand, it involved a continuation of cultural ideologies of the Middle Ages 
and, on the other hand, it entailed innovation, which found expression in the dis- 
covery of nature, the human condition and individuality in art and science. Study- 
ing the works of the Greek and Roman Antiquity, scholars elaborated the concep- 
tion of the studia humanitatis. Translation loomed large in these processes: Not 
only did it guarantee linguistic access to the ancient models but it also facilitated 
their stylistic reproduction in the vernaculars. The socio-historical factors under- 
lying these evolutions were major structural changes in the societies, the political 
systems and the religious domain. An event of utmost importance for the European 
cultures was the invention of the printing press, which promoted the extension of 
literacy and the diffusion of Renaissance ideas. 

Despite the diversity of national histories and contemporary political circum- 
stances in the different European countries, their languages were deeply marked by 
the processes indicated above and exhibit a surprising amount of similarities. For 
the first time in history, European vernaculars were documented and regulated 
according to the model of the ancient languages. As shown in section 4, documen- 
tation and regulation emerged from different sources. They could be found in treat- 
ises on orthography, dictionaries, grammars and, finally, tracts about poetry and 
rhetoric. The orientation towards ancient Greek and Latin models had important 
repercussions on the European languages, such as the fixation and elaboration of 
the vernaculars corresponding to the patterns of the classical languages. The im- 
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pact of the classical languages, especially Latin, was obvious on various levels: or- 
thography, syntax, style, and text genres. Undoubtedly, the most important level 
was the lexicon. The ideal of abundance and variability originating from rhetoric 
directed the work of scholars and writers towards the enrichment of the vocabu- 
laries. Borrowing from the ancient languages, lexical transfers among the 
European languages themselves — with Italian enjoying special prestige — and the 
adoption of dialectalisms and archaisms were the principal methods of lexical 
innovation. 

The relation between Latin and the vernaculars constituted an important sub- 
ject of philological reflection and was frequently addressed in the prologues of 
contemporary grammars, dictionaries and other works of language cultivation. In 
addition to these texts, numerous dialogues on language-related questions deserve 
mention. In the tradition of the dialogues of antiquity and following the contem- 
porary model of Erasmus’ Colloquia familiaria (1522-33), the dialogue repre- 
sented the preferred text genre, as it permitted the representation of an informal de- 
bate depicting different perspectives on the subjects discussed (Aschenberg 2005). 
All these texts can be considered as documents conveying the new consciousness 
about and attitude towards language. What were the main topics of these debates? 


OI The ancient Greek and Roman cultures were adopted as models because of 
their perfection in scientific, ethical, artistic, and linguistic matters and in 
different domains of practical knowledge and life. 

(ii) The key notion defining the relation between the ancient cultures and the 
emerging European nations was imitatio, which meant adoption and inte- 
gration of exemplary patterns of the ancient cultures in the process of culture 
formation in Modern Europe. 

(ii) Inthe linguistic domain, Renaissance scholars held that the vernaculars were 
to be elaborated in order to match the ancient languages and their expressive 
capacities: translation, orthographic and grammatical codification as well 
as lexical enrichment were discussed and implemented as necessary prepara- 
tory steps for the European vernaculars serving the communicative functions 
formerly reserved to Latin. 

(iv) Some scholars add to the conception of imitatio the aspect of aemulatio, 
i.e. the idea of surpassing the ancient models. This is an important notion 
in the process of defining the identity of European cultures. With respect to 
Antiquity, it signifies a clear step towards their proper independence and 
prestige. 


Finally, it can be observed that the effect of the Renaissance on the European lan- 
guages is clearly Janus-faced. While imitatio led to codification and enrichment of 
the vernaculars in correspondence to the patterns of classical languages, aemulatio 
propagated emancipation from the ancient models. It is precisely this idea of ae- 
mulatio, addressed more or less explicitly in the writings of Renaissance scholars, 
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which guided the new value attributed to the vernaculars and which opened up the 
way to the accomplishment of codification of the modern European languages in 
the subsequent centuries.! 


Notes 


1 Ithank Reinhold Aschenberg and Raymund Wilhelm for many helpful suggestions which 
have found their way into the final version of this article. 
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40 Multilingual states and empires 
in the history of Europe: 
the Habsburg Monarchy 


Jan Fellerer 


L Introduction 

2. The linguistic situation in individual lands and its legacy 

3. Aspects of bi- and multilingualism among the Monarchy's subjects 
4. Language policy and diglossia 

5. Linguistic nationalism and language contact 

6. Conclusion 

1. Introduction 


The domus Austriae rose to the dominating monarchy of Central Europe from the 
early 16th century onwards. It assumed the title of Austrian Empire in 1804 and 
became the Dual Monarchy after the Austro-Hungarian Compromise in 1867 until 
its dissolution after World War I. The main focus of this survey will be on the late 
imperial era; i.e. the period when existing and newly emerging elites employed 
language as a key component in the projection of national identity among the 
Monarchy’s ethnic groups, and when mass education, wide circulation of printed 
materials, and individual mobility propelled language usage from smaller, often lo- 
calized oral networks into large anonymous speech communities, potentially com- 
mensurate with the province and beyond. The Dual Monarchy comprised 17 lands 
with separate estates and powers, 15 of them “represented in the Austrian parlia- 
ment (Reichsrat)”. The Lands of the Crown of Saint Stephen, including the former 
Principality of Transylvania, and in union with the Kingdom of Croatia-Slavonia, 
constituted the Hungarian half. It was divided into 71 comitats, 30 townships, and 
the sea port Fiume. Across these territories a large variety of different dialects and 
languages was used both in speech and in writing. In the second section of the 
chapter, we will look at the geographical distribution of some of them. The third 
and fourth sections further explore the effects of linguistic diversity, both among 
individual Habsburg subjects and in society at large. The last section introduces the 
concept of linguistic nationalism, an important novelty in the history of Central 
European languages, which did not combine well with the Monarchy's traditional 
linguistic diversity and language contact. 
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2. The linguistic situation in individual lands and its legacy 


Our knowledge of the Monarchy’s linguistic map is based on historical demo- 
graphic data. The Austro-Hungarian authorities conducted extensive censuses 
every ten years between 1880 and 1910. In Hungary and Bosnia-Herzegovina, 
they asked everyone present for their “mother tongue". In the “lands represented in 
the Reichsrat", henceforth Austria, nationals were registered according to their 
“language of everyday use". Unlike Hungary, respondents had to choose from a list 
of nine vernaculars: German, Czech-Moravian-Slovak, Polish, Ruthenian (Ukrai- 
nian), Slovene, Serbian-Croatian, Italian-Ladin, Romanian, and Hungarian. Yid- 
dish and other minority varieties did not figure. Their speakers had to refer to a sec- 
ond language. The census data inform large parts of Wandruszka and Urbanitsch 
(1980, Vol. 3), still the most recent comprehensive survey of the Monarchy's ethnic 
groups and the source of the following geographical sketch.! The corresponding 
political map was redrawn fundamentally when Austria-Hungary was dissolved 
after World War I (cf. Magocsi ?2002: 125-165). 

According to the 1910 census (cf. Die Ergebnisse 1912; A Magyar Szent 
Korona 1912, 1924), Vorarlberg, Salzburg, and Upper Austria were predominantly 
German-speaking and became part of the Republic of Austria following the Treaty 
of Saint Germain (1919). So did Austria "below the Enns", apart from small 
pockets in the north, e.g. near Gmünd. These were awarded to Czechoslovakia. 
Most people in Lower Austria spoke German. The capital Vienna had a sizeable 
Czech diaspora and an important Jewish presence from across the Monarchy. Ger- 
man was also the main language in northern Tyrol. In the neighbouring Alto Adige 
(Etschland?) Ladin was spoken too. The area was awarded to Italy together with the 
Italian Trentino, while the north and east of the province remained within Austria. 

In the three main “inner Austrian" provinces Styria, Carinthia, and Carniola, the 
"languages of everyday use" were German and Slovene. Upper and Central Styria 
were German-speaking. In Lower Styria, the lands between the central Mur and the 
Sava rivers, most people spoke Slovene, except for Maribor (Marburg), Ptuj, and 
Celje, where a larger part of the citizens used German. Following the dissolution of 
the Monarchy, Styria was divided between the new states of Yugoslavia and Aus- 
tria. Carinthia was also comprised of a mainly Germanophone north and a Slovene 
majority around and south of the river Drava. As elsewhere, there were many bi- or 
multilingual areas, for example the Canale Valley around Tarvisio, where German, 
Slovene, and some Friulian were in use. It was acquired by Italy. Otherwise, most of 
Carinthia became part of the Republic of Austria. In Carniola, the majority of resi- 
dents used Slovene. German-speaking minorities were concentrated in the southern 
Kočevje (Gottschee) county and in the capital Ljubljana. After World War II, the 
entire land eventually became part of Yugoslavia (Slovenia). 

The three provinces of the Adriatic “Coastland” were particularly diverse: 
Górz-Gradisca, Trieste, and Istria. Górz-Gradisca was home to Slovene and Italian 
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speakers. Friulian did not feature separately. Respondents in eastern counties, in- 
cluding Gorizia (Górz) countryside, mainly stated that Slovene was their “lan- 
guage of everyday use”, while Italian (Friulian) prevailed along the lower river 
Isonzo towards the Adriatic Sea. The Treaty of Rapallo (1920) first awarded the 
entire province to Italy. After World War I, Yugoslavia expanded westwards and 
left Italy with Gorizia, Gradisca, and Trieste. In Trieste and suburbs, Italians lived 
side by side with Slovenes, Jews, Germans, and Croatians. Istria had a majority 
of Croatian speakers, especially in the centre and on Veglia. Italian dominated 
in some coastal areas, such as the region around Rovigno. In Upper Istria, north of 
the river Dragonja, Slovene was the main language. Istria was first ceded to Italy. 
Following World War II, it became part of Yugoslavia. Further along the Adriatic 
coast, there was Dalmatia, Austria’s southernmost province. It was mainly Croatian, 
with a large presence of Orthodox Serbs — sometimes erroneously called “Mor- 
laks" — in counties such as Knin, and Italians in Zadar (Zara). After World War II, 
the entire province was awarded to Yugoslavia. 

The Lands of the Bohemian Crown were bi- or multilingual, too. About two 
thirds of Bohemia and Moravia spoke Czech. The large Germanophone minority 
settled along the mountainous fringes, the Bohemian Forest, the Ore Mountains, 
and the Sudetes, in southern Moravia, and in some enclaves, such as Brno, Jihlava 
(Iglau), and the Hfebeésko region (around Zwittau). In eastern “Teschen Silesia", 
the Czech speakers lived in the western county of Friedek. Elsewhere Polish was 
the main language. In western “Troppau Silesia", Czech prevailed in the county 
of Wagstadt. Further north, German was used more often. This was also the case in 
cities such as Bielsko (Bielitz) and Opava (Troppau), and among Silesia's Jews. In 
Prague, on the other hand, Jews increasingly opted for Czech. Moravia, Bohemia, 
and Silesia became part of Czechoslovakia, apart from eastern “Teschen Silesia", 
which went to the Second Polish Republic. 

Galicia, the largest crown land in the north-east, first became Polish too until it 
was divided between Poland and the Ukrainian S.S.R. after World War II. Polish 
was spoken in a triangle between the Carpathian mountains in the south, the 
Vistula in the west and north, and the river San in the east. Further east and along 
the Carpathian mountains people used Ukrainian varieties. There were however 
substantial islands of Polish speakers too, including many of Galicia's large Jewish 
population. Most Bukovinian Jews, on the other hand, continued to associate with 
German. The land was particularly diverse, being home not only to Ruthenians 
in the north-west and Romanians in the south-east but also to more recent Polish- 
and German-speaking immigrants as well as to Hungarian Szeklers near Rădăuți 
(Radautz). The Bukovina was first accorded to Romania but became part of the 
Ukrainian S.S.R. after World War II. 

The linguistic demography of Hungary was no less complicated. On the left 
bank of the Danube, the northern comitats had a Slovak majority. Further south, 
closer to the Danube and the river Ipel’ (Eipel), Hungarian prevailed. There were 
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also many mixed areas, e.g. the comitat Esztergom (Gran), and linguistic enclaves, 
such as German dialect speakers in the mountainous Fatra region north of Krem- 
nica. In 1920, the Slovak-Hungarian border was fixed along the rivers Danube and 
Ipel’. The second Hungarian “circle” was the right bank of the Danube or “Trans- 
danubia". It was mainly Hungarian, with important Slovak and German diasporas, 
such as the “Danube Swabians" of the Bakony region and the Vértes mountains 
north of the Balaton Lake. The border regions were not Magyar either: In the 
western Neusiedler Lake area, the Burgenland of post 1918 Austria, Croatian and 
German prevailed. Further south, the Prekmurje and Medimurje on the banks of 
the lower Mur had Slovene and Croatian majorities. Eastwards on the shores of the 
Drava, later the Hungarian-Yugoslav border, many spoke Croatian and Serbian 
with sub-varieties, such as Sokac. 

The neighbouring Bačka comitat lay in the Kingdom's third “circle” between 
the Danube and the river Tisza (Theiss). Linguistically very diverse, the county 
was home to Hungarian, German, Slovak, “Rusin”, Serbian, and other South 
Slavonic speakers. The Bačka south of Subotica (Maria-Theresiopol) became the 
Yugoslav Voivodina. Its north and the other comitats of the Danube-Tisza inter- 
fluve formed part of “Trianon Hungary” after 1920. Most people used Hungarian 
here, though some German and Slovak were used in and around Budapest. The 
capital had a large Jewish population, most of whom declared Hungarian as their 
mother tongue. The fourth Hungarian "circle" was located east of the Tisza and 
north of the Danube. Its south, the Banat, was exceptionally diverse with a Roma- 
nian majority in the east, Serbians in the west and south, and Hungarians, "Danube 
Swabians", and other minorities across the region. North of the Banat and the river 
Maros, there were Hungarian and Romanian majorities. The Tisza-Maros region 
was divided between Hungary and Romania. Western Banat was incorporated into 
Yugoslavia. 

The right banks of the Tisza further north formed the fifth "circle". It was by 
and large Hungarian to the west and north of Oradea (Gro wardein). Further south, 
along the tributaries of the Kórós, the Romanian Crisana extended eastwards. The 
very north-east of the fifth “circle” along the Carphathian mountains and the upper 
Tisza (around Huszt) had a large Ruthenian presence. It continued into the area 
north and east of UZhorod (Unghvar) which formed part of the Kingdom's sixth 
“circle” on the left bank of the Tisza. From UZhorod in the east to the upper Ipel’ in 
the west, the circle's north was predominantly Slovak. Its south, the Carpathian 
foreland and parts of the western inner Carpathians, spoke Hungarian, e.g. in the 
comitat of Borsod and its capital Miskolc. As elsewhere, there were linguistic 
minorities too, such as the Germanophones around Kežmarok (Käsmark) and other 
non-Slovak speakers in the comitat of Szepes: Hungarians, Ruthenians, and Poles. 
The left bank of the Tisza was divided between Hungary and Slovakia. On the right 
bank, “Trianon Hungary" came to extend from the Tisza Knee to the Hungarian- 
Ukrainian-Romanian tripoint north of Satu Mare (Szathmar). 
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The 15 Transylvanian comitats — today part of Romania — had a Romanian ma- 
jority, especially in the countryside. Cities such as Cluj (Klausenburg) and the east- 
ern Székely Land were predominantly Hungarian. Transylvania's “Saxons” settled 
in various areas, such as around Sibiu (Hermannstadt). In Croatia, most people 
spoke Croatian dialects, apart from large Orthodox Serbian areas adjacent to Bos- 
nia-Herzegovina. They extended into Slavonia with Syrmia where, apart from 
Croats, they met speakers of Hungarian, German, and other varieties. Finally, Hun- 
gary's sea port Fiume was not only home to Italians but also to a large Croatian and 
other minorities. Eventually, after World War II, it became Rijeka in Croatia. 

South Slavonic varieties were also used among the Serbs, Bosnian Muslims, and 
Croats of Bosnia-Herzegovina. The province was jointly administered by Austria 
and Hungary since its occupation in 1878. In 1919, it became part of Yugoslavia. 


3. Aspects of bi- and multilingualism among the Monarchy's subjects 


The geographical sketch outlined in the previous section inevitably omits all points 
of detail. In fact, since Czoernig (1855-1857) Austrian and Hungarian cartography 
had gained international reputation on virtue of the actual level of sophistication 
in mapping the distribution of languages. The censuses since 1880 even produced 
detailed linguistic data down to individual villages. At the same time, however, 
they grew more problematic on the account of two other factors. Firstly, they came 
to be strongly biased towards supra-dialectal monolingualism. Particularly in Aus- 
tria, respondents had to choose from a list of nine languages which excluded other 
idioms such as Yiddish, Istro-Romanian, or Russian spoken among Bukovinian 
Old Believers. It also glossed over the fact that speakers may have considered their 
dialect a language, distinct from the pre-national languages to which the census 
form referred them. For instance, some Cieszyn Silesians may have liked to de- 
clare themselves as speakers of (Cieszyn) Silesian rather than Polish (cf. Batowski 
in Wandruszka and Urbanitsch 1980: 528). Secondly, the later censuses did not just 
produce linguistic data. Practice and spirit of their conduct projected language as 
an indicator of ethnic or national belonging (cf. Brix 1982). National conflict and 
increasing segregation were central to the Monarchy's later history and have re- 
ceived ample scholarly attention (cf. Kann 1950, to refer to just one early, influen- 
tial study). 

What we know much less about, and what the politicized censuses since 1880 
with their focus on supra-dialectal monolingualism do not easily reveal either, is 
how multilingualism, still the status quo in many parts of the late Monarchy, did 
actually work in the first place, both on an individual level and in society at large. 
We will return to the latter in the subsequent section. On an individual level, many 
Habsburg subjects lived in bi- or multilingual areas and, thus, needed or wanted to 
gain some knowledge of a language other than their mother tongue. For a thorough 
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understanding of the Monarchy's linguistic history it will be necessary to identify 
these patterns of bilingual competence and the social settings in which they devel- 
oped. At present, not enough empirical studies are available to draw any firm con- 
clusions. Generally speaking, there appear to have been three important types of 
settings: One was individual migration into a linguistically different environment, 
for example into urban centres in search of work or by way of joining one of the 
usually multilingual regiments of the armed forces (cf. Wandruszka and Urban- 
itsch 1987, Vol. 5). Take for instance a soldier recruited to a battalion of the infan- 
try regiment “Grand Duke Ferdinand of Tuscany", raised at UZhorod (Unghvar) 
in 1860, where, according to Hickmann (no date), at the turn of the century he 
would have found 40 % Slovak, 30 % Ruthenian, 20 % Hungarian, and 10 % Ger- 
man speakers. The second type of setting is characteristic of those many towns and 
regions where speakers of different languages traditionally lived in close vicinity. 
Here, it was often necessary or advantageous for individuals on their *daily round" 
to know the other variety of the area. For instance, Moritsch (1991: 49—50) con- 
cludes that people in the well developed but isolated village Vorderberg (Blace) 
in southern Carinthia did not only speak their Slovene dialect. Even if they 
felt clearly different from them, they also knew the local German variety of their 
(Lutheran) neighbours in the Drau and Upper Gail valleys. It could not have been 
otherwise given that rural trade and transport services were major sources of in- 
come for them. Vice versa, the inhabitants of Eisenkappel (Zelezna Kapla), a larger 
market town further west, spoke German with some sense of urban pride. How- 
ever, as their livelihood was dependent on the Slovene-speaking surroundings, 
male inhabitants of Eisenkappel usually acquired some knowledge of that lan- 
guage too (cf. Moritsch 1991: 38). 

It would be misleading though to suggest that local bilingualism, such as in 
parts of southern Carinthia, was an "innocent" linguistic encounter. Firstly, there 
was wide-spread reluctance or even overt hostility towards learning a "lesser" 
local variety among those who thought of their language as more prestigious, 
especially German speakers, but also native Hungarians, Italians and, to a lesser 
degree, speakers of Polish in Galicia. Secondly, bilingualism had become a central 
issue of national conflict by the end of the 19th century. Often sanctioned by central 
government (cf. Stourzh 1994), national movements effectively fought bi- 
lingualism and aspired to create individuals identifiable by one language only. We 
will return to this point in section 5. For now, let us look — after migration and tradi- 
tional language contact — at a third type of setting in which competence in more 
than one language was frequent. These were institutions of secondary and higher 
education, where the middle classes and those aspiring to them trained to engage 
in a suitable career. To do so, they typically studied a second, "prestigious" lingua 
franca, if they did not speak one as their first language, usually up to a high level of 
proficiency. In Hungary this was mainly Hungarian. In Austria it was predomi- 
nantly German, but also Polish in Galicia and Italian in Dalmatia, the Coastland, 
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and Tyrol. To be sure, other languages gained ground since an imperial resolution 
of 20 July 1859 which allowed for the replacement of German as the main medium 
of instruction in grammar schools. From the late 1860s onwards we can generally 
observe increasing linguistic segregation in education at the expense of German 
and bilingualism in general. Still, knowledge of German (or Polish in Galicia, or 
Italian in the south) remained a wanted or unwanted necessity for many of Aus- 
tria's secondary school students who wished to safeguard their career prospects. 
For example, in 1912/13 42.8 % of all secondary schools (incl. Real-Gymnasien) in 
the Austrian half of the Monarchy still employed German as the (main) language of 
instruction (cf. Urbanitsch and Wandruszka 1980, Table 9). 

To illustrate this in more detail, let us return to the 1860s, when the shift away 
from German as language of instruction at grammar schools had just gained momen- 
tum. In his memoirs, the Czech historian August Sedláček (1843-1926) remembers 
the grammar school in the western Bohemian town of Písek, which he finished in 
1863 (Sedláček 1924). Since 1861, more and more subjects were taught in Czech. 
At the same time, he recalls that mathematics and physics lessons were conducted 
by a German speaker until he left school. The classics teacher used Czech one 
month and German the other. History was again taught in German until the ma- 
turita, which, it appears, was conducted in German too. Sedláček soon started 
reading independently about Bohemian history, both in Czech, e.g. the journal 
Památky archeologické, and in German, e.g. F. A. Heber's Böhmens Burgen, 
Vesten und Bergschlösser. Early writings of his own were in German, until, so he 
reports, he decided in 1864 to write in Czech about “Czech things". Sedláček was 
undoubtedly a Czech patriot and disapproved of germanized secondary school 
education. At the same time it cannot be overlooked that he was a quintessentially 
bilingual intellectual. The same is true of his contemporary Emil Dunikowski 
(1855-1924), the Polish traveller and geologist who received his secondary school 
education in the east Galician town of Brzezany until 1873. In his memoirs 
(cf. Knot 1955: 345—382), he remembers with joy the year 1869, when Polish fi- 
nally replaced German as the language of educational instruction. However, even 
if unwanted, years of lessons and reading in German clearly left their imprint on 
Dunikowski's linguistic outlook as he went on to study not only in Lvov but also in 
Vienna and Munich. There was even a third linguistic component. Dunikowski 
fondly remembers the Ruthenian (Ukrainian) songs and prayers of his fellow 
pupils from Greek Catholic families. He even prides himself in having translated 
Ševčenko into Polish. Memoirs are slippery sources. Still, there is no reason to 
doubt that a middle class Slav biography in 1860s Austria — and also beyond — 
often included knowledge of German. Exposure to another local variety, such as 
Ruthenian in Galicia, was more random and reveals the hierarchical character of 
bi- or trilingual competence. 

Our sketch has so far been biased towards the Austrian half of the Monarchy. 
Bilingualism was wide-spread in Hungary too, particularly among the non-Magyars. 
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Unlike Austria, people in Hungary were asked to state their “mother tongue” and 
any other language(s) of the land they knew. Leaving aside semi-autonomous 
Croatia-Slavonia, the number of non-Magyars who were fluent in their native lan- 
guage and in Hungarian doubled from around 11% in 1880 to ca. 22% in 1910 
(Katus in Urbanitsch and Wandruszka, Vol. 3, 1980: 433). This was not only the re- 
sult of politically orchestrated Magyarization since the Tisza era from 1875. After 
all, bilingualism grew, if modestely, among Hungarians too. More importantly, 
the Kingdom’s non-Magyar speakers considered knowledge of Hungarian a vehicle 
of social advancement, not only in cities, as is well-known, but also in other, non- 
urban areas, as has recently been argued for Slovak and German speakers in 
“Upper Hungary” (cf. Puttkamer 2003). In the case of another region, Transylvania, 
patterns of local bilingualism suggest that knowledge of more than one of its main 
three languages, Romanian, Hungarian, and German, served the purpose of mutual 
understanding in the neighbourhood, rather than social advancement as in “Upper 
Hungary’. Thus, it appears that the late Monarchy still had many bilingual subjects 
for different reasons. This is not to deny that national tensions and imperial re- 
sponses raised the stakes against bilingualism. A closer look at the actual facts on 
the ground does, however, seem to be necessary and is still missing in the literature 
on late 19th century Austria-Hungary. 


4. Language policy and diglossia 


Multilingualism in society at large is another crucial aspect in the linguistic history 
of the Habsburg Monarchy. Societal multilingualism tends to result in diglossia. 
This is the complementary and hierarchical distribution of languages across social 
settings in a given speech community. Relevant speech communities of the late 
Monarchy seem to have been political-administrative units (for an introduction 
cf. Wandruszka and Urbanitsch, Vol. 2, 1975), mainly the crown lands and smaller 
communalities, such as counties, towns, and villages. The relevant social settings 
clustered around larger, public domains, notably administration, jurisdiction, and 
education. Domain-specific usage of languages was subject to an ever increasing 
amount of state legislation and provincial laws. At the same time, actual practice 
established norms of language usage of its own. Language policy and factual usage 
may have converged or not, but they formed two sides of the same coin: Together, 
they reflect conflict and share of power between the major political agents: central 
government, national elites, and representatives of social strata. Language policy 
in the late Monarchy has been much written about since the historical time itself 
(cf. Fischel ?1910). Factual language usage, i.e. actual di- and polyglossic prac- 
tices, have been studied less, even though there is rich source material for historical 
sociolinguistic research in archives across all areas of the former Monarchy. A few 
attempts for selected domains and areas can be found in Schjerve-Rindler (2003). 
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Let us, for now, take a look at two domains: administration and the judiciary, with 
more on education in the subsequent section. 

The "December constitution" of 1867 was a crucial moment in the history of 
language policy and diglossic practices in the Austrian half. Its famous Article 19 
guaranteed equal national and linguistic rights to all Austrian citizens — in theory, 
that is. In practice, there were many different ways of realizing the constitutional 
promise. Legal implementation and actual practice did in fact vary significantly 
across lands and domains (cf. Stourzh in Urbanitsch and Wandruszka, Vol. 3, 1980: 
975-1206). However, we can state some general tendencies: Internal matters in 
public administration were not considered subject to Article 19 by central state 
authorities. Thus, they had to be conducted in German, Austria's unofficial state 
language. There were some exceptions, though, where, for political or practical rea- 
sons, the government accepted the use of other languages in internal affairs, notably 
Polish in Galicia since 1869, as well as Italian and, since 1909 exclusively, Serbo- 
Croatian in Dalmatia. In fact, how problematic German as the privileged official 
language had become came to the fore in 1897. The then new head of government 
Badeni initiated ordinances which promoted Czech to join German as the other of- 
ficial language for inner administrative matters throughout Bohemia and Moravia. 
Civil servants, including those in areas with a predominantly German- or Czech- 
speaking population, would have been required to be proficient in both languages 
by 1901 (cf. Fischel ?1910: no. 417-420). The two ordinances met with bitter op- 
position from the Germans. They were soon watered down and then scraped alto- 
gether. German speakers argued that they would exclude them from civil service as 
most of them did not know Czech. However, they did not want to learn it either, 
while they expected their Czech peers to continue acquiring fluent German. 

Unlike internal affairs, public administration did come under the provision of 
Article 19 in dealings which involved members of the public and bodies represent- 
ing them. They had the right to address state authorities and to be addressed 
by them in their native language if it was "customary in the land". The term was a 
matter of frequent dispute and different practice. In Galicia, for example, Polish 
was customary everywhere. Ruthenian enjoyed the same status in eastern counties 
only. As a result, Galician officials used Polish rather than Ruthenian for internal 
affairs, as a study into language choice in Galicia's public administration suggests 
(cf. Fellerer 2003). Oddly, however, there were hardly any Ruthenian documents to 
members of the east Galician public either, even though on 11 June 1869 the Gali- 
cian government explicitly implemented Article 19 to the effect that Ruthenians 
were entitled to deal with state authorities in their language (cf. Fischel ?1910: 
no. 330). It appears that law and reality did not coincide. It also seems that the gap 
affected particularly those varieties which were often considered dialects rather 
than "historical" languages in their own right. Ruthenian and, for example, Slo- 
vene would have fallen into this category, irrespective of their actual development 
as written standard languages. 
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The constitutional Article 19 extended to the second great branch in Austrian 
administration too. Parallel to the state authorities, this was a hierarchy of auton- 
omous governing bodies in each crown land, from the executive body of the pro- 
vincial diet down to town and village councils. They also had to deal with the pub- 
lic in their language if it was "customary in the land". One of the many legal 
disputes about the term reported by Stourzh (in Wandruszka and Urbanitsch, Vol. 
3, 1980: 1074—1076) erupted in 1903 in northern Bohemia. It involved the execu- 
tive committee of the county assembly of Bóhmisch Leipa (Ceská Lípa) and the 
council of Semily, a city located in a neighbouring county. The committee admin- 
istered in German and refused to accept a representation in Czech from Semily. 
They argued that Czech was not "customary" in their county. The city council 
countered that Czech was "customary" all over Bohemia. The dispute ended up at 
the Supreme Administrative Court. It decided in favour of Semily, based on the ar- 
gument that, in fact, there was a very small Czech speaking community in Ceská 
Lípa. Thus, Czech had to be considered "customary" in the county. Another thorny 
issue was the language of internal affairs at autonomous authorities. They were 
entitled to establish a variety of their choosing, a legal void which proved problem- 
atic in bilingual areas, and which prompted late attempts at compromises in Mora- 
via (1905) and Galicia (1907). 

We encounter a similar void in parts of the judiciary: Criminal courts could 
administer inner affairs in any language of their liking. Civil courts, on the other 
hand, remained subject to the Josephinian and “West Galician" Court Regulations 
from the late 18th century (cf. Fischel ?1910: no. 45, 89). They stipulated that 
proceedings had to be conducted in “the language customarily used in the land” or 
“in the courts of the land". Not only was the term “customary” problematic. The 
rise of oral and public proceedings considerably shifted the boundary between in- 
ternal affairs and dealings with the public. The latter were subject to Article 19. 
It took a long trajectory of more detailed pre- and post-1867 regulations until a 
legal miminum emerged: It granted Slav and, in the Bukovina, Romanian speakers 
the right to have parts of a court case conducted in their language, rather than in 
German, or in Polish at Galician courts since 1869. In the Costland and Dalmatia, 
the provision should have restricted the use of Italian. Yet, an empirical study into 
the actual usage of languages at the Provincial Court of Trieste between 1850 and 
1918 suggests that reality did not live up to this legal minimum. Even though there 
were many Slovene speakers in and around the city, their language features very 
rarely in documents from or to individual accusers, defendants, and witnesses 
(cf. Czeitschner 2003). Again, it appears that language usage fell behind a more 
liberal legislation. 

In Hungary, language policy and usage took a very different shape altogether. 
This had to do with the “Nationalities Law” of 1868 in particular. It stated that all 
citizens, irrespective of their ethnic background, form part of the Hungarian 
nation. As a result, and in sharp contrast to Austria’s Article 19, language laws 
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could not be constituionally enshrined as national rights. They were mere regu- 
lations. Yet, on this account the Law was exceptionally explicit. It contained a list 
of 29 paragraphs which specified language usage for official purposes. Hungarian 
was the state language. The other varieties could be used on certain conditions. For 
example, if requested by one fifth of the assembly of a comitat or township, official 
records had to be kept in Hungarian and a second language. If two assemblies used 
the same second language they could correspond with each other in that variety or 
in Hungarian. Members of the public were entitled to use their language when they 
had dealings with the local judiciary or administration. Village councils chose their 
own language for internal affairs. This selection illustrates that the Law of 1868, 
together with two further articles granting church autonomy, created a distinctly 
liberal regulatory climate in Hungary (without Croatia, where other laws applied). 
Yet, de facto there was strong pressure in all domains to use Hungarian rather than 
any of the other varieties, notably Slovak (cf. Evans 1998: 415—422). More work, 
especially of a more empirical kind than Gogolák (in Wandruszka and Urbanitsch, 
Vol. 3, 1980: 1207-1303), is necessary to properly understand the linguistic situ- 
ation in post-1867 Hungary. 


5. Linguistic nationalism and language contact 


The previous sections illustrated that hierarchical multilingualism, with German 
and Hungarian as privileged (quasi-)state languages, was the status quo in many 
parts of the Habsburg Monarchy. This sat uneasily with a major new political prin- 
ciple in European history since the French Revolution, namely nationalism. It held, 
and still holds, that the "political unit and the national unit should be congruent". 
In Central Europe, that unit received a distinctly cultural definition. At its heart was 
language. “The first and most important concern has to be our nation, our language", 
Karel Havlíček, the Czech journalist and political activist, exclaimed in April 1848 
in an editorial for his newly founded Národní Noviny. 19th-century linguistic 
nationalism poses a range of questions: from the appropriate English translation of 
Havlíéek's term národnost ‘nation’, via widely studied topics in the actual politics 
and social spread of Central European national movements (e.g. Hroch 1985), to 
crucial matters of ideology, such as whether nationalism was a necessary prerequi- 
site for modernization or not. In linguistic history, we can identify two major trends: 
National activists campaigned for linguistic segregation, either with some support 
from an appeasing, power-weary government as in the Austrian half, or against it as 
in Hungary. Simultaneously, language planners worked towards turning vernacu- 
lars into standard languages and standard languages into national ones. 

Education in particular became an important vehicle of linguistic segregation in 
the Austrian half since the 1860s. In paragraph 3 of Article 19, the 1867 Constitu- 
tion stated that in schools there could be no obligation to learn another language of 
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the province. Initially, this was a concession to the German speakers of Bohemia 
who wanted to make sure they did not have to learn Czech. Eventually, the para- 
graph was used by nationally inspired political bodies across Austria to curb bilin- 
gual education. Children should “belong to one language” and, thus, to one nation 
only — to expand on a phrase by the Romantic Friedrich Schleiermacher. For 
example, a study by Newerkla (2003) has shown that there was a period in the 1860s 
when Czech and German were both languages of instruction at the Czech and the 
German grammar schools of Plzeň. Bilingual education had to be discontinued at 
the German Gymnasium following a decision of Plzen’s city elders in 1869, which 
was based on Article 19. Already three years earlier, the governorship in Prague had 
ordered that the Czech Vyssf realní škola of Plzeň must settle for one language of 
instruction, too. It is true that bilingual education often served as a Trojan horse 
to shift towards German or Hungarian in secondary education. However, since 
the 1860s there were also genuine attempts at bilingual schooling. They often had 
the support of parents, for example in Plzeň. After all, proficiency in two languages 
was conducive to their children's career prospects in the city and beyond. National 
activists, on the other hand, saw dual linguistic affiliation as a threat to their cause. 
Recent research has revealed intriguing contrasts between their increasingly suc- 
cessful attempts to “nationalize” education (cf. Stourzh in Urbanitsch and Wan- 
druszka Vol. 3, 1980: 1124-1147) and support from local populaces to keep bilin- 
gual schools in post-1867 Austria (e.g. Judson 2006: 19-65). In the Hungarian part 
of the Monarchy, emphasis on Hungarian as the state language stood in the way of 
linguistic segregation. More detailed work is needed though to assess the character 
and extent of the so-called “Magyarization” since the 1870s. After all, church au- 
tonomy still afforded some degree of linguistic independence to Hungary's Ortho- 
dox Serbs and Romanians in particular, e.g. in their confessional schools. 
Another nationally inspired trend of the 19th century was the final shift away 
from “old”, foreign lingue franche, such as Latin or German, towards new vernacu- 
lar based standard languages. Even though developments differed considerably, we 
shall look at Czech as a typical example. Auty (1979) distinguishes three periods. 
First, enlightenment brought scholarly interest in the vernacular and its origin. 
Josef Dobrovsk$'s Czech grammar of 1809 (21819) came to be the most influential 
fruit of that period. Essentially, it formed the basis of the Czech linguistic revival 
which fed into a second, romantic period. Here, two main factors came to the fore: 
demands for the revived standard language to be afforded the same prestige 
and breadth of usage as the “old” lingue franche, together with further lexical and 
stylistic elaboration. An outstanding example is Josef Jungmann's five-volume 
Czech-German dictionary (1834-1839). Codification and elaboration were pro- 
cesses which carried on throughout the 19th century. The tasks were different from 
language to language. For example, at a time when Jan Gebauer could authori- 
tatively discuss the rules of Czech orthography and inflection in his monograph 
Pravidla českého pravopisu (1902), Galicia’s Ukrainians had had only three dec- 
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ades of agreement on using a vernacular based written language, rather than hybrid 
Church Slavonic, Russian, or Polish. Beyond the differences, there are, however, 
interesting convergences too. 

The Slavonic language planners in the Habsburg Monarchy often knew of each 
other’s work and engaged in cooperation or dispute. An early example is a con- 
certed effort by the ministry of justice in Vienna to codify some legal terminology 
for the Slavonic languages of the Monarchy. In March 1849 a law was passed 
which stipulated that imperial decrees had to be published not only in German 
but also in all the other “customary languages” of the Monarchy. To that end, the 
ministry convened a “Commission for Legal-Political Terminology”, which was 
active between August and November 1849. Beyond this particular initiative, it is 
the Czech, Slovak, Slovene, and Croatian “intellectual vocabulary” more generally 
where we can discern lexical convergence as a result of exchange among the Sla- 
vonic languages of the Monarchy. Its precise genesis may be difficult to ascertain. 
In a number of cases Czech will have served as model, e.g. Czech dükaz, Slovak, 
Slovene, and Croatian dokaz ‘proof’, Czech, Slovak noviny, Slovene and Croatian 
novine ‘newspaper’, and časopis ‘journal’ in all four languages as registered in 
dictionaries between the 1880s and 1920s (cf. Thomas 1992). Another specific 
“Habsburg” layer was made up of German calques and loans, such as Czech pech, 
Strüdl, špitál, Slovak pech, strüdl'a, špitál, Croatian peh, štrudl, Spital, Slovene 
peh, štrudelj, špital for German Pech ‘bad luck’, Strudel ‘strudel’, Spital *hospi- 
tal’. Thomas (1997) identifies 143 such lexemes which entered all or some of the 
four languages under discussion. They are now often non-standard and colloquial 
or obsolete altogether. This is partly due to the fact that they became the target of 
purist tendencies. It is true that such tendencies were neither restricted to the new 
and revived standard languages of the Habsburg Monarchy nor a novelty of the late 
19th century. It does seem, though, that they took on a particularly fervant char- 
acter in the area and period discussed in this paper, such as in the form of a special 
commission convened by the Matice česká in 1877, or as an ongoing search for the 
true Romance character of the Romanian language in Transylvania. Theory and 
practice of linguistic standardization, together with a certain degree of lexical con- 
vergence, are the two areas where we can probably speak of specific “Habsburg” 
features, shared not only by Czech, Croatian, Slovene, and Slovak, but also by 
other languages, notably Hungarian, Galician Polish with its administrative style 
under heavy Austrian German influence, and Austrian German itself, with Sla- 
vonic and Hungarian culinary loans, such as Palatschinke (Hungarian palacsinta) 
‘pancake’, Buchtel (Czech, Slovak buchta) ‘filled sweet bun’. The ambitious ques- 
tion of a Central European “Sprachbund” does not belong here as it takes us well 
beyond the temporal confines of the Habsburg Monarchy. 

We have so far been concerned with standardization. We must now return 
to Auty (1979) and a third period in the linguistic history of the late Monarchy. The 
new or revived standard languages had to be implemented of course. A major role 
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in this process must have been played by the rapid growth in elementary school 
education since the 1870s. Take for example Croatia-Slavonia, where, according to 
Suppan (in Wandruszka and Urbanitsch, Vol.3, 1980: 704), the number of public 
elementary schools almost doubled from 673 to 1189 between 1874 and 1888. They 
were important places for the implementation of new linguistic standards. Little, 
however, is known about this process, notably about the language and the content 
of the textbooks used, about curricula and the actual level of achievement in teach- 
ing linguistic norms. Finally, it would be interesting to see how their implemen- 
tation helped to understand language as a national attribute. Turning, say, a Lower 
Austrian peasant into a German and a Moravian small town dweller into a Czech 
was after all dependent on imagining standard language as national language. 


6. Conclusion 


The history of the late Habsburg Monarchy has fascinated scholars and writers ever 
since its dissolution after World War I. National and linguistic conflicts, paired 
with an ever growing strive in Austria towards solving them by way of “ethnic 
attribution” (Stourzh 1994) and segregation, are crucial aspects of the Monarchy’s 
slow demise since 1848. They have been explored in a large body of excellent his- 
torical research. Linguists concentrated on individual languages and their stan- 
dardization since the 19th century. An area where, it seems, much less work has 
been done is the question of how these new standards were actually implemented. 
The view beyond individual languages is rare too: Which forms did individual 
bi- and multilingualism take, and what were the di- and polyglossic realities behind 
the elaborate legal framework? These may be possible topics for future research. 
They should help to better understand the workings of multilingualism and the his- 
tory of individual (East) Central European languages in the 19th century. 


See also the following chapters in this volume: 5 by Hansen and 23 by Breu. 


Notes 


1 An ethnographic map of Austria-Hungary can be found at the end of this volume. 

2 Political districts are referred to by their historical title; towns and regions by their con- 
temporary name. German parallels are provided where they facilitate locating places on 
a historical map, such as the one referred to here. None of this carries symbolic meaning. 
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1. Introduction 


The multilingual Ottoman Empire was dissolved in 1923, more than six centuries 
after its foundation in 1299. Its successor state, the Republic of Turkey, was officially 
proclaimed on October 29, 1923. The following remarks concern the linguistic con- 
sequences of the gradual decline and dissolution of the Ottoman state from the per- 
spective of European history. The article will sketch the historical background, pro- 
vide some examples of the subsequent development of Turkish in and outside Turkey, 
and briefly comment on some non-Turkic varieties of the post-Ottoman period. 


2. The rise and progress of the Ottoman Empire 


The Ottoman Empire owed its name to Osman, the founder of the Ottoman dynasty, 
who extended the frontiers of the original settlement towards the edge of the Byzan- 
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tine Empire and established its first capital in Bursa. After Osman’s death in 1326, 
Ottoman rule began to extend over the Eastern Mediterranean and the Balkans. The 
first stage of the conquests falls into the second half of the 14th century. Thrace, Bul- 
garia, Macedonia, and Thessaly were conquered, whereas vassal status was imposed 
on Byzantium, Serbia, Walachia, and much of the Peloponnese. After a temporary 
loss of part of the territories at the beginning of the 15th century, the victories con- 
tinued. Byzantium was defeated, and Constantinople was seized on May 29, 1453, 
by Mehmed II “the Conqueror". Serbia and the Peloponnese were annexed. Epiros 
(a region straddling today’s Greek-Albanian border), Albania, Bosnia, Hercegovina, 
and much of Croatia were acquired. By the mid-16th century, the Ottomans had con- 
quered most of the Balkans except for its northwesternmost part. At the height of its 
power (16th-17th centuries), the Empire extended over three continents, controlling 
large parts of Southern Europe, the Middle East, and North Africa. It comprised 29 
provinces and the tributary principalities of Moldavia, Transylvania, and Walachia. 
Its residents covered a multitude of ethnic and linguistic groups. 


3. The decline and fall of the Ottoman Empire 


The decline proceeded gradually, eventually leading to the loss of huge territories. 
Crimea was lost in the Russian-Turkish war of 1768-1774. The first parts of 
the Empire to declare their independence were Greece (1829) and the principalities 
of Serbia, Montenegro, and Walachia (1875). After the Russian-Turkish war 
(1877-1878) independence was granted to Serbia, Romania, and Montenegro, and 
autonomy to Bulgaria. The administration of Cyprus was assumed by Great Britain 
in 1878, and the Austro-Hungarian Empire annexed Bosnia and Hercegovina in 
1908. With the Balkan wars (1912-1913), the Ottoman Empire lost its Balkan ter- 
ritories except for Thrace and the old capital Edirne (Adrianople). The Caucasus 
campaign against Russia in World War I ended disastrously: Parts of present-day 
Georgia, Armenia, and Azerbaijan, which had been taken by the Ottoman forces in 
the final stage of the campaign, had to be given up. The consequences of the terri- 
torial losses in North Africa and the Middle East were equally fatal, though irrel- 
evant for the subject of the present survey. 

World War I led to the dissolution of the Austro-Hungarian Empire and the 
establishment of the kingdom of Serbia-Croatia-Slovenia, which was also joined 
by Montenegro. The occupation of Izmir by Greece in 1919 triggered a national 
Turkic resistance movement that gained control over Anatolia. At the end of World 
War I, the Allies occupied Istanbul and the Ottoman government collapsed. The 
following Treaty of Sévres was superseded by the Treaty of Lausanne, which led to 
the creation of the Turkish Republic and acknowledged the British sovereignty 
over Cyprus. In conclusion, an impressive array of new countries emerged from the 
remnants of the Ottoman Empire. 
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It is clear from this brief sketch that Ottoman Turkey was, from the beginning, 
partly a European state. The Ottoman rule put an end to the weak, fragmented, and 
politically unstable state of affairs in its European territories and shaped them as a 
geocultural entity. 


4. Ottoman Turkish 


Ottoman Turkish (Osmanlica), which belongs to the Oghuz or southwestern 
branch of the Turkic language family, had been established in the 14th century as 
the official language of the powerful Ottoman state. It became the administrative 
and literary language of the Empire. 

The first Oghuz Turks in Anatolia, the Seljuks, had used Arabic and Persian as 
their written languages for administrative, scholarly, and literary purposes. The use 
of Turkish as a written language started during the 13th century in the small prin- 
cipalities — in centres such as Aydin, Balikesir, Kastamonu, Konya, Kütahya and 
Sivas — and continued in the first Ottoman capitals Bursa and Edirne. It is typical 
that the famous Jelaleddin Rumi of Konya wrote his poetic works in Persian, 
whereas his son, Sultan Veled, propagated the Sufi teachings of his father in Tur- 
kish in order to reach less educated social layers (Johanson 1993). 

A new period in the history of Turkish started after the conquest of Constan- 
tinople. When Turkish-speaking groups entered the city in 1453, new linguistic 
varieties were added to the existing ones (Greek, Armenian, and others). The city 
became the dominating urban centre of administration and culture of the Empire. 
The speech habits of Istanbul came to serve as the base for the most important and 
elaborate language in the Turkic world. 

A High Ottoman variety, often called ‘eloquent’ (fasih) Turkish, became the 
written medium of the Empire's administration and literature. It was a Turkish var- 
iety with a genuinely Turkic grammatical structure, but written in Arabic script and 
overloaded with Arabic-Persian lexical borrowings symbolizing the high status of 
the language of the Empire. It differed considerably from the everyday spoken Tur- 
kish of the period and was unintelligible to less educated members of the society. It 
was used for the elaborate high style of classical Ottoman literature during the 
period from the 15th to the 19th century. The poetic works written in this variety 
followed the metrical system of classical Persian poetry (Johanson 1994). Accord- 
ing to a traditional classification, there was also a ‘middle’ (orta) variety of Otto- 
man Turkish and a ‘rough’ (kaba) variety, typical of the lower classes (Johanson 
1989). 

Ottoman scholars showed little interest in documentation and cultivation of 
Turkish, except for its Arabic-Persian components. The first Turkish grammars 
were written by non-natives. 
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5. Standard Turkish in post-Ottoman Turkey 


The post-Ottoman Republic of Turkey was founded as a nation-state based on a 
common Turkish culture and language. The term Republican Turkish has been em- 
ployed to distinguish the language from Ottoman Turkish and other Turkic lan- 
guages. This designation is, however, inadequate since several Turkic republics 
exist today and Turkish is also spoken outside of Turkey. 

How did modern standard Turkish arise and develop? From the middle of the 
19th century onwards, Ottoman Turkish, its immediate precursor, had developed 
into the so-called New Ottoman phase, influenced by Western-oriented literature 
and journalism. After the dissolution of the Empire the character of the state lan- 
guage changed drastically, in particular as a result of the language reforms initiated 
by Mustafa Kemal Atatürk, the founder of the new national and secular state. The 
Arabic script, which was identified with Islam, was replaced by a Roman-based 
script (1928). Under the patronage of Atatiirk a Society for Research on the Turkish 
Language (Tiirk Dili Tetkik Cemiyeti) was established in 1932 and later reorganized 
as the Turkish Language Association (Tiirk Dil Kurumu). The association became 
an independent body in 1951 and was made a governmental body again in 1983. 

The original objective of the language reforms was to ‘nationalize’ the language: 
to create a new variety of Turkish that clearly reflected the post-Ottoman national 
and secular identity and to bring the written language closer to the spoken language. 
The tool for implementing this was a specific concept of linguistic purism. The pri- 
mary task was to replace Arabic-Persian loans with what were considered to be 
native equivalents, i.e. revived older words and words derived from real or alleged 
Turkic roots. The sudden changes shaped a radically different so-called “genuine 
Turkish’ (Oztiirkce), abundant in neologisms. It is a significant fact that even Ata- 
türk's own Nutuk, his well-known speech to the new Parliament in 1927, had to be 
translated into more modern versions of Turkish again and again since the previous 
vocabulary became obsolete at an increasingly rapid pace. Older and younger 
people still differ in their lexical choices, and politically conservative groups still 
tend to use more old-fashioned words (for a critical evaluation, see Lewis 1999). 

Istanbul Turkish, the language of the old capital, has continued to set the tone in 
the country in about the same way as Paris French has in France (Deny 1955: 11). 
The official language is now spoken as a first or second language by almost all resi- 
dents. 


6. The spread of Turkish 


The influence of Ottoman Turkish as the official language of the Empire spread as 
the realm expanded. As a spoken language, however, it was introduced by colon- 
ists and confined to a few areas of the Empire, in particular to the first conquered 
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regions in the Balkans. The first Turkish colonists in Europe inhabited parts of 
Thessaly and Macedonia. Since the Balkan Turkish regions were colonized from 
Anatolia, their dialects show traces of their Anatolian origin. They are mostly di- 
vided into a West Rumelian group, spread over western Bulgaria, Macedonia, and 
Bosnia, and an East Rumelian group, represented in other parts of Bulgaria (Né- 
meth 1956). 

Turkish was a native language in some regions, and a second or foreign lan- 
guage in other regions. The expansion proceeded in several phases. Turkish was in- 
itially introduced into previously non-Turkic-speaking areas, transported by the 
forces of colonialism. Further settlement of native speakers created more stable 
situations and a certain leveling of the varieties introduced. The original dialects 
were gradually nativized, losing their foreignness. 

The subsequent development sometimes led to a shift to Turkish by speakers of 
indigenous languages. Certain Turkish-speaking groups in the Balkans were Is- 
lamicized and subsequently Turkicized indigenous populations. For example, a 
large group of Albanians shifted to Turkish as their first language, too. New lin- 
guistic norms were established via nativization processes in which both settlers 
and shifters participated. The results differed widely according to the varying so- 
ciopolitical conditions. 

All varieties of Turkish spoken outside Turkey display contact phenomena, es- 
pecially in the lexicon. Some of them are more heavily influenced by non-Turkic 
languages. Others show less impact, often due to the relative isolation of the 
speaker communities in mountainous regions and endogamy practices. For in- 
stance, the speech habits of the nomadic (yürük or yörük) groups in the Republic of 
Macedonia and in Greece differ less from varieties spoken in Turkey. According to 
their traditions, other Turkish-speakers in Macedonia are Turkicized indigenous 
groups from which they clearly distinguish themselves (Friedman 2006). Some of 
the Turkish varieties that display exceptionally deep contact-induced features may 
well have emerged as creoles. 


7. Post-Ottoman Turkish varieties outside of Turkey 


The dissolution of the Ottoman Empire entailed a decrease of the influence of Tur- 
kish outside of Turkey. However, there are still millions of native speakers of Tur- 
kish in other countries, representing varying degrees of maintenance of the lan- 
guage. Turkish-speaking minorities exist in several countries that formerly 
belonged to the Empire, e.g. in Cyprus, Bulgaria, Macedonia, Kosovo, Greece, Ro- 
mania, and Serbia. 

A late development not directly connected with the main topic of the present 
survey is the migration of so-called guest workers from Turkey to Western Europe, 
which began in the 1960s and transplanted millions of first-language speakers of 
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Turkish to Germany, France, Great Britain, Benelux, Scandinavia, and other coun- 
tries. Though not all immigrants in these linguistic groups speak Turkish with 
native fluency, a considerable part of them have retained it as their first language. 
The future of Turkish in Western Europe is highly uncertain. 


8. Mass exodus of Turkish-speakers 


The political events before and after the dissolution of the Ottoman Empire 
changed the linguistic situation in the old European - the so-called Rumeli — prov- 
inces. The last decades of the Empire and the first decades of the Republic of Tur- 
key saw continuous waves of mass immigration of Turkish-speakers from the 
Balkans. Large groups of Turks living in Greece, Bulgaria, Yugoslavia, and Ro- 
mania left their homes. Within a relatively short period, the linguistic structure of 
the Balkans underwent profound transformations. 

The exodus of Balkan Turks had already begun after 1878. With the Ottoman 
withdrawal as a result of the Balkan Wars of 1912-1913, the emigration increased 
due to strong pressures on the Turkish-speaking population. 

In 1924 a compulsory population exchange between Turkey and the Balkan 
states took place, sanctioned and supervised by the international community as part 
of the Treaty of Lausanne (1923). This first large-scale population exchange in the 
20th century affected more than one million people. The governments of Greece 
and Turkey regarded it as a measure of homogenization that would stabilize and 
strengthen their national states. The agreement to transfer minority populations 
was based on religion rather than on language or ethnicity. Almost all Greek- or 
Turkish-speaking native Orthodox Christians of Central Anatolia, Ionia, Bithynia, 
Pontus, Eastern Thrace, and some other regions were forced to leave their home- 
lands. About 500,000 persons were expelled from the Balkans, predominantly 
Turks, but also other Muslims, Roma, Pomaks, Cham Albanians, and Megleno-Ro- 
manians. Two groups were exempted from the exchange: Greeks who had lived in 
Istanbul before 1918 and Muslims living in Western Thrace, the Greek part of 
Thrace. 

In spite of the mass exodus, Turkish is still spoken in parts of the old European 
provinces of the Empire, mostly in Bulgaria, Macedonia, and Kosovo. 


9. Turkish in Bulgaria 


Since the dissolution of the Ottoman Empire, large additional groups of Turkish- 
speakers have left their homelands due to economic, political, and cultural pressure. 

The situation in Bulgaria was highly problematic, the Turkish-speaking popu- 
lations being subject to harsh oppression. The official Bulgarian attitudes towards 
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Turkish have varied during the last decades. They were clearly oppressive in the 
1980s, when hundred thousands of Bulgarian Turks fled to Turkey to escape a cam- 
paign of forced assimilation. The Bulgarian government even campaigned to in- 
crease the number of Bulgarian words in Turkish dialects by publishing lists of un- 
desirable Turkish words and their allegedly correct Bulgarian equivalents. The 
official attitude became much more tolerant in the following period. Turkish may 
still have about a million first-language speakers in Bulgaria. 


10. Turkish in Yugoslavia and ex-Yugoslavia 


In the early years of the Kingdom of Yugoslavia, most of the lands owned by Turks 
were confiscated in Macedonia and Kosovo. The Turks were subject to cultural 
pressures. Their right to use Turkish in education was not practiced after 1923, no 
newspapers were published in Turkish, and the use of the Roman-based alphabet of 
Turkey was banned since it might serve to convey nationalist ideas. From 1933 on, 
Turkey and Yugoslavia signed agreements in order to secure the rights of the 
Turks, but the waves of emigration continued. 

After the German occupation of Yugoslavia in 1941, Macedonia was given to 
Bulgaria, which resulted in increased pressure on the Macedonian Turks. Slavic 
endings were added to their family names and education in Turkish was banned. 
Thousands of Turks were killed or exiled during the war. The communist regime 
that came to power in 1944 did its utmost to shield the Turks from influence from 
Turkey and Albania. The introduction of the kolkhoz system led to mass emi- 
grations. An agreement between Turkey and Yugoslavia in 1952 made it even ea- 
sier to force Turks to emigrate. The new Yugoslav policies after 1953 granted the 
Turks more freedom and decreased the rate of the emigration. 

The language policies of the Republic of Macedonia have been considerably 
better than those implemented in Bulgaria. In the post-war period, Turkish schools 
were reopened and Turkish was allowed to be taught in the Latin alphabet used in 
Turkey. However, in 1984 numerous Turks emigrated for fear that the oppression 
observed in Bulgaria might spread to Macedonia. Turkish used to have a strong 
position in Macedonia, and it is still vital there, particularly in the western parts. 
About four percent of the population, also many urban Macedonians, still speak 
Turkish. The language has official status in some municipalities with a high share 
of Turkish-speakers. Also many Macedonian-speaking Muslims declare them- 
selves to be of Turkish nationality and demand Turkish schools for their children 
since they identify Turkish with Islam. Turkish has a high cultural prestige among 
Muslims. Moreover, even today knowledge of Turkish can have a certain impli- 
cation of urban sophistication both in Macedonia and in Kosovo (Friedman 2006). 

As for Kosovo, Turkish enjoys official status in the Prizren district. The situation 
of the Kosovo Turks has been aggravated through the recent Kosovo conflicts. 
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The majority of the Roma in Macedonia and Kosovo - the ‘Khorakhane Roma’ 
or "Turkish Roma' - are Muslims, and many of them speak Turkish as their first 
language. 

The Balkan Yürük groups have become more and more settled but many still lead 
a nomadic life in mountainous regions. The main regions include the Pindus moun- 
tains in northern Greece, Epirus in Greece and southern Albania, the Sar mountains 
in Kosovo, northwestern Macedonia and northeastern Albania, the Pirin mountains 
in southwestern Bulgaria, the Rhodope mountains in Bulgaria, and Dobruja. 


11. Turkish in Greece 


For speakers of Turkish, the situation in Greece after the population exchange in 
1924 has been partly problematic. From 1936 to 1941, the country was ruled by the 
so-called ‘4th of August Regime’, which enacted laws banning the use of Turkish 
in public. In Western Thrace the promotion of Turkish is still relatively restricted. 
Its Turkish-speakers, about 50,000 persons, are not recognized as a linguistic mi- 
nority but belong to the only recognized minority group, the Muslims of Thrace, 
who, according to the Treaty of Lausanne, had been granted special rights, e.g. 
education in Turkish. 

The Karamanlis are originally Turkish-speaking Greek Orthodox believers 
who were, in the framework of the large population exchange, expelled from the 
Karaman and Cappadocia regions of Anatolia to Greece. They may descend from 
indigenous Anatolian groups who maintained Christianity after the Turks had in- 
vaded Asia Minor, or they may go back to Turks who fled to the Byzantine Empire 
and converted to Christianity there. In their old homeland the Karamanlis devel- 
oped refined literary activities, writing Turkish with Greek characters. Members of 
the older generation of Karamanlis still have some command of Turkish. 


12. Gagauz 


Gagauz is a former dialect of Turkish that has developed into an independent lan- 
guage. No evidence of an alleged Kipchak Turkic origin of the language (Kowalski 
1933) has been found. Its speakers are Orthodox Christians. The language is 
spoken mainly in Gagauzia (Gagauz Yeri), an autonomous region in the south of 
the Republic of Moldova, in southwestern Ukraine, and in the Russian Federation. 
A few speakers live in northeastern Bulgaria and Romania. The Gagauz migrated 
from the south, mainly Bulgaria, to the north at the end of the 18th and the begin- 
ning of the 19th century. A connection with the Oghuz-speaking so-called gaga- 
vouzoi of northern Greece is possible, though not proven. At least some of these 
groups may once have spoken varieties closely related to Ottoman Turkish. 


Multilingual states and empires: the Ottoman Empire 737 


13. Language contacts in the Balkans 


The Balkans have been a region of intense multilingualism for many centuries 
(see also Hill, this volume). Its Turkish dialects are traditionally divided into East 
and West Rumelian, according to a classification proposed by Németh (1956). The 
sociolinguistic situations, the motivations, processes and effects of language con- 
tact are highly complex. 

Balkan Turkish dialects have been heavily subject to language contact. Their 
phonology and morphophonology, e.g. the systems of sound harmony, are often 
very similar to those of their non-Turkic contact dialects. Other domains affected 
include derivation and, in particular, syntax, where Turkish generally differs 
greatly from the normal Balkan type. 

Whereas Turkish was a dominant language in the Ottoman period, it is a domi- 
nated language today. Turkish used to be a source for lexical elements in the Balkan 
languages, providing words of everyday life as well as administrative terms. This 
is not the case any more: On the contrary, Slavic and, in some regions, Albanian 
serve as sources for extensive lexical copying into Turkish varieties. The copied 
lexicon has also clearly affected the phonological systems. The change in domi- 
nance relations is reflected in many ways. For example, Turkish words once copied 
into the contact varieties and accommodated there are often reborrowed in the 
shapes they have acquired in those varieties. 

The Balkan Turkish dialects are not in direct contact with the linguistic devel- 
opment in Turkey. As a result of the relative isolation, they may lack certain nuances 
of the republican kind of Turkish. Balkan Turkish media are often highly puristic, 
sometimes even preserving older neologisms that have been abandoned in Turkish 
of Turkey. 

Gagauz has developed specific features due to intensive contact with neighbor- 
ing, mainly Slavic, languages, and thus gives an ‘un-Turkic’ impression (Doerfer 
1959: 270). Its contact-induced syntactic features have been studied by Menz (1999). 

Many issues of language contact deserve further investigation. All Turkic var- 
ieties of the former Empire are products of contact, albeit to varying degrees. Sev- 
eral dialects, e.g those in Albania and Greece, should be studied in more detail. An 
important issue that still poses challenges to linguistic scholarship is the relation- 
ship between substrates and superstrates. There is no self-evident match between 
old ethnic groups and their language varieties. Which dialects are spoken by de- 
scendants of Turkish colonists, and which are spoken by descendants of indigenous 
groupings shifting from Balkan languages to Turkish? How has Turkish as spoken 
by non-Turks influenced the immigrant dialects? It would be important to attempt 
to describe the entire developmental cycle of Balkan Turkish. 
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14. The Balkan linguistic league 


Turkish has been important for the formation of the so-called Balkan linguistic 
league or convergence area (Sprachbund), which is a result of convergence between 
languages of different genealogical affiliations (see also Tomic, this volume). The 
Indo-European languages that constitute the league are Albanian, Greek, Balkan 
Slavic, and Balkan Romance. Many aspects of the Turkish impact on the Balkan 
languages have been studied since Franz Miklosich (1884-1890) but mostly Tur- 
kish has not been considered a genuine member of the league. A closer study of the 
Turkish dialects actually spoken in the Balkans might lead to a revision of this 
view. Recent studies have documented typical Balkan features in Turkish dialects 
and the Turkish influence on other Balkan languages (Hazai 1978, 1990). The Tur- 
kish dialects of Macedonia, in particular, seem to occupy a central position in the 
Balkan linguistic league. 


15. Turkish in other European areas outside of Turkey 


The history of Turkish in Cyprus starts with the Ottoman conquest in 1571. The 
Turks who were settled there by the government in different periods mostly came 
from the area of Adana-Konya-Antalya. Turkish was, on the one hand, the politi- 
cally dominant, prestigious language of the administration but, on the other hand, 
the mother tongue of the numerically and economically weaker part of the popu- 
lation. The conditions of coexistence led to a certain bilingualism. Knowledge of 
Turkish was useful for Greek-speakers when dealing with the authorities. Knowl- 
edge of Greek was important for the Turks in areas where they lived and worked to- 
gether with groups of the majority population. 

With the beginning of the British rule in 1878, Turkish lost its political import- 
ance in favor of English, which became the third component in the multilingual 
setting. The linguistic situation changed radically in 1974, when the island was 
divided into a Greek and a Turkish part. The division ended a Turkish-Greek sym- 
biosis that had developed for nearly four centuries. Only a few small bilingual 
regions were left, e.g. Pile and Potamia in the Greek part and Karpaz in the Turkish 
part. The secession of Northern Cyprus with support from Turkey also led to con- 
tacts with Turkish varieties that were totally unaffected by Greek or English in- 
fluence. Cypriot Turkic was exposed to increasing Standard Turkish impact 
through immigration from Turkey, new mass media, and new educational institu- 
tions. 

As a consequence Northern Cypriot Turkish is currently in a process of change. 
A kind of diglossic situation has emerged. The written language is that of Standard 
Turkish, whereas the dialects have the status of a spoken standard in all domains of 
communication, fulfilling a broad spectrum of functions and enjoying more pres- 
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tige than Turkish dialects in Turkey. The Cypriot Turkish dialects, which actually 
constitute a specific dialect group, are now strongly influenced by the spoken stan- 
dard of Turkey (Demir and Johanson 2006). English has become the most import- 
ant competitor of Turkish. With few exceptions, English is currently the medium of 
instruction at the universities of Northern Cyprus. 

In the post-Ottoman period, Cypriot Turkish was relatively isolated from the 
language of Turkey and thus failed to adopt some of its innovations. On the other 
hand, Cypriot Turkish has been strongly influenced by Greek, the only language 
with which it has had long-lasting and intensive contacts. It displays contact-in- 
duced developments similar to those observed in other Turkic varieties under pro- 
found Indo-European impact (for some typical features see Demir 2002). Many 
parallel structures are found in eastern Black Sea dialects of Turkey and in Gagauz. 

Another interesting case is the fate of Cretan Turkish. Turkish had been intro- 
duced to Crete during the Ottoman rule from 1648 on. Many speakers of Turkish 
also mastered the Cretan Greek dialect. In the course of the dissolution of the Em- 
pire, the descendants of the Turkish colonists and the numerous local converts to 
Islam were forced to leave Crete and to migrate, mostly to Turkey. Successive 
waves of emigration had begun in the 19th century and culminated in the popu- 
lation exchange after the Treaty of Lausanne. Many emigrants still preserve their 
Turkish language and their Cretan Greek dialect. 

Crimean Ottoman was once an Oghuz Turkic variety spoken on the southern 
coast of the Crimea. It has often been referred to as ‘the southern dialect’, though 
the other varieties of Crimean Turkic are of Kipchak Turkic origin. The Crimean 
Khanate (1441—1783), one of the Turkic khanates that succeeded the state of the 
Golden Horde, had come under the protection of the Ottoman Empire in 1475 and 
been aligned with Russia after the Russian-Turkish war of 1768-1774. Crimean 
Ottoman has been extinct for several decades now. 

The so-called Meshkhetian Turks are a Turkish-speaking group from the region 
of Georgia bordering on Turkey. They were deported to Central Asia in the Stalin 
era and later subject to violence there. Their efforts to return to their homeland 
after the disintegration of the Soviet Union have not been successful. Their variety 
of Turkish has not yet been described satisfactorily. 


16. Immigrants in Turkey 


As already noted, Turkey has accepted huge amounts of immigrants both before 
and after the dissolution of the Ottoman Empire. The immigrants were mostly 
Muslim refugees from areas conquered by Christian powers, i.e. by Austria-Hun- 
gary in the Balkans, and Russia in the Caucasus and the Black Sea areas. The 
period 1915-1925 saw a massive influx of refugees. Altogether, over a million im- 
migrants have been admitted. More than 90 percent have arrived from Bulgaria, 
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Greece, Yugoslavia, and Romania. In the period from 1954 to 1956, about 35,000 
Muslim Slavs immigrated from Yugoslavia. 

The immigrants have been settled primarily in the Marmara and Aegean re- 
gions but also in various parts of Central Anatolia. They have made essential con- 
tributions to shaping the basic features of their new homeland. 


17. Turkish varieties in post-Ottoman Turkey 


The development of the modern Turkish standard language has been briefly 
sketched above. It is the descendant of Ottoman Turkish, established on the basis 
of the Istanbul dialect. In spite of the leveling influence of mass media and the edu- 
cation system since the 1930s, there is still much dialectal variation, with mainly 
phonetic and lexical differences. The process of linguistic Turkicization of Anat- 
olia has been going on since the Middle Ages, but it has accelerated after the dis- 
solution of the Ottoman Empire due to the resolute language policy of the Repub- 
lic. The spoken varieties are the results of complex processes in the settlement 
history of Anatolia. Today there exists a relatively uniform supraregional spoken 
standard. The following dialect areas may be distinguished: 


e A western area, divided into northwestern and southwestern dialects. The Mar- 
mara subarea, situated in and around Istanbul, has close connections with the 
East Rumelian dialects. The Aegean subarea extends up to Antalya in the south. 

e A central area, comprising the provinces Eskisehir, Ankara, Yozgat, Kayseri, 
Kırsehir, and Nevsehir. 

A southeastern area, comprising the provinces Adana, Gaziantep, etc. 
e Aneastern area, forming a continuum with Azeri dialects. 
e A northeastern area, stretching from Samsun to Rize along the Black Sea coast. 


The northeastern dialects show analogies with Balkan dialects of the western 
Black Sea shore, Deliorman, Dobruja, and Bessarabia. Németh (1956) attempted 
to explain the similarities by assuming direct tribal relations at the time of the 
settlement. Some dialects of the northeastern area, e.g. the Trabzon dialect, exhibit 
Greek substratum influence in phonology and syntax (Brendemoen 2002, Johan- 
son 2004). The Hemsinli dialect clearly possesses an Armenian substratum. 

Immigrants from the Balkans have partly preserved their Rumelian dialects: 
Deliorman (Ludogorie), Danubian, Macedonian Turkish, etc. 


18. Other Turkic varieties in post-Ottoman Turkey 


Many other Turkic-speaking groupings immigrated to Turkey before and after the 
dissolution of the Ottoman Empire. After the Russian-Turkish war of 1768-1774 
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Turkey received numerous Crimean Tatars, who speak a Turkic language of the 
Kipchak type, albeit strongly influenced by Oghuz Turkic. The bulk of Crimean 
Tatars remained in the Crimea until World War II, when the entire nation was de- 
ported to Central Asia, mostly to rural areas of Uzbekistan. After the collapse of 
the Soviet Union large groups of Crimean Tatars have succeeded in returning to the 
Crimea and resettling there. 

Other small Kipchak Turkic immigrant groups include Karachay and Balkars 
from the Caucasus area. These Muslim peoples had not been deported after the 
Caucasian War of 1817-1864. In 1944, however, both peoples were deported to 
Siberia. After 1957 they were permitted to return to their homeland. The small 
groups of Karachays in Turkey have tended to preserve their language, whereas 
many other Turkic groups have quickly shifted to Turkish. Statistical data on 
speakers of Turkic languages other than Turkish are not available. 


19. Non-Turkic languages in post-Ottoman Turkey 


In the Ottoman period many non-Turkic minority languages were spoken in the 
heartland of Turkey. 

The capital Istanbul played an outstanding role as a multilingual, multicultural, 
and multi-ethnic urban center. Already at the time of the Ottoman conquest, Con- 
stantinople had been a multilingual metropolis. The new situation after 1453 cre- 
ated an environment in which various Turkic and non-Turkic linguistic codes in- 
teracted and fulfilled different functions. Greeks and Armenians constituted 
substantial parts of the population. Speakers of other varieties constantly moved in 
from Anatolia, the Balkans, Transcaucasia, etc. The population movements con- 
stantly changed the situation by adding new varieties to the already existing ones, 
and sociological changes modified the dominance relations between the linguistic 
codes. Non-Turkic minorities played important roles in the interaction, often as 
mediators between speakers of Turkish and other languages. Turkish as spoken in 
Istanbul, the basis of Standard Turkish as we know it today, developed particular 
contact-induced features. 

At the time of the dissolution of the Empire, Turkey still had large Indo-Euro- 
pean linguistic minorities, speakers of Kurdish, Greek, and Armenian. Their sub- 
sequent situation became more problematic due to political conflicts, emigration, 
and linguistic assimilation. 

Speakers of Kurdish, i.e. Kurmanji and Zaza, still constitute large linguistic 
communities. Despite periods of linguistic repression, most of them have retained 
their mother tongues. Large groups who identify themselves as Kurds have shifted 
to Turkish. 

In 1915, after the Armenian insurgency against the Ottoman government, about 
two million Armenians were deported from eastern Anatolia. After the foundation 
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of the Turkish Republic, many Armenian centres in Turkey vanished. Ultimately, 
only Istanbul was left as an urban centre with a vital Armenian community. 

Already under Mehmed II, the Greeks of Istanbul had been granted extensive 
rights. Emigration of Greek residents of Turkey — called Rum in contradistinction 
to Yunan (Ionian), the Greek of Greece — had begun with the Balkan Wars of 
1912-1913. During and after World War I, the Greek population of Pontus and 
other regions was severely decimated. The population exchange after the Treaty of 
Lausanne involved the resettlement of about 1.5 million Greeks. The inhabitants of 
Istanbul and the Aegean islands of Imbros (Gökceada) and Tenedos (Bozcaada) 
were exempted. Many Greeks emigrated due to repressive policies from the 1930s 
onwards. After the Istanbul riots of 1955, directed against the Greek community 
but also affecting Armenians and Jews, no further violent incidents have occurred. 
Greeks now mostly live in Istanbul, the islands of Imbros and Tenedos and the 
Princes' Islands. The number of Greek-speaking residents even seems to have in- 
creased in recent years. 

In the late 19th century, shortly before the dissolution of the Empire, large por- 
tions of non-Turkic-speaking Muslims emigrated from the Caucasus area. A popu- 
lation transfer of many hundred thousands Muslims into the Ottoman Empire took 
place after the Caucasian War of 1817-1864, which ended with the Russian con- 
quest of the Caucasus. The groups involved were mainly Circassians (Adyghe), 
Ubykhs, Abkhaz, and Abaza. Many Abkhaz and also Ajars, Georgians of Muslim 
faith, still live in several areas of Turkey. 

Laz, a sister language of Georgian, is the only member of the South Caucasian 
family that is spoken primarily in Turkey, namely on the southeastern coast of the 
Black Sea. All speakers are bilingual and use Laz primarily in private communi- 
cation (Kutscher 2008). 

The Ottoman Empire also allowed non-Muslim refugees to settle on its terri- 
tory. In particular, Sephardic Jews were given refuge after their expulsion from 
Spain in 1492. They brought the Romance variety of Judeo-Spanish or Ladino to 
Istanbul and the Balkans. Most Jewish residents of today's Turkey are descendants 
of this group. In the 20th century, after the dissolution of the Empire, many Jews 
emigrated but the country still has a small Jewish population. 

A curious detail is the existence of the village of Polonezkóy near Istanbul, 
founded in 1881 by Polish settlers who did not wish to return to their occupied 
homeland after the Crimean War. 

After the dissolution of the Empire, Turkey also remained the home of many 
speakers of the Semitic languages Arabic and Syriac. They remain outside the 
scope of the present article, which focuses on the European situation. 

Due to the socio-political circumstances, most linguistic minorities in Turkey 
are nowadays bilingual, speaking Turkish as a second language with varying levels 
of competence. For details on the language situation in Turkey see Andrews (1989). 
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20. The legacy of the imperial era 


The linguistic legacy of the imperial era in the post-imperial era has not been suf- 
ficiently analyzed but still poses challenges to scholarship. Too little is known 
about the overall roles of Turkish since the beginning of the 20th century, about 
the linguistic variation in the transplanted dialects, and the impact of indigenous 
languages on them. We lack data on the patterns of acquisition, language loss, and 
language shift, and on the degrees of endangerment due to external and internal 
factors, e.g. the varying rates of competent speakers and speakers showing language 
attrition. In other words, this is a wide open and most promising area for future re- 
search. 


Also of interest in this volume: chapter 7 by Menz. 
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42 Multilingual states and empires in the history 
of Europe: Tsarist Russia and the Soviet Union 


Dieter Stern 


1. Introduction 

2. Multilingual Russia — the basics 

3. The language question in the Tsarist Empire 

4. The language question in the Soviet Union 

3. Of peoples and languages - the ethnification of the Soviet citizens 
6. The aftermath 

1. Introduction 


This chapter will deal with the way the Soviet Union and its predecessor, the Tsarist 
Empire, have addressed the issue of multilingualism within their confines. It will 
start with the Russian elites’ first recognition of language, or rather languages, as a 
political issue in the 18th century, highlighting the major attempts at tackling the 
language question in Tsarist Russia. The historical outline will then proceed to lan- 
guage politics in the Soviet Union, which are heir to, though of course not outright 
continuations of, Tsarist politics. In retrospective, official Russia's approach to 
multilingualism appears to vacillate between two poles, encompassing what 
Kreindler (1985), following Fishman (1982), labeled the eastern vs. the western 
tradition of Russian language politics. The former is associated with a tolerant 
approach toward linguistic diversity, whereas the latter is said to pursue the an- 
nihilation of diversity by what has become known as russification. The eastern 
tradition “accepts the diversity of mankind and its tongues as natural and even de- 
sirable, while the other's ultimate goal is a single system of values, culture, beliefs 
and language" (Kreindler 1985: 345). The labeling is suggestive of the alleged ori- 
gins of the divergent ways of dealing with the language question, covertly blaming 
Western Europe for the introduction of intolerant attitudes towards linguistic diver- 
sity into Russian cultural and political thought. Laitin, Petersen and Slocum (1992) 
refute this idea of an "oriental exceptionalism", according to which Russia and 
especially the Soviet Union followed a different, more friendly course in dealing 
with linguistic minorities. They maintain that Russian and Soviet ideas about lan- 
guage and culture did not differ principally from those current in Western Europe. 
The observable differences are rather due to a less successful implementation of 
language politics than to a conceptual difference. Indeed, the emphasis on a prin- 
cipal ideological divide between East and West may have been at times too strong 
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though there is indeed a specific train of thought in Russia which time and again 
comes to the fore. Especially the policy of nativization (korenizaciia) favored 
by Vladimir Ilyich Lenin would be the most obvious example of an openly tolerant 
stance towards linguistic diversity with even a somewhat idealistic flavor about it. 
But, as will be shown, even this concept ultimately aimed at the dissolution of cul- 
tural and linguistic diversity, which was expected to come about via the inevitable 
emergence of the communist society. 


2. Multilingual Russia - the basics 


From its very inception in the Middle Ages the Russian state, or rather states, were 
composed of a multilingual population. On the territory of the Principality of Nov- 
gorod, even within the confines of the capital Novgorod itself, Slavic-speaking 
people lived alongside speakers of Finnic and a minority of Scandinavian speakers. 
But only with the eastward expansion of the Principality of Muscovy from the 
16th century onwards an ever increasing amount of ethnolinguistic groups was 
incorporated into the by then unified Russian state. The maximum expansion was 
reached in the 19th century. Since this time the ethnolinguistic make-up of the Rus- 
sian Empire has basically remained unchanged, with only minor readjustments 
until the collapse of the Soviet Union. The Soviet Union officially counted 150 lan- 
guages. Unofficial estimates give a figure of up to some 200 languages. The 
linguistic kaleidoscope comprises a wide array of genetic affiliations, viz. Indo- 
European, then Turkic, Mongolian, and Tungusic, which are sometimes subsumed 
under the more tenuous label Altaic, further Uralic, consisting of Finnic, Ugric, 
and Samoyedic, and finally Caucasian and Paleosiberian. The latter does not 
represent a language family in the proper sense but a group of genetically isolated 
languages, such as Ket or Nivkh. The same is partly true for the languages lumped 
together under the label of Caucasian languages. 

The sheer number of languages is somewhat deceptive because it suggests a 
diversity of everyday public communication, which stands in sharp contrast to 
the preponderantly monolingual Russian reality of more recent times. According 
to the census of 1939, out of an official total of 150 languages 140 were spoken by 
no more than 9.16% of the entire population. There were only three languages 
which were spoken by a substantial segment of the total population as primary or 
ethnic language, viz. the three Slavic languages Russian (58.39 % = 99 million), 
Ukrainian (16.48 % = 28 million), and Belorusan (3.09 % = 5.3 million) (Grenoble 
2003: 7). For the latter two languages it has to be added that in spite of their 
relatively large number of speakers they were hardly ever used in public because 
they were generally held in low esteem. The same holds for the remaining lan- 
guages, especially those which are spoken by small groups and do not have an 
official function. These were and still are considered a useless burden not worth 
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being handed down to one's children. Thus, multilingualism in the Soviet Union 
was somewhat lopsided in that it played a rather marginal role in daily transactions. 
At least this is true of urban settlements with a strong Russian majority. 

Soviet bilingualism was aptly labeled non-reciprocal by Bilaniuk (2005), 
which means that almost everywhere bilingual people proficient in both Russian 
and their respective titular or minority language had to deal with Russians who 
stuck to an uncompromising Russian monolingualism even where they formed a 
minority. In 1979, 977 % of the ethnic Russian population was monolingual in Rus- 
sian, whereas 82% of the non-Russian population claimed some knowledge of 
Russian (Grenoble 2003: 58). In all cases were non-Russian languages locally 
bound. There was some variation between Soviet republics as to how far Russian 
penetrated into the daily life of the average bilingual speaker. In the Baltic and 
Central Asian republics, an unassimilated bilingualism prevailed where the titular 
language! was retained for use in the majority of social domains, while Russian 
was learnt only for a limited range of purposes. Everywhere, however, especially 
in urban contexts, there was a clear tendency towards an assimilated bilingualism, 
with Russian replacing the titular language in social domains of high prestige, 
while the titular language was restricted to specific domains, such as the family or 
church (Laitin et al. 1992: 140). A speaker of a non-Russian language who moved 
permanently to another part of the Soviet Union practically became a monolingual 
speaker of Russian. Russian was, in a way, the prerequisite for individual mobility; 
it was the de facto lingua franca of the Empire. The present linguistic landscape 
was shaped basically after World War II, when labor migration became particularly 
strong. It was mostly Russians, but also members of other nationalities, who took 
advantage of the enforced industrialization and moved to faraway places for at- 
tractive jobs. Thus, territories with a previously relatively homogeneous non- 
Russian local population became increasingly mixed with monolingual Russian 
speakers, as for example the Estonian SSR, where up to World War II 88.2 % of the 
population were speakers of Estonian. By the 1980s, this figure had dropped to 
65 % (Carmichael 2000: 266). Monolingualism in non-Russian languages, which 
was the rule in the remote provinces of Tsarist Russia, practically ceased to exist 
in the post-war Soviet Union. These facts certainly qualify statements which one- 
sidedly blame Russian and Soviet language policy and planning for the actual rus- 
sification after World War II. Even if Soviet policy makers would have been more 
serious in their support of linguistic diversity, the hardly controlable social forces 
below the political level would have inescapably brought about an almost com- 
plete russification in the long run. 

The most detailed and certainly most inspiring Russian work on Soviet lan- 
guage politics is Alpatov (2000), which deserves praise for not indulging in a blunt 
criticism of Soviet oppression and injustice. Instead, Alpatov tries to come to grips 
with the motives and ultimate goals of Soviet policy makers and their attendants. 
The only systematic survey of Russian language policy from the beginnings of the 
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Russian empire to the end of the Soviet Union to date is Belikov and Krysin (2001: 
332-414). Their historical survey deserves special appreciation for its ample treat- 
ment of the Tsarist epoch. The long-term effects of multilingualism have been ad- 
dressed under the heading of language endangerment and death ever since the be- 
ginning of the 1990s. These issues will not concern us in this paper. Two basic 
references will be given instead. Vachtin (2001) is highly recommended reading on 
the subject of language shift and language death among Siberian native peoples. 
A reference guide to the endangered languages of the Russian Federation is Neroz- 
nak's ‘Red List’ (2002a). There is also a reference guide to the official languages 
of the Soviet Union by the same author (2002b). Finally, general information on 
the languages of Russia and the Soviet Union may be obtained from a number of 
encyclopedic works, most notably Comrie (1981), the five volumes of Jazyki na- 
rodov SSSR [Languages of the peoples of the Soviet Union] and their successor, the 
three volume encyclopedia Jazyki Rosskijskoj Federacii i sosednich gosudarstv 
[Languages of the Russian Federation and neighbouring states]. 

All large-scale studies on the effect of language politics on the multilingual 
landscape of Russia and the Soviet Union have to rely on the data of general cen- 
suses, which time and again may be happily supplemented by microcensuses 
for particular regions. Besides a few notable exceptions there exists practically no 
qualitative sociolinguistic research on the matter of multilingualism. The first All- 
Russian census was held in 1897. The second one was planned for the years 
1915/16 but the project could not be realized due to World War I. The next general 
census which was succesfully accomplished was the First All-Union Census of 
1926/7. From this time on regular censuses were scheduled every ten years but not 
all of them were carried out. Censuses were held in 1939, 1959, 1979, and 1989. In 
the year 2002 the first general census of the Russian Federation was held. The 
problem with the census data is that they are rather unreliable in some respects. 
One principal problem certainly is that censuses operate with predefined ethnol- 
inguistic categories, which are also used as obligatory passport entries. Especially 
in later censuses, when the administrative system of ethnic classification was al- 
ready well established and accepted as authoritative by Soviet citizens, many gave 
the language equivalent to their passport ethnicity when asked for their *mother 
tongue' (rodnoj jazyk), despite the fact that they might not even be able to speak a 
word of that language. An additional distortion resulted from the way passport eth- 
nicity was determined individually. As a rule of thumb, individuals determined 
their ethnicity by descent, taking over the ethnicity entry of their parents, or of one 
of their parents of their own choosing in case of mixed marriages. In regions with a 
strong minority population eligible for certain privileges, almost everyone tried to 
lay claim upon the financially more attractive ethnic label of the minority. Apart 
from the practically motivated insufficiencies of census data, the whole system of 
ethnification in the Soviet Union is flawed on principal grounds, as will be shown 
in detail in section 4. 
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3. The language question in the Tsarist Empire 


The idea of Russian as the state language of the Russian Empire dates back to the 
18th century. Especially Catherine the Great was fond of the idea of linguistically 
homogenizing the subjects of the Empire. The motivation behind it, however, was 
not so much inspired by feelings of cultural supremacy but rather determined by 
considerations of a more practical nature. Catherine the Great was exclusively con- 
cerned with the consolidation of Russian power in recently acquired territories 
(Laitin 1998: 39). However, no further course of action was taken then. Only when 
in 1863 the Polish aristocracy, which found itself under Russian rule since the 
Polish partitions at the end of the 18th century, rose up in revolt against the Empire, 
measures were taken against the use of those languages which were considered a 
threat for the state's security. Besides Polish these measures also affected Ukrai- 
nian, whose recent aspirations toward linguistic independence from Great Russian 
patronage had for some time been suspected of being secretly instigated by the 
Polish aristocracy. Immediately following the uprising in 1863, the minister of the 
interior, Petr Aleksandrovié Valuev, issued a circular which strongly recommended 
to restrict the publication of books in Polish and Ukrainian to fiction and poetry. 
The regulations of the circular were adopted without modifications in the notorious 
Ems ukaz of 1876 and thus given legal status. The Ems ukaz remained in force 
until the first Russian revolution in 1905. Remy (2007) is a recent study on the 
background of both the circular and the ukaz. The circular and the ukaz were, how- 
ever, not just an immediate and inevitable reaction to a direct political threat. They 
also reflected a general change of the political climate under tsar Alexander II, who 
advocated the idea of Great Russian cultural supremacy. As a consequence, other 
minorities and their respective languages came into the line of fire. The adminis- 
trative reform of 1883 for the Caucasian territories was devoted to the russification 
of the resident population (obrusenie tuzemcev). Though this smacks rather of a 
Great Russian mission civilisatrice, its primary concern was, as in the other west- 
ern territories, the consolidation of power by way of establishing a unitary system 
of administration throughout the Empire (Laitin et al. 1992: 135). 

Chauvinistic overtones certainly existed but these alone would not have suf- 
ficed to take political action. This is confirmed by a comparison with the eastern 
territories, which were considered uncivilized and inferior to Russian culture. Es- 
pecially the northern Asian possessions with their nomadic hunter-gatherers 
evoked images of the noble savage among the Russian elites, including Catherine 
the Great. The complimentary attitude would describe the Siberian natives as back- 
ward and miserable and emphasize the positive effect of cultural and linguistic rus- 
sification on them. The leader of the December uprising of 1825, Pavel Ivanovic 
Pestel', considered it his Christian duty to lead the Siberian native out of his ex- 
treme poverty and ignorance (Slezkine 1994: 91). Similar attitudes were widely 
held throughout the 19th century. The principal cultural difference implied the idea 


750 Dieter Stern 


of a civilisatory mission. At the same time, the imbalance of power allowed for a 
far less authoritarian course of action. In 1822, count M. M. Speranskij, governor 
of Siberia, created the legal foundations for a protective and supportive colonial 
paternalism (Kappeler 2001: 140) which would treat the natives as children who 
were to be led to the heights of Russian Orthodox civilisation in a gentle and care- 
ful manner. 

The task of civilizing the indigenous peoples was mainly undertaken by indi- 
viduals inspired by Christian missionary ideals. Naturally, hardly anyone of these 
bothered to differentiate between the hunter-gatherers of the north and the complex 
Muslim societies of the south. This was different with government officials and 
administrators, who attached a far greater importance to the Turkic-speaking Mus- 
lim population than to the small bands of hunter-gatherers of the northernmost ter- 
ritories. While the loyalty of the relatively large population of the Turkic Muslims 
was considered a vital factor for the consolidation of the Empire, the sparse and 
strategically insignificant population of the north was largely ignored (Kreindler 
1985: 350). So it was no accident that the only substantial financial support by the 
state was given to the pedagogue Nikolaj I. Il’ minskij for his educational project 
aimed at the Muslim Tatars. In 1863 Il' minskij founded a school for baptized Tatars 
(kresceno-tatarskaja škola) in Kazan’. This school served as a model for the so-called 
‘new method schools’ (novometodnye skoly), which were erected throughout the 
Muslim territories in the 1880s. Il’ minskij's educational programme adhered to the 
concept of an accommodating and gentle russification. In the elementary classes, 
indigenous Turkic languages were used as languages of instruction. Russian, how- 
ever, was introduced as a compulsory subject right from the first grade and in the 
upper classes Russian replaced the indigenous language as language of instruction. 
Il’ minskij formulated the ultimate goal of his indigenous language schools as fol- 
lows: 


*Obrusenie inorodcev i soversennoe slijanie ich s russkimi po vere i jazyku — vot koneč- 
naja cel’, k kotoroj dolžna stremit'sja sistema inorodéeskogo obrazovanija v krae.’ 
“The russification and complete merging of the indigenous population with the Russians 
in their faith and in their language is the ultimate goal, which the system of indigenous 
education must strive to achieve in our country.' (Quote from Khasanov 1987: 63; trans- 
lation D.S.) 


Thus, what at first glance appears as a tolerant attitude towards linguistic diversity, 
turns out to be just a different, more considerate way of russification in the end. 
This holds for both the Christian paternalism of the 19th century and the nativiz- 
ation policy of the early Soviet Union, which will be dealt with shortly. 

In the end, the efforts of missionaries and other individuals who devoted their 
energies to enlightening the natives could not have any effect on the linguistic 
make-up of the Tsarist Empire. They lacked the necessary institutional support 
from the government, which was obviously not seriously interested in spending 
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large amounts of money in order to turn Siberian hunter-gatherers or other natives 
into Russians, who in their perspective could neither contribute to nor endanger the 
well-being of the Russian state (Alpatov 2000: 31—32). But even where govern- 
mental support was granted, very little was achieved indeed. The foremost problem 
was that there was no compulsory school attendance for all children. Beyond this 
there was no incentive for the natives to have their children sent to Russian 
schools. Consequently, most missionary schools, which had been set up according 
to the principles of Il minskij, were shut down again after only a few years (Beli- 
kov and Krysin 2001: 345-347). 

Tsarist language policy hardly amounted to more than a patchwork of uncoor- 
dinated efforts, which resorted to straightforward oppression where political threats 
were perceived, and restricted itself to the half-hearted support of paternalistic ef- 
forts at civilizing the natives where no real danger was felt. Obviously, no outright 
nationalist programme lay at the foundations of the Tsarist state. At best, there 
were certain government officials who held chauvinistic sentiments. Besides, there 
were other factors which impeded systematic russification. The vast geographical 
proportions of the empire, whose territories had been incorporated in rapid succes- 
sion without giving it time to develop the necessary infrastructures, precluded 
any serious attempt at spreading Russian civilization and learning throughout the 
empire without leading into financial ruin. In the western provinces open and com- 
pulsory russification would certainly have instigated major national uprisings. 


4. The language question in the Soviet Union 


The one political concept which laid the foundations of and became the measure 
for language planning and politics in the Soviet Union, was Lenin's nativization 
policy (korenizacija). This policy began in 1923 and - throughout Soviet history — 
was never officially revoked (Grenoble 2003: 44—45). The basic idea behind nativ- 
ization was that all languages had equal rights. As a consequence, the non-Russian 
languages were to be brought on an equal institutional footing with the dominant 
Russian language. This appears to come close to elementary linguistic human 
rights. Though this support of ethnic particularism stands in contrast to the idea of 
an ideologically and culturally unified working class, the only feasible alternative 
would have been to promote outright russification. This, however, would have 
smacked too much of the Great Russian chauvinism of late Tsarist Russia, which 
the Bolsheviks were eager to overcome. Besides, the enthusiasm of the founding 
years of the Soviet Union brought about a general conviction that class solidarity 
will finally overcome linguistic and cultural barriers and ultimately lead to a uni- 
fied culture of the Soviet Man (homo sovieticus) (Collins 1998). 

In practice, nativization aimed at supporting — educationally and institution- 
ally — all linguistic communities within the Soviet Union, irrespective of their size. 
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Since most languages of the Soviet Union had no literacy of their own, the practi- 
cal aims of nativization called for a tremendous effort as within a short period a 
vast bulk of languages needed to be codified. This took up most of the time and re- 
sources of Russian language planners throughout the 1930s. The programme of na- 
tivization closely interacted with the wider educational programme of eliminating 
illiteracy (likvidacija bezgramotnosti, in short likbez) throughout the Soviet realm, 
which, according to the 1926 census, started with a rate of illiteracy of 74 96 for the 
total population (Grenoble 2003: 47). When the Bolsheviks seized power, only 13 
languages within their sphere of control had an orthography of their own (Grenoble 
2003: 45). In order to deal with the tremendous task ahead, the All-Union Central 
Committee of the New Alphabet (Vsesojuznyj Central'nyj komitet novogo 
alfavita) was founded. The members of this committee tackled the enormous task 
of designing appropriate alphabets for all languages of the Soviet Union. This was 
not as simple as it might appear at first. A major problem was to come to grips with 
dialectal variation and settle upon invariant forms which would be accepted by all 
speakers. For most languages, fundamental linguistic field work had to be done 
prior to creating an orthographic norm. As soon as the written form had been es- 
tablished, textbooks had to be compiled. The results of the committee's effort de- 
serve admiration. In 1924 there were textbooks for 25 languages. Only nine years 
later the number had risen to 104 languages (Grenoble 2003: 47). The enormous 
success would not have been possible without the commitment of a group of out- 
standing linguists headed by turkologist N.F. Jakovlev. However, hastiness also 
played its part in the quantitative achievement. Many languages were described 
and codified in a rather inadequate, schematic way, using a grammar blueprint 
based on Russian grammatical structures and categories. This descriptive practice 
became known as Shapirografiia (Alpatov 2000: 91). 

Since the languages were so many, a general scheme was worked out which de- 
termined which languages were to be dealt with first and what sort of educational 
programme could be provided for them in short term. This scheme had the form 
of a priority list, known as ABCD-hierarchy (Smith 1998: 51). The order of the 
languages on this priority list was determined on the basis of three criteria: the size 
of the linguistic community, the settlement structure of the linguistic community 
(compact or dispersed settlement), and the existence of an orthography. The third 
criterion proved to be somewhat superfluous because, with few exceptions, lan- 
guages with an autochthonous writing tradition were finally treated in the same 
way as the vast bulk of languages without script: In both cases the existing writing 
systems were replaced by newly designed orthographies on a different scriptural 
basis. Right from the beginning there was a general consensus that the Arabic and 
the Mongolian script ought to be replaced by either the Cyrillic or Latin alphabet. 
Obviously, the cultural unity among the respective groups within the Soviet Union 
and beyond the borders, which was signalled by both scripts, was dreaded. Be- 
sides, both scripts had clear religious connotations, which made them hard to tol- 
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erate for Soviet ideologues. Arabic was the script of Islam and the Mongolian 
script was closely associated with Buddhist writings. 

At the beginning of the Likbez campaign alphabetisation was based on the 
Cyrillic script. In 1924, however, Azerbaijanian representatives proposed the use 
of the Latin script instead. This was commonly approved and all languages which 
were written in Arabic script were provided with a Latin alphabet. By the first 
years of the 1930s Latin alphabetization had already come to be the standard for all 
scriptless languages. Languages which had already been provided with a Cyrillic 
orthography were converted to the Latin script. At the peak of this development 
even the latinization of Russian was being discussed. By the end of the 1930s, how- 
ever, the whole process was reversed and all Latin orthographies were converted to 
Cyrillic (Imart 1966). The reasons given for this complete shift were rather flimsy 
and amounted to nothing more than that some children experienced difficulties 
keeping the Latin and Cyrillic script apart when they started to learn Russian as a 
second language in school. The reversal obviously marks a shift in the opinion of 
Soviet officials, among whom Great Russian chauvinism was gaining ground 
again. But why had the Latin alphabet been propagated in the first place? There 
were basically two reasons in favor of the Latin alphabet. Firstly, replacing the 
Arabic script with Cyrillic would certainly have been perceived as an outright and 
blatant act of russification. Secondly, up to the late 1930s Soviet communists still 
subscribed to the ideals of internationalism. For internationalist thinking the centre 
of gravitation, however, lay still in the Latin West of Europe, which was considered 
the furthest developed part of the civilized world. Especially Nikolaj Jakovlevič 
Marr (1865-1934) and his followers, whose New Theory of Language (Novoe uce- 
nie o jazyke) dominated Soviet linguistics at the time, were convinced that latiniz- 
ation would be an inevitable concomitant of the world revolution. 

Whereas Shapirografiia contributed in a rather involuntary fashion to the rus- 
sification of indigenous languages, lexical corpus planning openly advocated the 
modernization via internationalization of the lexicon of the indigenous languages. 
Internationalization, however, meant in practice russification. Even though there 
was a brief period in the beginning when native resources were preferred as the 
medium of adaptation over the perceived lexical requirements of the new political 
order and modern industry, the shift to direct loans from Russian was propagated 
very soon. A general issue in language debates of the time was the prominent role 
of Russian among the Soviet languages as the only true international language. 
Only via Russian with its rich international vocabulary could the superior knowl- 
edge of the western sphere be accessed. This reasoning laid the foundation for the 
final ascent of Russian as the one and only Soviet model language. Already by the 
1930s direct loans from Russian had become the only source for what became 
known as language enrichment (obogascenie jazykov). According to Lewis (1972: 
157), some 70-80 % of all new vocabulary coined in the 1930s and 1940s were 
taken directly from Russian. This was not only a practical matter of economizing 
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corpus planning efforts. In the 1940s a decree on spelling known as the Common 
Rule (obscee pravilo) was issued which commanded that Russian loans were to be 
written in their original Russian spelling, no matter how they were actually pro- 
nounced in the borrowing language. It did not suffice that the word was actually 
taken from Russian, its Russianness had to be visible for everyone. This act aims in 
an obvious way at maximizing the symbolic value of Russian within Soviet society. 
Lexical development became the major occupation of language planners. Special 
committees were created for almost every single language, which issued diction- 
aries that, though claimed to document the entire lexicon of the respective lan- 
guage, were primarily intended to acquaint heritage speakers with the new Soviet 
vocabulary. 

The ultimate aim of both nativization and likbez was to prepare the entire 
population irrespective of their varying cultural and linguistic background for the 
requirements of modern industrial society. In practice, however, the ambitious pro- 
gramme of nativization was doomed to fail. The backbone of nativization was the 
elementary school system, which was required to develop educational programmes 
for all titular and minority languages and to provide every single elementary school 
attended by a sizable number of non-Russian pupils with teachers who were com- 
petent in the respective minority language. The reality was, however, that almost 
exclusively monolingual Russian teachers were available. Thus, the practical fail- 
ure of nativization contributed effectively to the spread of Russian at the cost of 
indigenous languages. There is only one social institution which contributed more 
to the rapid russification of non-Russian speakers, viz. the Red Army, which since 
1936 operated on the basis of compulsory military service and observed a habit of 
forming ethnically mixed units. 

It may be doubted whether the failure of nativization was severely regretted, 
since it effectively promoted the desired unification of the proletariat on the lin- 
guistic level without chauvinistic overtones. Despite its apparent cultural relativ- 
ism and tolerance with respect to linguistic and cultural diversity, nativization itself 
may only be properly appreciated against the backdrop of a unidirectional view of 
cultural and linguistic development. Although languages may enjoy equal rights, 
linguistic diversity ultimately reflects different levels of societal development. 
Where these developmental differences will be overcome, a unitary language of 
the progressive masses, which according to the Marrists ought to take the shape of 
a creole language, will evolve in the end. In later years, especially after Stalin's 
refutation of Marrism, the future ideal of a mixed or creolized language for the pro- 
letariat would be replaced by Standard Russian, which was advocated as the most 
advanced and progressive language. Goodman (1968) gives a detailed discussion 
of the concept of a future world language of the proletariat in Russian linguistic 
thought. 

Though in Soviet disguise, Great Russian chauvinism once again stepped onto 
the political stage with Stalin's address to the XVII Party Congress. In his address 
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Stalin disputed Lenin's conviction that Great Russian chauvinism poses a greater 
danger than a moderated ethnic particularism. Stalin claimed instead that the 
greatest danger derived from nationalism which had been forgotten to be fought 
(Grenoble 2003: 45). The Great Russian turn came, however, only some time after 
World War II. During the Khrushchev area the vision of a Soviet people (sovetskij 
narod) which was not only politically but also culturally and linguistically united 
gained currency within the Soviet leadership. Once again, the international ties of 
the Russian language were played off against the other languages to demonstrate 
the leading role of Russian as a world language within Soviet society. A hierarchy 
of societal importance was established, with Russian at the top end and the so- 
called one-settlement languages (odnoaul’nye jazyki) at the bottom end (Grenoble 
2003: 57; Gutschmidt 1998: 116). Other arguments of a more practical nature, like 
the need for hightened mobility or the Red Army's readiness for battle, were also 
put forward. Within Soviet society Russian was promoted to the official language 
of the entire Soviet Union on the basis of its being the officially recognized lan- 
guage of interrepublic communication (jazyk meZnacional’nogo obScenija). Sub- 
sequent legislative readjustments translated the change of official attitude into ac- 
tion. Clause 19 of the Education Reforms of 1958—59 declared that education in the 
mother tongue was no longer compulsory. Russian was made a compulsory subject 
in turn, though in practice it had already been an obligatory part of the syllabus of 
indigenous schools before. Thus, a cornerstone of nativization policy was re- 
moved, while the policy itself was never officially repudiated. Party rhetorics still 
declared the multinational character of the Soviet Union as one of its basic assets. 
The relationship between the nations was described in most positive terms. Epi- 
thets like ‘blossom’ (rascvet) and ‘rapprochement’ (sbliZenie) were the common 
stock of speeches and statements on multinational matters. The policy of russifi- 
cation in the name of the Soviet people was taken over by Brezhnev and thus lasted 
into the last decades of the Soviet Union. A decisive change was introduced in the 
Constitution in 1977. Ever since 1936, article 45 of the Constitution had guaran- 
teed the “right to school instruction in one's native language." This right was now 
turned into the much weaker guarantee that opportunities for school instruction 
will be provided for by the state (Grenoble 2003: 58). The era of Khrushchev and 
Brezhnev also saw the most devastating impact on smaller languages. Especially 
the introduction of compulsory education of indigenous children at Russian board- 
ing schools during the 1960s caused major disruptions in the transmission of mi- 
nority languages from one generation to the next. 


756 Dieter Stern 


5. Of peoples and languages - the ethnification of the Soviet citizens 


No matter how Soviet policy makers decided the question of the equality of the 
languages spoken on Soviet territory, it ought to be determined what counted as 
a language in the first place, the more so because the recognition as a language 
implied the recognition of a corresponding people who spoke that language, i.e. 
amounted to an act of sociopolitical significance. Identifying a language was no 
straightforward task which could be dealt with in a simple matter-of-fact pro- 
cedure. But exactly this was required by the administrative apparatus, which relied 
for this task on the competence of ethnographers and linguists. The administrative 
need for explicitness dictated that every citizen belong to one and only one of the 
ethnolinguistic categories, the so-called narody and natsional’nosti (narodnosti), 
determined by ethnographers and linguists in the service of the state, who adhered 
firmly to the Herderian notion of the unity of language and ethnos. A basically tax- 
onomic approach to ethnicity combined with the need for clear-cut administrative 
categories to produce a rather artificial system of ethnic and linguistic classes, 
which were more often than not at odds with the fuzzy categories of ethnic self- 
identification. The state actually disenfranchized its citizens of the right to deter- 
mine their identity according to their own ideas and practices. Ethnicity became a 
predefined, non-negotiable administrative category. The acceptance of the validity 
of the new type of top-down ethnicity among the Soviet citizens was finally won 
with reference to its scientific basis. Science was the one social authority everyone 
had learnt not to doubt. The whole complex of state-governed ethnification has 
been critically appraised by Hirsch in her groundbreaking monograph (2005). 
Though Soviet officials always attached great importance to their decisions on 
ethnic and linguistic matters, stressing that they are well-founded scientifically, it 
is obvious that the recognition or non-recognition of a language qua people was, 
above all, a political issue and not just a simple statement of objective truth. An oft- 
cited case in point are the Turkic languages qua nations of the Central Asian Soviet 
republics. Up to 1924, when it was decided to break up the Turkestan ASSR into 
smaller republics, language seems to have played only a minor role in the negoti- 
ation of group identities. Though labels like Kazakh and Turkmen, identifying 
communities beyond immediate clan affiliations, had some currency prior to So- 
viet ethnic engineering, national consciousness based primarily on a strong sense 
of linguistic unity was alien to the region. The modern ethnolinguistic nations of 
the Uzbeks, Kazakhs, Turkmens, Kyrgyz — modelled on the Herderian principle of 
the unity of language and ethnos — were certainly designed on the basis of thorough 
and conscientious linguistic research into the major dialect groupings within the 
Turkic vernacular continuum. As has recently been pointed out by Arne Haugen 
(2004), this process even received support from local authorities, who variously 
petitioned to be separated from particular neighbouring groups. But the process did 
not so much uncover hidden identities than change the ideological foundations for 
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the negotiation of group identities. It remains open to debate whether there was a 
policy of divide et impera underlying all this, but there can be no doubt that intro- 
ducing scientifically determined and therefore non-negotiable ethnolinguistic iden- 
tities certainly helped overcome traditional power structures and integrate Central 
Asia into the overall administrative framework of the emerging Soviet state. In 
other regions the taxonomic rigour of state ethnographers produced highly artifi- 
cial and tenuous ethnolinguistic categories, like the various Samoyedic groups 
presently known as Selkups, who live dispersed over a vast territory stretching 
from the Tomskaya oblast' up into the northern reaches of the Krasnoyarskii krai. 
In the 1930s these groups were subsumed by Mr. and Mrs. Prokof'ev under the 
label of the Selkups for the sole reason that they spoke similar Samoyedic dialects. 
In fact, the groups which make up the Selkup ethnos had no prior knowledge of 
each other and only the northernmost of these groups used the name Selkup for 
self-identification (Horn 2005). Another case in point are the Dolgans on the Tai- 
myr peninsula, which Boris O. Dolgikh promoted to the rank of a natsional’nost’ 
only with reservations. The label Dolgan was used as an endonym as well as an 
exonym for a symbiotic amalgamation of various migrant and resident indigenous 
groups, among them Yakuts, Evenks, Russian, Eneche, and even single Nganasan 
families. Dolgan identity can best be described in terms of cultural creolization 
with its inherent ethnic ambivalence. Anderson (2000) gives a detailed discussion 
of the complexities of ethnic identities on Taimyr. 

The practice of defining and recognizing languages and people resulted in a 
four-tiered hierarchy of languages (Grenoble 2003: 3). The first tier was occupied 
by Russian, which President Khrushchev declared the sole medium of communi- 
cation between the Soviet nations and nationalities (jazyk meZnacional’nogo ob- 
Scenija). Russian was the only language operative above the level of the 15 Soviet 
republics which constituted the Soviet Union. The Russians were concurrently de- 
clared the most advanced and leading nation among the Soviet nations. The second 
tier was the realm of the so-called titular nations and languages. These are the of- 
ficial languages of the Uniate republics, which bear their respective names, like 
e.g. Lithuanian in the Lithuanian SSR or Kazakh in the Kazakh SSR. The third tier 
was reserved for those languages which were officially registered as autonomous 
languages but lacked official functions. These languages and their respective 
peoples, which were given the status of nationalities (natsional’nosti, narodnosti) 
and not nations (natsii, narody), could count on financial and educational support 
as well as personally granted minority privileges (/’goty) for their respective 
members. The fourth tier comprised the languages and peoples without official rec- 
ognition. 
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6. The aftermath 


Adopting an epithet which Lenin used to designate the Tsarist Empire at the eve of 
World War I, the Soviet Union has often been branded as a prison of peoples by 
western journalists. When, following the collapse of the Soviet Union, some of the 
newly independent states passed harsh language laws which aimed at promoting 
the new national language at the cost of multilingualism, this has been commented 
on as a deplorable but nevertheless understandable backlash against Soviet oppres- 
sion. This view takes national identities for granted, as if they were unquestionable 
cultural facts. As was demonstrated above, national and ethnic identities in the 
Soviet Union were, however, rather the product of a language ideology which was 
fostered by Soviet state ethnographers and successfully implanted into the con- 
sciousness of the general public by way of administratively ethnifying every single 
Soviet citizen. So, recent language nationalism in the newly independent states is 
not so much a backlash as a continuation of the Soviet tradition of equating lan- 
guage with ethnicity. In any event, it will and does provoke reactions from those 
who are most severely affected by the new situation. Ever since the breakup of the 
Soviet Union, Russian speakers in the newly independent former Soviet republics 
underwent a process of minorization. The slow realization of the new legal status 
has left the Russian-speaking population in a state of limbo, in which it is left to the 
individual to position themselves linguistically and socially. The unclear situation 
has led to reactions ranging from an insistence on lost monolingual privileges, 
which goes so far as accepting the denial of citizenship, to extreme cases of ethno- 
phile affirmation of the constituting language that has replaced Russian as the 
primary medium of communication. In some cases this problem of individual adap- 
tation has condensed in political reactions. Thus, Moldavia is currently experienc- 
ing a low-level ethnic conflict with its Russian minority, which strives to reinstitute 
Russian sovereignty on the smallest conceivable scale in a dwarf state of their own 
making, known as Transnistria (Bruchis 1996). A more widely known case is the 
conflict between Russian-speaking eastern Ukraine and Ukrainian-speaking west- 
ern Ukraine, which culminated more peacefully in the Orange Revolution of 2003. 


Note 


] Titular languages were the official languages of the non-Russian Soviet republics. Unlike 
genuine official languages they served only restricted official functions. Most notably, 
they were banned from interrepublic and international communication. The only truly 
official languages of the Soviet Union which were recognized above the level of their re- 
spective Soviet republic were Russian, Ukrainian, and Belorusan. 
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Conclusion and outlook 


day southeastern Europe has been marked by 19th century linguistic 
nationalism, an ideology that draws national boundaries primarily on the basis of 
people's mother tongues (Stukenbrock 2005: 33—68). Especially in the territory 
up what used to be Yugoslavia we find the formula confirmed that “lan- 
guages multiply with states, not the other way around" (Hobsbawm 1990: 63). In 
order to understand these processes, it is first of all necessary to understand the his- 
tory of the Serbo-Croatian language and the political context in which it emerged. 
Only then will the dynamic character of Serbo-Croatian, starting out as and giving 


rise to separate literary languages as illustrated in Figure 1, become clear. 


19th Century 1st and 2nd Yugoslavia post 1991 
„Ausbau“ „Umbau“ „Abstand“ 
Serbian Serbian 


ee Serbo-Croatian Croatian 
Croatian 1918-1941: monocentric Bosnian 


Figure 1. 


1945-1991: pluricentric (Montenegrin) 


The “life cycle” of Serbo-Croatian 
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1. The choice of the base dialect for Serbo-Croatian 


Yugoslavia means “land of the southern Slavs”; it is a programmatic supra-ethnic 
term, which as such is a misnomer given that the Bulgarians refused to play along. 
The corresponding language concept that developed is called Serbo-Croatian, 
even if this language is spoken by four nations, namely by Serbs, Croats, Muslims/ 
Bosniaks, and Montenegrins. The designation, however, is logical on historical 
grounds since in the 19th century only Serbs and Croats were involved in initiating 
this programme of linguistic convergence. 

We usually speak of two Yugoslav states existing in the 20th century (Sund- 
baussen 1993): The first, founded in 1918 after World War I, was called the “King- 
dom of Serbs, Croats, and Slovenes" until 1929 and thereafter "Kingdom of 
Yugoslavia". The second Yugoslav state was founded in 1944 by Tito's partisans 
and lasted until 1991. From 1992 to 2002 there existed a state called “Yugoslavia” 
but it only included Serbia and Montenegro. 

Post-Yugoslavian language policy seeks to reconcile linguistic-dialectal and 
ethnic boundaries. A brief explanation of Southern Slavic dialectology will show 
that this is more than futile. 

The dialect division in Yugoslavia was based on two criteria: first, the conson- 
ant area, which is based on the question word “what?” and appears as kaj, ca, and 
Sto. Kajkavian can be found in North and Central Croatia (including Zagreb), 
Cakavian on the Adriatic islands and the coastal strip from Istria to Dubrovnik. 
Stokavian makes up the greatest part of the dialect continuum, namely the bigger 
part of Croatia less the formerly mentioned regions, the whole of Bosnia-Herze- 
govina, and all of Serbia and Montenegro. Secondly, the three main dialects are 
subdivided by the continuation of the Proto-Slavic vowel *é, which is especially 
important regarding the Stokavian dialect: It became /i/ in the less important Sto- 
kavian-Ikavian dialects (1.e. mliko, dite, ric, bili), /e/ (1.e. mleko, dete, rec, beli) 
in Stokavian-Ekavian, spoken in Serbia, or rather the east, and either /je/ (short 
syllabic) or /ie/ or /ije/ (long syllabic) (i.e. mli(j)eko, di(j)ete, ri(j)ec, bi(j)eli) in 
Stokavian-Ijekavian, which is spoken in Croatia, Bosnia, parts of Serbia, and Mon- 
tenegro. To summarise, with Kajkavian and Cakavian the Croats have exclusively 
Croat dialects while there is no exclusively Serbian dialect for the Serbs. The clo- 
sest candidate is Stokavian-Ekavian but it does not cover all of the Serbian-settled 
territory. While all Ekavian speakers (especially the prestigious urban dialects of 
Belgrade and Novi Sad) are Serbs, not all Serbs are Ekavian speakers. 

While in the early 19th century Croatia was marked by significant dialectal dif- 
ferentiation and literary tradition in all of the three above-mentioned main dialects, 
Serbia was characterised by diglossic heterogeneity with the vernacular, on the one 
hand, and the so-called Slaveno-Serbian, on the other hand. This Slaveno-Serbian, 
a mixed language of Church Slavonic and Russian, appeared after the Serbs had 
emigrated from Kosovo in 1690 to seek the protection of the Habsburg monarchy 
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in Southern Hungary. But Russia also held its protective hand over them, e.g. by 
sending books and teachers, which explains the strong impact of Russian on the 
written language. But this language amalgam proved to have only minimal ties to 
spoken Serbian. This is the reason why in Serbian the most radical path was taken, 
unlike in Russian or Bulgarian, where a more or less soft hybridisation process 
ended the diglossia. 

The language model of Vuk KaradZié from the early 19th century is based on 
the most widespread of these previously mentioned dialect types, namely Stoka- 
vian-Ijekavian, which was partially spoken by Montenegrins, Muslims, Croats, 
and Serbs. With Vuk KaradZié's language model, which the Croats received more 
enthusiastically than the Serbs, the door for cultural and political unification of the 
“Yugoslavs” was thrown wide open. The consistently phonetic spelling (Pisi kao 
Sto govoris! [Write like you speak!]) corresponded to an equally consistent sup- 
pression of all Church Slavonic vocabulary and morphology. 

In 1850 the so-called Vienna Literary Agreement, a non-binding declaration of 
intent, was signed by a number of intellectuals from Belgrade and Zagreb. This 
agreement is generally considered the hour of birth and an icon of the Serbo-Croa- 
tian language and it seems clear that the natural basis of the written language in Yu- 
goslavia after 1850 was the Stokavian dialect, as it covers more than 80 96 of what 
was once Yugoslavian territory (cf. Naylor 1980, Okuka 1998: 41—66, Greenberg 
2004: 16—57). This is really the point to which everyone wants to return since the 
dissolution of Yugoslavia, which, of course, is not possible. 


2. From linguistic convergence to political union 


In the Vienna Agreement two important decisions were laid down, based on the 
premise that one nation should have one language. The delegates decided to use a 
real dialect and rejected creating an artificial dialect mixture. This was to be Vuk’s 
dialect, Ijekavian NeoStokavian, which had already proved itself as a successful 
medium for spreading the epic songs of Kosovo myth. It is somewhat surprising 
(and not without irony) that this was decided by people who themselves were not 
native speakers of this dialect, namely Kajkavian speaking Croats from Zagreb and 
Ekavian speaking Serbs from Belgrade. The prototypical representatives of the 
chosen dialect type, the Bosnian Muslims and Montenegrins, were not even pres- 
ent at the meeting in 1850. 

In the course of the 19th century we thus see the convergence of two language 
communities which for centuries had been divided by religion and culture, a divide 
manifesting itself in the duality of the Latin and Cyrillic alphabets. The Vienna 
Agreement remained a declaration of goodwill and no codified standard language 
ever emerged from it. Instead, differences continued to exist, especially in the vo- 
cabulary, which resulted from different diglossic situations and different concepts 
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Table 2. Language status ca. 1800 


Croatia Serbia 

interlingual diglossia intralingual diglossia 

high varieties: German, Latin, Hungarian high varieties: Slaveno-Serbian 

low variety: Croatian low variety: Serbian 

xenophobic purism: elimination of loan ethnographic purism: elimination of Church 


words and strong tendency towards calques Slavonisms but open to internationalisms 


of purism. These lexical duplicates form the core of post- Yugoslavian language pol- 
icy and are used by nationalist language politicians as historical points of reference. 

The Croats had German, Hungarian, and Latin as high varieties so that one can 
speak of an interlingual diglossia. In this situation it was only logical that Croatian 
purism would be xenophobic, i.e. directed against foreign words, which were 
replaced by calques. This kind of internal loaning camouflages the language's 
dependence on a role model and thus serves as an effective means of language 
nationalism. The Serbs, on the other hand, had Slaveno-Serbian as a high variety, 
a closely related Slavic language, which is a form of intralingual diglossia. For 
Serbian purists, foreign words were not a problem; instead, what was to be avoided 
were loans from Church Slavonic and Russian. 

In 1918 Croats and Slovenes suddenly found themselves living in a common 
state with the Serbs, who were merciless in applying their numerical superiority for 
political ends. In the first and second Yugoslav state, Serbian, i.e. the Ekavian vari- 
ant spoken in the political and cultural gravitation centre of Belgrade, was the 
dominant form of speech. Croatian nationalists could thus always use the language 
argument to attack political imbalances. 

Right from the start Serbo-Croatian was a weakly codified language. Even after 
70 years in a common state and despite 99.5 % intercomprehensibility between 
Croats, Serbs, and Muslims, it was clear after no more than two or three sentences 
which area a native speaker came from or to what ethnic group he or she belonged. 
The ethnic markers in the language that ultimately ensured the group boundaries 
were significantly stronger in rural areas than in the urban melting pots. With the 
demise of Yugoslavia, attempts were made to strengthen these "soft ethnic bar- 
riers", which went hand in hand with an idealisation of rural language traditions. 


3. The (re-)nationalisation of Serbo-Croatian since the 1970s 


How is it possible that after 1991 Serbo-Croatian did not simply split into its two 
components but into four successor languages, namely into Bosnian and Montene- 
grin as well? The key to answering this question can be found in Tito's ethnic pol- 
icy (Blum 2002, Voß 2008: 7-13). Tito was intent on making up for the failures of 
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Table 3. | Ethnopolitics in the 20th century 


Number of lst Yugoslavia 2nd Yugoslavia Post-Yugoslavia after 
1918-1941 1945-1991 1991/92 
unitarism federalisation national states 
States 1 1 7 (Serbia, Croatia, 
Slovenia, Bosnia, 
Montenegro, 


Macedonia + Kosovo) 


Nations 1 (as “three tribes": 6 (Serbs, Croats, 6 
(according to Serbs, Croats, Slovenes + 
the constitution) and Slovenes) Macedonians, 


Montenegrins and, as 
of 1968, “Muslims’’) 


Languages 1 Serbo-Croatian 3 (Serbo-Croatian 5 (Serbian, Croatian, 
(“and Slovenian + Slovenian and Slovenian, 
dialects") Macedonian) Macedonian + 
Bosnian) 
6 (all of the above + 
Montenegrin) 


the first Yugoslavia in terms of integration and therefore ventured several ethno- 
political experiments. In 1944 a Macedonian nation was proclaimed and in 1971 
the Muslims were recognised as a national group on the federal level. Tito’s policy 
since the 1960s to counteract all centrifugal national desires also became relevant 
in the domain of language policy: In 1971 it was determined that, in addition to 
Croatian and Serbian, there also existed a “Bosnian-Herzegovian standard idiom” 
and a “Montenegrin sub-variety" — both anything but precise linguistic terms and 
rather symptoms of a walk on political tightrope. 

The emergence of ethnic groups in most of post-communist Europe - to a lesser 
extent in the Soviet Union — has by no means been a reaction to the communist 
period, in which such alleged conflicts were put on ice, but the very opposite. As a 
matter of fact, many ethno-national categories were communist artifacts and the 
creation of six Yugoslav nations has consequently led to the implementation of six 
post- Yugoslav languages. 

What, in terms of language, is the situation in ex- Yugoslavia today (Alexander 
2002-2003, Gustavsson 2002-2003)? The end of the Yugoslav state also sealed the 
fate of the Serbo-Croatian language. But the question is how to create four standard 
languages which are based on (a) ethnic criteria and (b) the same dialect, and which 
are thus highly intercomprehensible? I would like to approach this situation from 
two angles, namely (a) via the linguistic (primarily lexical) reconstruction of Croa- 
tian, Serbian, Bosnian, and Montenegrin, and (b) via typological aspects of this 
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post-Yugoslavian divergence. In doing so I will differentiate between directed re- 
standardisation (section 4) and colloquialisation (especially in section 5), a general 
phenomenon that can be observed across the post-communist Slavic speaking 
world in the 1990s (Okuka 1998: 67-128, Kordić 2004, Bunéié 2008). 


4. The four successors of Serbo-Croatian: 
Strategies of re-standardisation 


Given that Serbo-Croatian is the result of an agreement on a common base dialect, 
it is only logical that regional differences emerge, especially in the vocabulary, 
which reflect different ways of cultural and linguistic roofing (in the sense of Ger- 
man "Überdachung"). The oft-drawn comparison of Serbian and Croatian with 
British and American English is too short-sighted. 


4.1. Croatian 


Language politics in Croatia are clearly the most active in ex- Yugoslavia, based on 
a long and rich literary tradition in all three major Croatian dialect groups (Caka- 
vian, Kajkavian, and Stokavian) before the official shift to Stokavian in the Vienna 
Agreement in 1850. A significant number of neologisms have emerged in Croatian 
since the 19th century as a result of its purist tradition oriented towards the Czech 
model (e.g. veleposlanstvo instead of ambasada 'embassy', which has been ac- 
cepted by the speaker community). In addition, a partial return to an etymological 
spelling, the korijensko pismo, has been initiated, which contrasts with the pho- 
netic principle of Serbian spelling. 

The targeted differentiation between Serbian and Croatian is found mainly in 
the vocabulary since the Croats have been unable to agree on a truly radical step, 
namely the “reintroduction” of the Kajkavian or Cakavian dialect. The language 
politicians have acknowledged that the broad masses are very selective in accept- 
ing neologisms in their day-to-day use of the language. It is difficult to assess 
whether this prolonged policy of Croatian purism will mean that in a few decades 
an interpreter will become necessary for Croats and Serbs to communicate. 

Large parts of the Croatian media ignore the prescriptive language policy. Lang- 
ston/Peti-Stantié (2003) report that the Croats still use Serbisms, especially those 
who rejected Tudman's policy during the 1990s, among them many individuals of 
higher education. The gender aspect of the survey confirms socio-linguistic findings 
that especially women are hypercorrect in their language behaviour (in this case in 
their use of pure, Serbism-free Croatian) while men tend to use more substandard 
and subcultural forms of language defined as particularly manly or cool, with so- 
called “covert prestige". Essentially, the language community is clearly aware of a 
neologism bordering on the ridiculous. For example, brzogriz (literally “fast-bite”; 
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“fast food’) or mamutnjak (‘jumbo jet’) never stood a chance in being accepted in 
everyday Croatian language use. 


4.2. Serbian 


From the point of view of the Croats, the Serbs are the speakers of Ekavian. At first 
sight, this is true because Ekavian Stokavian can only be found in Serbia. However, 
the situation is more complicated than that. The western part of the Republic of Ser- 
bia is Ijekavian speaking, as are Montenegro and those regions in Bosnia and Croa- 
tia where Serbs live. If we believe that the Serbs fought their wars after 1991 so as to 
unite all Serbs in one state, their arguments could not have been based on language. 

To anticipate this event symbolically, it was in 1993 that the Bosnian Serbs 
introduced Ekavian Serbian as the norm in those areas controlled by them - this 
(rather absurd) plan failed, however, and was withdrawn. In 1996 a second attempt 
was made to force the Bosnian Serbs to adopt the language of Belgrade — and failed 
again. This shows that even Serbian nationalists suffer as a result of the fact that 
their written language does not have an exclusively ethnic base dialect. This cannot 
be changed, however, as shown by these two failed experiments of introducing 
Ekavian as the norm in the Bosnian Republika Srpska. 

The absence of a clear and consistent Serbian language policy in the 1990s is 
founded precisely on this contradiction. If Serbian nationalists were to follow their 
Croat counterparts and attempt to turn back the clock to the situation before 1850, 
Le before the language compromise between Croats and Serbs, and thus to return 
to an ideal Ekavian, they would at the same time exclude their Serb co-nationals 
in Croatia, Bosnia, and Montenegro. It is rather the language policy of the latter, 
i.e. the Serb co-nationals in these three post- Yugoslav states, which will in the end 
determine how the Serbian nationalists in Belgrade must position themselves. 


4.3. Montenegro 


The Montenegrin language question is also part of the Ijekavian-Ekavian debate 
(Greenberg 2004: 88-108): Should a Montenegrin language emerge, which inevi- 
tably would need to be Ijekavian, then the Serbs, in a chain reaction, would even 
more be forced to stylise Ekavian as "real Serbian". Alternatively, if Belgrade lin- 
guists defined Serbian exclusively as Ekavian, this would force the secession of 
Montenegrin. It remains to be seen which side will take the first step. 

The Montenegrin language project was initially a one-man show of the linguist 
Vojislav Nikčević, who reinterpreted Vuk KaradZié's imperative in his 1993 book 
Piši kao sto zboriš [Write like you speak]. Based on the word zboriti in place of the 
word govoriti, he argued that Karadzi¢’s legacy was in fact Montenegrin. The most 
striking feature of this planned Montenegrin language is to make it different from 
Serbian/Croatian/Bosnian by means of three new graphemes, all of which indicate 
specifically Montenegrin palatal sibilants. 
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Unlike the other two graphemes, <s> as a Cyrillic character had been widely 
used in Old Church Slavonic manuscripts for [z] with dental epenthesis (as a result 
of the second Slavic palatalisation). But Nikéevié overgeneralised this phoneme 
and applied it even to words such as bendzin, brondzin, jedzero (gasoline, bronze, 
lake’). This radical Montenegrin language experiment of the early 1990s clearly 
needs some more thought. 

The Montenegrin independence movement has been dependent on inconsistent 
European Union policy since 1991. While the EU promoted independence during 
the Milošević regime, Montenegrin independence was suddenly less welcome 
after 2002 and Serbian democratisation. As might be expected, the successful gain- 
ing of independence in 2007 has also resulted in increased language planning 
activity. But considering that, among other things, approximately half of Montene- 
grins declare themselves to be ethnic Serbs, the Standardisation Commission 
established in 2008 can be predicted to steer a moderate course and avoid setting in 
motion an anti-Serbian purism discourse. 


4.4. Bosnian/Bosniac 


Bosnian (Mgnnesland 2004) was declared a state language in the Dayton agree- 
ment of 1995. Here again we have the rare case (like in Macedonia in 1944) where 
a language is declared an official language for which neither a grammar nor dic- 
tionaries are available. Listed below are the main features of the Bosnian language, 
which should better be called Bosniac (bosnjacki) since only Bosnian Muslims 
identify with it: 


preference for the Latin script; 

e cultivation of the sound /h/, as in duhan (Bosnian and Croatian) vs. duvan (Ser- 
bian) ‘tobacco’; cf. similarly local Bosnian pronunciation habits like the fol- 
lowing: lahko vs. the Serbian/Croatian lako ‘light’, hudovica vs. the Serbian/ 
Croatian udovica ‘widow’, or snaha vs. Serbian snaja ‘daughter-in-law’; 

e an intermediate position between Serbian and Croatian in cases like hemija vs. 
kemija (‘chemistry’), which follows Bosnian Serbian and not Croatian, or 
cases like općina vs. opština (‘municipality’), which follows Bosnian Croatian 
and not Serbian; 

e cultivation of the Ottoman-Turkish heritage in the lexicon. 


Bosnian tries to force Turkisms as a symbol of the Ottoman past and a Muslim 
identity. Traditionally, the Bosnian language area is marked by a so-called “variety 
neutrality" between the Serbian and Croatian pole of the continuum, i.e. both var- 
ieties appear in Bosnia. This is revealed in a sample from the Oslo language corpus 
for the Bosnian language of the 1990s (Mgnnesland 2004: 153). The term for 
*week' yields a typical result: Four lexemes were found in the Bosnian press, with 
the Turkism (hefta) used most rarely (only 13 instances) and the neutral term (sed- 
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mica) most frequently (45 instances). The Serbian (ned(j)elja/37) and Croatian 
(tjedan/26) were also used relatively frequently. 


5. Croatian-Serbian differences in the morphosyntax 


Having thus far addressed almost exclusively the lexical differences between 
Croatian and Serbian, I will in the following try to sketch the central post-Yugos- 
lavian dichotomy on the level of morphosyntactic variation typology (based on 
Kunzmann-Müller 2003: 723-725), where the conflicting tendencies of collo- 
quialisation and re-standardisation take effect. Is the divergence in the morphosyn- 
tax of Croatian and Serbian intentional, i.e. a result of focused language policy 
(“Ausbau”) or are the languages drifting apart naturally? 

This question will be reviewed on the basis of several examples. The short list 
discussed below is exhaustive and shows clearly how minimal the differences be- 
tween Croatian and Serbian actually are on the grammatical level — for each phe- 
nomenon the colloquial language provides intersecting sets. The characteristics are 
listed in the order of decreasing significance as "national linguistic markers" and 
summarized in the synopsis in Table 4: 


(i) Above all, the infinitive in Croatian is opposed to the da (‘that )-constructions 
of Serbian. The variance in the sentence translating English J want to work (Croa- 
tian hocu raditi and Serbian hocu da radim) is generally considered the major dif- 
ferentiating feature between Croatian and Serbian on the morphosyntactic level, 
supported by the increasing instability of the infinitive towards the southeast of the 
Stokavian speaking area (cf. chapter 16 by Tomié in this volume). This difference 
is not a construct of the 1990s, even if today the use of this form is highly politi- 
cised. The Serbian infinitive is in fact undergoing language change on a broader 
scale, leading to an increasing degradation of a commonly inherited category. 


(ii) The second important difference between Croatian and Serbian relates to the 
position of clitics, namely between the position demanded by the syntax and the pho- 
netic position in the sentence. Here a definite tendency can be observed in written 
Croatian for the phonetically based variant, i.e. for preserving the maximum clitic 
character, while both written and colloquial Serbian prefer the syntax-based variant: 


(1) Serbian: 


Tih prolet-nih dana bio 
DEM.GEN.PL. spring-ADJ.GEN.PL. day.GEN.PL. COP.PST.PTCP. 
je neobicn-o uznemiren. 


be.AUX.PRF unusual-ADV. nervous. ADJ.NOM.SG.M. 
‘In these spring days, he has been unusually nervous’ 
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(2) Croatian: 
Tih je proljetnih dana bio neobicno uznemiren. ‘In these spring days, he has 
been unusually nervous’ 


(iii) The use of the instrumental with the predicative: In Yugoslav Serbo-Croatian 
the variation between nominative and instrumental was subject to probabilistic 
tendencies. With bare nominals the instrumental tended to be used more fre- 
quently. For example: 


(3) Petar je postao direktor / direktorom. 
Peter be.3.8G. become.PST.PTCP.M.SG. director-NOM / director-INS. 
‘Peter has become director’ 


The nominative form reflects the natural use in colloquial Croatian and the Serbian 
standard. The forced use of instrumentals in the Croatian press in the 1990s can be 
seen as targeted linguistic engineering to develop the divergence between Croatian 
and Serbian. The opposite process, i.e. the natural degradation of categories, can be 
seen in 16th century Polish while the instrumental is still obligatory in present-day 
Russian. 


(iv) With the exception of the postposed definite marker in Bulgarian and Mace- 
donian, the category of finiteness is not grammaticalised in the Slavic languages. 
However, Serbo-Croatian has adopted from Old Church Slavonic the distinction 
between a definite (formally: pronominal) and an indefinite (formally: nominal) 
adjective declination, which today can be discerned exclusively in the paradigm of 
the masculine singular: 


(4) Indefinite: Imam nov auto. 
have.1.SG.PRS. new.ADJ.INDF.SG.M.ACC. car.M.SG.ACC. 
‘I have a new car.’ 


(5) Definite: Novi auto je 
new.ADJ.DEF.NOM.SG.M. car.NOM.SG.M. be.3.SG.PRS. 
dobar. 


good. ADJ.NOM.SG.M. 
"The new car is good.’ 


On the colloquial level, both Serbian and Croatian have largely generalised the 
definite form, with Croatian again being more conservative. 


(v) The same applies to the Croatian-Serbian differences in the inflection of car- 
dinals. While both colloquial Croatian and colloquial Serbian reveal a clear ten- 
dency to fewer inflections, linguistically censored Croatian texts of the 1990s con- 
tain the archaic forms of the cardinal numbers 2 to 4 (only these forms are ever 
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subject to inflection). The fixed expression u očima ta dva Cov(j)eka ‘in the eyes of 
these two people', with the cardinal in the nominative and numbered form in the 
paucal, is characteristic of both Croatian and Serbian colloquial practice, as in (6): 


(6 u ocima ta dva 
in.PREP. eye.LOC.PL. DEM.NOM.PL.M. two.NOM.NF. 
Cov(j)eka ... 
people. NOM.PAUCAL.M. 


‘in the eyes of these two people ...’ 


By contrast, inflected forms of cardinal numbers and of the numbered form, as in 
(7), reflect the Croatian principle of maximum divergence: 


(7) član koalicije cetiriju stranaka 
member.NOM.SG.M. coalition. GEN.SG.F. four. GEN. party. GEN.PL.F. 
"member of the coalition of four parties’ 


This brief overview should be sufficient to demonstrate that the Serbian-Croatian 
dichotomy should not be understood as a contrast between two ontological entities. 
This has also been confirmed by the new Bosnian standard, which — except for the 
geographically-related absence of the infinitival complement construction in (1) — 
reflects the Serbian standard. Bosnian is missing both an instrumental predicate 
and clitics in distant position. This is nothing categorically new but rather an ar- 
chaification of the language. More precisely, it is the withdrawal of certain neo- 
analytical and substandard colloquial trends which in partially Balkanised Serbian 
have become the unmarked norm that during Tito's reign had also spread to Bosnia 
and Croatia. 


Table 4. | Morphosyntactic differences between Croatian and Serbian in their colloquial 
and standard varieties 


std. Serbian coll. Serbian std. Croatian coll. Croatian 


1. infinitive + — + +/— 
da-periphrase + + = = 
2. clitic position 
phonetic — = + = 
syntactic + + - 4 
3. predicative 
nominative + + +/- + 
instrumental — — + +/— 
4. opposition of def. +/— — * +/— 


vs. indef. adjective 
5. flexion of cardinals +/— — + +/— 
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6. Slovenian and Macedonian 


Besides Serbo-Croatian, Socialist Yugoslavia had two other official state languages, 
namely Slovenian and Macedonian, with histories very different from each other. 
The Slovenian dialects are Kajkavian, like the northwestern Croatian dialects 
(around Zagreb). Slovenian codification took place in the period of the Croatian-Ser- 
bian convergence from the 1830s until the 1850s, when the Croats rejected Kajka- 
vian for Stokavian as the written norm. During these decades Slovenian was codified 
above all by the author France PreSeren and reached a masterful literary form. This 
notwithstanding, for the first Yugoslavia the unifying formula of the “three-named 
nation" (troimeni narod) was found, according to which Serbs, Croats, and Slovenes 
were part of a single nation, making Slovenian a mere “dialect” of Serbo-Croatian 
(see Table 3 above, bottom slot of the column “1st Yugoslavia 1918-1941”). 

Macedonian was declared an official state language in 1944 and codified in the 
following decades (the authoritative dictionary of BlaZe Koneski was published as 
late as in the early 1960s). The linguistic strategy was one of maintaining equidis- 
tance from both Serbian and Bulgarian (Voß 2004: 102-106): The distinctiveness 
of Macedonian can be found in a strong Serbian roofing, with an analytical mor- 
phosyntax found in other Balkan languages such as Albanian, Greek, Romanian 
and Bulgarian (cf. Tomié in this volume). 


7. Conclusion and outlook 


The post- Yugoslavian language situation requires a specification of sociolinguistic 
parameters. Wingender (2003) has postulated a gradual “standardness” in the 
Slavic languages. To the traditional criteria of norm, obligation, polyvalence, and 
stylistic differentiation, she adds a social and situational level. With the dissolution 
of Yugoslavia, mental, cultural, and historical factors have increased in import- 
ance. These factors have had significant effects on attitudes and symbolic ideals in 
the former Yugoslav lands. Wingender's differentiated model for describing stan- 
dardisation is able to resolve the apparent contradiction between the new national 
symbolic value of language and the Yugoslav linguistic habitus. 

The high level of mutual intelligibility between Bosnian, Serbian and Croatian 
allows the War Crimes Tribunal at The Hague to implement a pragmatic solution to 
their interpreter activity; speakers of the new Croatian, Serbian and Bosnian stan- 
dards rotate in 30-minute intervals (DraZenovié-Carrieri 2002). The general limit- 
ing of the variance between the three standards to the vocabulary will allow texts to 
be translated from one of these languages to another using simple computer pro- 
grammes, as is current practice in the administrative apparatus of the two entities 
of Bosnia-Herzegovina (i.e. the Republika Srpska and the Croatian-Muslim en- 
tity). Largely identical texts are printed three times, once in Cyrillic as Serbian and 
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twice in the Latin script as Croatian and Bosnian, a symbol of the equality of the 
three ethnic groups living in Bosnia-Herzegovina. 

According to the “Western Balkans" agenda of the European Union, all the 
countries of post- Yugoslavia are to be integrated into the EU within the next 20 to 
30 years. Will the EU remain true to its principle of national plurality and equality 
and adopt all four post-Serbo-Croatian languages? Or will it, for financial reasons, 
make it a condition for joining the EU that the post- Yugoslavia countries return to 
the old standard (see Schubert 2008)? On the surface, this may seem a marginal 
question, but it touches on the central identities of the post- Yugoslav countries as 
well as of the European Union and will not be easy to answer. 


See also the following chapters in this volume: 5 by Hansen, 22 by Hill, 16 by 
Tomic, and 20 and 42 by Stern. 
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1. Introduction 


This chapter examines the close association that language has with various forms 
of political and social organisation and how the spread of the nation state system as 
the way of organising the world has had a profound effect on the language practices 
of most human groups. 

Determining the links between language and political and social organisation 
has only recently become a focus for research and is still unrecognised as an im- 
portant approach. There are, as yet, no politico-linguistics departments to provide a 
consistent macro-dimension alongside the micro-investigations of psycholinguists 
and the meso-focused research of sociolinguists. This is perhaps not so surprising 
since the study of nationalism itself does not have a long tradition in higher edu- 
cation. Early theorists tended to study the phenomenon from an activist position. 
There is no single figure or group that defined the ideology. 

In addition, the linguistic turn did not penetrate as deeply in political science as 
in social science. In the 1960s and 1970s social scientists began to study language, 
not as the neutral medium that simply records social reality, but as the very means by 
which humans construct it. The linguistic turn among political scientists was never 
so widespread and many scholars in this field do not regularly reflect on language. 

Nonetheless, if students of nationalism wanted to do so, there is a small body 
of work which addresses the language question and to which they could turn. De 
Mauro’s (1963) study of the Italians, Weber’s (1976) study of the French, Haugen’s 
(1966) study of the Norwegians are just three examples of histories of nation build- 
ing where the role of language is shown to be central. There is also work which has 
developed the relationship of language and nation building from a theoretical per- 
spective. Anderson’s (1983) study of the role of the printed word in the construc- 
tion of imagined communities and Gellner’s (1983) analysis of the roles of 19th 
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century industrialisation, education and urbanisation in the spread of national lan- 
guages are seminal works, which reveal the constructed nature of national lan- 
guage groups. 

It is this process of construction that I want to address here. Its position as final 
chapter seems fitting since the content of essentially every preceding chapter re- 
lates in some way to nations and nationalism. The chapters on the language phyla 
of Europe evoke the period before the modern nation state. In the pre-national lin- 
guistic landscape, dialects merged into each other along a continuum; only on the 
breaks between phyla was there incomprehension. The chapters on the codification 
and standardisation of language by monarchs and governments show how this situ- 
ation changed and certain vernaculars became national languages. Other chapters 
describe the linguistic minorities that developed as a side effect of nation building 
and the spread of the national standard. Further chapters trace the history of liter- 
acy, once an affair of the church and then the responsibility of the state. Finally, the 
chapters that deal with foreign language learning and lingua francas examine how 
we communicate across national boundaries. In this final chapter, I bring together 
these various aspects of nation building and language and provide a chronological 
account of the relationship in central and western Europe, where the concept of the 
nation state was first elaborated. 


2. Europe before the nation state 


Before Europe became a mosaic of nation states, its political organisation was both 
more European and more local. The ruling class acted on a European stage. Where 
the feudal system pertained, relations of allegiance and alliance constructed net- 
works that spanned the continent. In times of war, these feudal networks were ac- 
tivated and vassals and allies were called upon for support. The major dynasties ce- 
mented alliances by choosing marriage partners for their children among a small 
continental elite and sending their sons to be raised in other households. These 
links usually required a sizeable retinue to accompany the royal person. War and 
diplomacy thus made royal courts of the late medieval period multilingual (Dun- 
babin 1991; Anderson 1983; Given 1990). Such exchanges produced a ruling class 
whose frame for action was Europe. 

The Crusades against the Muslims in the Middle East caused further contact 
among European nobility and produced a common enterprise, where, despite all 
the rivalry and squabbling, individual protagonists saw themselves as part of the 
defence of Christian holy sites and the promotion of Christendom, and interacted 
with their peers in a multilingual army in which one or more military lingua francas 
developed (Cremona 1998). 

Elite interaction was always framed by religion. The boundaries for marriage 
and association were limited by membership of either the Catholic or Orthodox 
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Churches, and the two Christendoms produced two sets of networks. In the Cath- 
olic west, the pope presided over an organisation which penetrated all polities and 
claimed ultimate authority. Papal authority was centralised in Rome (and briefly 
Avignon) and transmitted in a hierarchical system to the most far flung parish. 
Latin was the lingua franca which permitted this to happen and even the lowliest 
priest had at least some knowledge of the basic liturgy so that he could fulfil his 
function within this system. Those who were literate in the medieval feudal system 
were usually so because the Church had educated them; in many periods and places 
the ordained were the only literate members of society, and they were literate in 
Latin. In the Orthodox east there was a similar situation and two sacred lingua fran- 
cas: Greek and Church Slavonic. 

Religion thus also contributed to the idea that Europe was the frame of refer- 
ence. We can categorise the clergy as ‘European’, even if they themselves would not 
have understood this categorisation. While many, perhaps most, would not have 
moved far from their place of birth, they belonged to religious organisations that 
crossed boundaries, and used sacred lingua francas that permitted them to do so. 

In contrast to the nobility and the clergy, the peasants were anchored within 
local structures. Most serfs had no right to leave the village in which they were 
born or to choose marriage partners freely. Peasants, with greater rights, were also 
rooted in the land, married within their communities and did not venture far from 
home. The infrastructure for travel was not there and it was dangerous to do so. 
Throughout their lives the vast majority of Europeans had little need to engage in 
communication with outsiders or even with relatively close neighbours. Most com- 
munities were scattered rural settlements with slightly differentiated language 
practices, a situation which produced linguistic fracturing and distinct dialects 
along the dialect continua. The language that the medieval peasant spoke was thus 
a variety whose currency was extremely limited in geographical terms. The excep- 
tions to mono-dialectal peasant groups were rare: there was mixing along the trade 
routes and in the fairs; there was mixing in times of war, when the lord took foot 
soldiers with him on campaigns. The crusades against the Holy Land drew com- 
moners as well as knights. These experiences produced some bilinguals and bi-dia- 
lectals among poorer members of medieval society. At times the peasant may have 
relocated from the local to the European level of identity and multilingualism. If 
this happened it was an exception to the rule. 

In terms of identity, language and culture, the medieval European world was 
thus both more European and more local in its structures and identities than the 
societies which were to follow it. It, therefore, makes little sense to talk of nations 
and nationalism in this period. However, not all authorities are agreed on this. 
A minority makes the case that national identity begins in the medieval period. For 
example, Hastings (1997)) argues that the Anglo-Saxon kings of the 9th and 10th 
centuries were beginning to set up the framework where cohesive ‘national’ iden- 
tity and convergent language practices could develop in Southern Britain. In the 
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French tradition it is unexceptional to find Charlemagne presented as the father of 
the nation, with his educational policies, monetary and legal reforms labelled 
national initiatives. We should beware, however, of viewing the feudal system 
through the prism of nationalism. Allegiance derived from personal relationships 
could reconfigure in the space of a generation and populations could easily find 
themselves transferred, through marriage, inheritance or conquest, from the orbit 
of one dynasty to the orbit of another, without moving physically at all. There was 
no territorial integrity. 


3. The Modern period and the first nation states 


Such integrity began to develop in the 16th and 17th centuries as frontiers were 
fixed by war and treaty, and some European states began to take the shape on the 
map that they have conserved until this day. France, Spain, Portugal, England and 
Sweden were in the first wave of states where rulers started to act in a way that can 
be defined as national. Wresting power from the barons and replacing the loose or- 
ganisation of feudal structures, they governed through bureaucracies (Smith 1991; 
Greenfeld 1992). Louis XIV of France, for example, created a new class of admin- 
istrators drawn from neither the Church nor the feudal elites, answerable directly to 
the crown and overseeing the whole country. He appointed bureaucrats from a new 
middle class, and men like Colbert and Vauban began to manage the economy and 
the defence of France in a national way. The Tudors in England did much the same 
thing. The new class of bureaucrats used the language of the monarch's capital and 
court to carry out their functions, replacing Latin, the lingua franca, and the multi- 
lingual accommodations of the feudal system. Use of the language of the power 
centre was then enshrined in law, as in the case of the British Act of Union (1536) 
that required any administrators of the Crown in Wales to communicate with 
London in English, or as in the case of the French Ordonnance de Villers-Cotteréts 
(1539) that required all contact with the state be in ‘langage maternel francois 
et non autrement' (CNRS 1983). Such use in legal and administrative domains, 
together with literary creation in the vernacular supported by the monarch's pa- 
tronage, promoted the process of linguistic standardisation and convergence within 
the state. 

Other societal developments increased the number of users of the prestige 
version of the vernacular. As Anderson (1983) has shown, printing and the Refor- 
mation interacted, encouraging use of the vernacular as a written as well as a 
spoken language, and aiding standardisation. This happened first in Germany, as a 
by-product of the religious debate sparked by Martin Luther in 1517. The message 
of Luther's writing spread more extensively and penetrated more deeply than the 
ideas of earlier challengers to the pope's authority. Writing in a form of the ver- 
nacular which drew on chancellery literary practices, and using the newly intro- 
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duced printing press allowed him to reach a mass readership, with effects that had 
consequences for language as well as for religious practices. His translation of the 
New and Old Testaments had enormous influence on the development of Standard 
German. 

The translations of the Bible into other vernaculars helped fix the form of a 
number of national languages. There were several translations into English, culmi- 
nating in the King James Bible, which played a central role in the development of 
Standard English. When Denmark and Sweden defined themselves as two discrete 
states and settled on the position of the frontier between them, the translations of 
the Bible in the two polities were purposefully differentiated; the Swedes kept to 
archaic forms in their translation and the Danish translator modernised for his 
readership (Bucken-Knapp 2003). This was an early example of Ausbau (Kloss 
1967), the conscious distancing of a language from its neighbours on the dialect 
continuum, to mark political division. 

Religion contributed in other ways to the construction of national groups; the 
religious wars of the 16th and 17th centuries ended in treaties that set out to divide 
Catholics and Protestants. The principle of cuius regio eius religio, which required 
subjects to follow the same religion as their ruler, homogenised the population 
of one kingdom and differentiated it from its Catholic (or Protestant) neighbours. 
With such divisions and religious literature increasingly in the vernacular, Europe 
began to have territorially based linguistic demarcation among polities, in the 
printed if not the spoken word (Wright 2004). 

The revolutionary movements of the 18th and 19th centuries that brought to 
an end the power of the absolute monarchs and introduced a new political organi- 
sation that drew its legitimacy from the sovereign people also contributed to the 
spread and penetration of the national language. When the people were subjects, 
there had been no need to consult them; when they became citizens and the ulti- 
mate arbiters of who would rule, the question of language competence and com- 
munity membership became a major issue. 

We can see this very clearly in France at the end of the 18th century. As soon as 
the revolutionaries took power, they looked to create a linguistically homogeneous 
people with whom they could communicate. In the very early days of the Revol- 
ution, the Abbé Grégoire conducted a language census (1790—1792) to establish 
the extent of linguistic diversity in France. He found that over half of this new cat- 
egory, French citizen, knew no or only a little French. Information, proclamations 
and decrees were thus translated into the various languages used on French terri- 
tory. However, this practice was very soon abandoned along with any tolerance of 
diversity. A 1793 law forbade the use of anything but French in contracts and for- 
mal agreements, and non-French speakers came under pressure to shift to French. 

Some of the reasons for this were defensive; it became clear that resistance to 
the Revolution was strong in certain language communities. Some of the reasoning 
was ideological. In Barére's speech to the Convention in January 1794, we can see 
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these two strands of thinking. There is a much quoted section in which he con- 
structs those French citizens who do not speak French as potential enemies of the 
Revolution. In a much less well known passage he articulates the view that citizens 
need a common language if they are to exercise any kind of power: 


La monarchie avait des raisons de ressembler à la tour de Babel; dans la démocratie, 
laisser les citoyens ignorants de la langue nationale, incapables de contróler le pouvoir, 
c'est trahir la patrie ... Chez un peuple libre, la langue doit être une et la même pour 
tous. 


It suited the monarchy that (the country) should be a Tower of Babel; in a democracy, 
however, it would be a betrayal of the nation if its members did not understand the 
national language, could not exercise power ... For a free people the language must be 
one and the same for all. 


When John Stuart Mill (1993[1861]: 392) wrote that ‘among a people without fel- 
low feeling, especially if they read or speak different languages, the united public 
opinion necessary to the working of representative government cannot exist’, his 
opinion was reflecting the view of the French revolutionaries. Ruling elites in this 
phase of nation state formation believed that the nation needed one single language 
and set out to homogenise disparate populations into a cohesive linguistic whole. 


4. Ethno-linguistic nationalism 


In the second wave of nation state formation there was again a desire to achieve 
the congruence of nation and state, but here leaders came to the process from the 
opposite direction. The nation building argument in this tradition was not that 
members in a polity needed a common language but that those with a common lan- 
guage should have their own state. In the German and Italian speaking worlds, 
there had been no centralising monarch uniting culturally and linguistically diverse 
groups in a centralised state in which a single language and culture developed. Ger- 
man political philosophers such as Herder (2002 [1784]) and Fichte (2009 [1808]) 
elaborated the idea that shared culture and continual interaction in a distinctive lan- 
guage made German speakers a nation that should be together in a polity. German 
and Italian unification were in essence the creation of states for groups from sec- 
tions of the Germanic and Romance dialect continua.! 

Language played a central role in all 19th century national(ist) movements. As 
various groups within the Russian, German, Austro-Hungarian, Ottoman and Brit- 
ish Empires fought for self-determination for their group, it was clear that their 
(proto) elites needed to convince the mass of the people that they did indeed con- 
stitute a ‘nation’. This could be demonstrated effectively if the group had a lan- 
guage which united them, and which, at the same time, differentiated them from 
their neighbours. This was easy to show where a language was surrounded by lan- 


Language and nation building in Europe 781 


guages from a different phylum (e.g. Estonian), but not always self-evident in the 
great dialect continua. As Kloss (1967) has shown, the linguist's job was to de- 
velop a national standard that promoted maximum comprehension and cohesion 
within the nation and maximum difference from others. Thus the breaks in the 
Slavic, Germanic and Romance continua are in part the result of natural conver- 
gence towards the language variety of different political centres, but they are also 
the result of conscious efforts to plan difference. 

The linguist was also of use, alongside the historian and the archaeologist, in 
establishing the long-standing existence of the group. If the language could be 
presented as dating back far into history, this suggested an ancient culture rooted in 
ancestral territory. And, for the national project, it was not enough to claim descent 
from this ancient language; the link had to be exclusive. Bulgarians, Croatians 
and Macedonians, for example 'advance[d] mutually excluding claims to the un- 
broken continuity of their national languages with Old Church Slavonic’ (Kamu- 
sella 2009: 28). 

Linguists were also involved in the collection of folk tales and national epics 
(e.g. Finland's Kalevala, Estonia's Kalevipoeg) which were circulated to demon- 
strate that the group had a long history. Where there were no appropriate works to 
hand, they were often invented to order. Hobsbawm and Ranger (1983) give details 
of some of this ‘invention of tradition’ (e.g. the works of the Scottish poet Ossian). 
Whether straight invention or the adoption of an ancient story for modern pur- 
poses, all these works tended to offer a myth of origin and promote the idea of an- 
cestral right to territory. 

The great moment for many linguistic nationalist movements came at the end 
of World War I. The desire of the victors to punish the losers and the ambition of 
national elites to gain homelands for their groups coincided. Following Woodrow 
Wilson's philosophy of self-determination, Czechoslovakia, Estonia, Hungary, the 
Kingdom of the Serbs, Croats and Slovenes (later to become Yugoslavia), Latvia, 
Lithuania and Poland were carved out of the defeated German and Austro-Hungar- 
ian empires and post-revolutionary Russia.” 

When and wherever national movements achieved their goal of a separate 
homeland, we witness national language planning. Status planning made the lan- 
guage of the group the national language. However, it was usually a myth that only 
one language group inhabited a space, particularly in central Europe. In the states 
mentioned in the previous paragraph there were sizeable linguistic minorities 
either because of the way frontiers were drawn or because of migration in previous 
centuries (see Kamusella 2009: 52 for details). Sometimes groups with different 
language practices were made to assimilate linguistically to the national language. 
If they did not, they were at a disadvantage in the new nation state system. As using 
the national language became a test of loyalty and an easy measure of who was 
with and who against the nation state, the concept of linguistic minority developed. 
At best, recalcitrant speakers of other languages and dialects were marginalised 
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and ridiculed; at worst, they suffered ‘ethnic cleansing’, either through deportation 
or genocide. 

The nation state system had, and has, little space for multilingualism. Switzer- 
land is often cited to show that the multilingual nation state is possible, and we can 
agree cautiously with this, although the claim ignores that there is territorial and 
linguistic congruence in the cantons, the level of much political decision making. 
Belgium used to be another state traditionally given as an example of a multilin- 
gual polity. Belgium now appears to be fracturing along linguistic lines. Yugosla- 
via, a third supposed illustration, has already fractured, and the nation building in 
its successor states repeats the relentless drive in European polities towards "one 
people; one language, one territory’. 

The nationalists’ fear of linguistic diversity is logical. As soon as one sub- 
scribes to the ethno-linguistic doctrine, one believes that a group with its own lan- 
guage will want independence and to take part of the ‘ancestral territory’ with it 
when it secedes. Worse still, it may want to regroup with an ‘alien’ group across the 
border, increasing the power of ‘Them’ and weakening ‘Us’. 

Once status planning is completed, the next step is corpus planning. Nationalist 
corpus planning is part of Kloss’ Ausbau process mentioned above. The national 
language is developed to be different in the national academy, the national diction- 
ary and the national grammar. Linguists root out foreign influence, particularly the 
influence of those who had previously dominated the national group. Thus Esto- 
nians replaced words from German and Russian; Norwegians split in the battle to 
distance Norwegian from Danish (Fodor and Hagége 1983). 

The third strand of language planning is acquisition planning. The state makes 
provision for all to learn the national language and become literate in it. The 
national education system provides for the younger generation and literacy cam- 
paigns work among adults. The national language is not simply acquired in an edu- 
cational setting, conscription into the national army also has the (side) effect of 
promoting the national language. The national language is learnt in the melting 
pots of large urban and/or industrialised centres. It is promoted as the bureaucratic 
state intervenes more and more in citizens’ lives and demands that all interaction 
with it be in the language of the state (Gellner 1983). 

Because the national language has been planned to be different from allied 
varieties and because it has been imposed top down, right to the very border 
of national territory, the ancient dialect continua start to weaken. As the traveller 
crosses the border from Italy to France to Spain, for example, it is evident that 
these are language as well as political borders. Only those who hold onto their local 
dialect, against all odds, maintain the possibility of comprehending close geo- 
graphical neighbours living on the other side of national frontiers. 

As explained above, the nation state system introduces the idea of minority lan- 
guage. It also establishes the idea of majority. One can no more talk of ‘majorities’ 
in the feudal, medieval period, than of ‘minorities’ or of ‘nations’. Until the nation 
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state system, with its national institutions and protected domestic markets, encour- 
aged internal cohesion and external difference, there was no language that was 
widespread as well as penetrating deeply into society. Majority linguistic groups 
are thus as much constructs as minorities. They come into being through the pro- 
cesses of top down pressures to shift and bottom up decisions to do so, working 
themselves out in the linguistic melting pots of towns and cities. The top down 
pressure came mainly in the shape of the national education system. Bilingualism 
was not encouraged and use of the minority language was rooted out by derision 
(as in schools in the Dolomites, Wales and Brittany, for example) or by coercion (as 
in the case of Catalonia during the Franquist regime). The bottom up decisions to 
shift derived from an appreciation that the national language represented cultural 
capital (Bourdieu 1982) and made social promotion more likely (Ager 1991). In 
the melting pot of new industrial areas or large towns, the national language acted 
as a lingua franca; on the battlefield the national language became the means of 
communicating with other conscripts, understanding the officers and a necessary 
tool for survival 

As a result of these different stick and carrot policies, most Europeans came 
to be monolinguals, fluent in their national language, with perhaps some compet- 
ence in a foreign language learnt formally in school. In the 20th century Europeans 
were less likely to have the complex linguistic repertoires found in other parts of 
the world, where several different languages share the public space. In all linguistic 
mapping of the world, Europe appears as a space of little linguistic complexity 
compared with other continents. Multilingualism and language diversity were 
eclipsed in the national era. 


5. The weakening of the nation state system 


As we know, nationalism exacerbated, where it did not cause, the major European 
wars of the 20th century. The nation state in its European version turned out to be a 
rather dangerous form of political organisation, causing clear lines of demarcation 
among groups in the continent. The European movement had its roots in the inter- 
war period and grew in strength and gained popular support after World War II. 
Two movements set out to dismantle or at least weaken the strong nationalist senti- 
ments that had supported the two world wars. The first, which was to result in the 
European Union, began as a desire to pool the industries which contribute most 
clearly to war. The six founding nations achieved this with the creation of the Euro- 
pean Coal and Steel Community (1952) and Euratom (1958). When this core group 
also agreed to come together in a common market, they began to shake the foun- 
dations of the nation state system, which sees the protected domestic market as its 
linchpin. Further dismantling of the sovereignty of the nation state has continued 
and the number of countries wishing to take this route has grown (27 members in 
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2011). Since the 1992 Treaty on European Union, citizens of the member states are 
also citizens of the European Union and there is provision for the free movement of 
goods, capital, people and services. In some domains, EU law takes precedence 
over national law. 

The second anti-nationalist movement was the Council of Europe, formed to 
promote transnational contact in the immediate post-war period, which has taken 
on the role of promoting human rights. With much less power than the EU, the 
Council of Europe has nonetheless had influence on the convergence of attitudes. 
Its Convention for the Protection of National Minorities and Charter for Regional 
or Minority Languages provide a legal framework for minorities to make states 
recognise their heterogeneity. Its court provides the possibility for the individual 
citizen to seek justice outside and above the state. 

A third development has also promoted a supranational layer of governance. 
Defence is no longer a national affair. In Europe, the Northern Atlantic Treaty Or- 
ganisation has become the international organisation which is the main guarantor 
of European defence.? Thus the nation state is no longer a protected market for its 
citizens, the sole law-enforcer or the single source of their defence. In the econ- 
omic, justice and defence realms, we can argue that we are moving towards a post- 
national period in Europe. And since the national market, national justice and the 
national army were three of the key elements in the spread and maintenance of the 
national language, we might expect to see linguistic developments in parallel with 
the new political landscape. These are indeed discernible and involve two distinct 
sets of phenomena. 


6. The rise of the small state and the autonomous region 


The first of these concerns the change in the status of *minority' groups as many of 
the linguistic groups made minorities by the nation state system become indepen- 
dent or achieve a large measure of regional autonomy. In western Europe, the Cata- 
lans, the Basques, the Galicians, the Welsh and the Scots have all gained some au- 
tonomy. In central Europe, the Slovaks have split from the Czechs, and Yugoslavia 
has fractured into a number of smaller polities. With their new autonomy or inde- 
pendence, these groups have adopted nation building language policies redolent of 
the 19th century. The new polity adopts a language of power, different from the 
previous prestige variety. Politicians make status planning decisions and introduce 
the language variety into the public space and into key domains of use. The next 
step is corpus planning and the language is standardised by linguists. The desire is 
to differentiate one's language from that of neighbours from whom one has sepa- 
rated. Serb, Croatian and Bosnian linguists have spent the last decades on Ausbau 
work. Finally there is acquisition planning and the language is taught in the edu- 
cation systems. 
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We can speculate on the reasons for the rise of the small polity and the auton- 
omous region in Europe. First, as we have noted, the present supranational arrange- 
ments resolve the need for a polity to be a certain size to guarantee its economic and 
military survival. Second, the growth in the acceptance of human rights has given 
instruments to those dubbed ‘minorities’ of the recent nation state past, which 
allow them to choose their own patterns of association. In this they have often been 
supported by the international community (e.g. Bosnia, Kosovo). Whatever the 
mix of reasons, the current linguistic situation sees a renaissance and revitalisation 
of some of the language varieties eclipsed in the 19th and 20th centuries by the 
centralising nation state. Interestingly, these are being imposed in much the same 
way as in the earlier wave of nation state building. In the 21st century Europe is 
home to more national standard languages than in the 20th century. 


7. Globalisation and supranational trends 


The second concerns globalisation. Power may be relocating to the smaller polity 
or autonomy because it is also relocating to the supranational and global level 
in many spheres, thus weakening or circumventing the national level of power. 
Globalisation is the second phenomenon we need to take into account in changing 
language patterns. 

The nation state system is undergoing immense changes as authority and ulti- 
mate responsibility relocate to levels of governance above and below it. We cannot 
know in the present state of the process where it may end, but it is clear that sov- 
ereignty and rights, markets and cultures are no longer exclusively national, in the 
way that they were four or five decades ago. Unsurprisingly, globalisation, inter- 
nationalisation, supranationalism and transnationalism have consequences for lan- 
guage as well, because there must be a means of communication to allow new con- 
tacts and networks at supra-state level. 

This need is felt particularly in Europe. As the member states of the European 
Union pool large swathes of sovereignty and move closer together in economic, 
social, judicial and political domains, they must find ways to communicate. What- 
ever example of collaboration that we take (e.g. among universities for research 
and student exchange; among police forces for the pursuit of international criminal 
networks; among legislators for the harmonisation of national and community law, 
etc.) we find the requirement of a community of communication. 

Different ways of achieving this have been proposed and the Commission itself 
promotes multilingualism. In a growing number of cases, however, the language 
problem has been resolved by the use of a lingua franca. In the early days of the 
European movement this tended to be French; it is now, increasingly, English. 

The spread of English as a lingua franca is not universally welcomed by Euro- 
peans. This is understandable among peoples whose recent nation state experience 
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has led them to see language as an expression of political allegiance as well as a 
means of communication. Given Europe's nation building tradition, it is hard for 
many Europeans to accept that English can be International English. They have 
been socialised to see English as British or American English, the national lan- 
guage of the UK and US. However, this perception may change and the spread of 
English as a denationalised lingua franca may become less contentious. 


8. Conclusion 


As we return in some measure to the European and the local, after two centuries 
when the individuated nation state system held sway, the language regime is recali- 
brating. Now there is greater need for more of us to communicate across borders 
and we need to find an efficient way to do it. This probably rules out the general 
multilingualism promoted by the Commission. At the same time some Europeans 
are returning to language varieties which are used by relatively small numbers in 
small polities and autonomies. In between these two developments the national 
languages continue to dominate, but they do not dominate to the exclusion of other 
languages, as in the old monolingual past. These are interesting times and the 
longer term linguistic outcomes of political developments are not at all clear at the 
beginning of the 21st century. Widespread bi- or tri-lingualism in order to accom- 
modate local, national and supranational association could perhaps be the next 
phase in the history of language in Europe. 


Notes 


1 Not all ‘Italian’ speakers were included. Garibaldi's Nice, for example, was ceded to 
France. The Kleindeutschland solution favoured by the Prussians excluded the German 
speakers from Austro-Hungary. Political considerations always prevailed over cultural 
and linguistic ones despite the rhetoric. 

2 In the USSR, the early idea was that there should be one language per socialist republic, 
although this was later repealed and Russian imposed. 

3 A number of European states (Finland, Ireland, Austria, etc) are not members of NATO. 
Nevertheless, the point holds. If they were to be attacked, such states could rely on the in- 
ternational community, either NATO or the UN, to come to their aid. The absolute neces- 
sity of having an army capable of defending the state has attenuated. 
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V. Research traditions in Europe 


45 Research traditions up to 
and including the 18th century 


Nicola McLelland 


The relationship between language, thought and reality 

The origin of language and languages, and the relationship between languages 
The emergence and development of theoretical grammar 

Studying the vernaculars of Europe and beyond 


Tear c 


This contribution will examine lines of enquiry in linguistic thought from five dif- 
ferent perspectives. Since antiquity, philosophers have pondered the relationship 
between language, thought, and reality. At different times, different answers were 
given to these fundamental questions, which have in turn affected the way work 
on language has been carried out (section 1). A second fundamental question was 
that of the origin of language and languages, and, allied to that, the question of 
relationships between existent languages (section 2). The remaining sections in 
this chapter deal with theoretical grammar (section 3), and the study of the ver- 
nacular languages, both European (motivated by patriotic humanism) and non- 
European (spurred on by the needs of missionary activity in Asia and the Ameri- 
cas) (section 4). 


1. The relationship between language, thought and reality 


Discussion amongst philosophers and language scholars about how and why 
words convey meaning in language can be traced back in Europe at least as far as 
Plato's (429—347 BC) Cratylus dialogue. This dialogue, bearing the sub-title *The 
rightness of names', provided thorough and nuanced discussion of the question 
of whether words conveyed meaning by their nature, or by mere convention. (See 
Joseph 2000 for an accessible but detailed study). The question is often summed 
up as the physei vs thesei debate. Do words convey their meaning by their inherent 
nature (physis), or are they arbitrary tokens, whose meaning is simply agreed by 
speakers (thesis, ‘convention’ or nomos, *law")? A common answer over the cen- 
turies was a partial ‘yes’, though the reasons differed. Words originally conveyed 
meaning naturally, either because language was given by God, or by a first name- 
giver (as in Plato's Cratylus), or (to give a rationalist, eighteenth-century answer) 
because the first language began as a natural expression of emotion, or exertion; 
or because language began as the imitation of sounds found in nature, of which 
surviving onomatopoeic words furnished evidence. The question about nature and 
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convention in shaping language was one way in which European scholars explored 
the fundamental relationship between language, thought and reality. 

The view of the Socrates character in the Cratylus dialogue was that the first 
namegiver for each language had direct insight into reality and was able to assign 
names that reflected what they denoted more or less closely, depending on his per- 
ception and ability to convert what he perceived into speech sounds. Thus Plato 
made his Socrates figure account for the existence of different languages: the dif- 
ferent communities’ namegivers had differing insights into reality. 

Carolingian grammarians realized the need to distinguish carefully, as Aristotle 
already had, between language, thought (studied as logic), and reality, and they 
pondered the relationship between the three. Led by Alcuin (ca 735-804) (who 
spent a decade at Charlemagne’s court and became his biographer), they began to 
reflect on what logic (the ‘dialectic’ of Aristotle) had to tell them about language. 
With a logician’s concern for the truth or falsehood of statements about the world, 
they wondered whether categories of language (such as noun, verb, etc.) were 
identical to the mental categories, the concepts, we use for analysing reality. Later, 
in the eleventh century, an anonymous commentator of Priscian's influential Insti- 
tutiones grammaticae (see section 3) took care to distinguish between 'signifi- 
cation’ and ‘reference’. A linguistic sign such as homo ‘human’ signified a mental 
concept, the quality of being human, but denoted or referred to individual human 
beings in reality. 

Twelfth- and thirteenth-century grammarians cast grammar as a ‘speculative’ 
science, i.e. a science that depended on universal principles to explain its object of 
study. Speculative grammar was, accordingly, a universal grammar, which defined 
parts of speech as (in theory) universal to all languages, though in practice the 
object language was always Latin. They sought to define grammatical categories 
unambiguously, completely independent of the meaning of any individual word, 
by their ‘modes of signifying’ — of which there were various sub-types, both ‘es- 
sential’ and ‘accidental’. To take the example of homo ‘human’, the three general 
essential modes of signifying of condition, stability, and definite reference (per 
modum habitus et quietis et per modum determinatae apprehensionis) determined 
that homo belonged to the category of the noun. Since the ‘noun’ at this period 
encompassed both ‘substantives’ — i.e. what we understand by a noun today — and 
adjectives, a further special essential mode of signifying then specified the sub- 
stantive. So-called accidental modes of signifying determined the properties, or 
‘accidents’, of the noun: whether ‘respective’, relating to the syntactic function of 
the word-class — its gender (masculine), its number (singular), and case (nomi- 
native) — or ‘absolute’ (not relating to its syntax), of which an example is the acci- 
dent of figura, a property which specified whether a noun was simple or compound. 

The advantage of the speculative approach was that it could express the rela- 
tionship between grammar, thought and reality in a unified theory. Characterizing 
the parts of speech relied on ‘modes of signifying’ (properties of language), but 
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these in turn expressed the properties of understanding (properties of mental con- 
cepts, the modi intellegendi), which were in turn derived from properties of things, 
the modes of being, modi essendi. This can be represented schematically as in 
Table 1 (following Law 2003: 175): 


Table I: The modes of being, understanding and signifying 


things (res) in the world properties of the thing (modi essendi) 
concept (res intellecta) in the mind properties of the concept (modi intellegendi) 
signifié (res significata) in the word properties of the signifié (modi significandi) 


Later, in Humanist and Baroque Europe of the 15th to 17th centuries, linguistic re- 
flection on the relationship between language, meaning and reality took a more 
mystical turn. A number of German thinkers believed in some ineffable link be- 
tween language and reality, a link that they were convinced was best preserved 
in their own vernacular. The founder of the German Reformation Martin Luther 
(1483-1546) had written that he felt an affinity between the word Liebe ‘love’ and 
its meaning, which he doubted could be expressed as perfectly in any other lan- 
guage. The Protestant self-taught mystic Jakob Bóhme (1575—1624) went further, 
claiming to be able to discern in German the remnants of the Natursprache, the 
original natural language of Adam before the Fall (cf. Gardt 1994). Bóhme be- 
lieved that the very way that words were articulated in the mouth expressed meta- 
physical truths contained in them. J. G. Schottelius (1663), the most important 
seventeenth-century German grammarian, while also citing approvingly Luther's 
speculations on Liebe in German, followed the ostensibly more reasoned argu- 
ments of cultural-patriotic scholars from various Germanic countries in the six- 
teenth and seventeenth century. Schottelius believed with them that their own lan- 
guage was the most natural and original because it preserved the most 
monosyllabic words, in which the original meanings of words, most closely tied to 
reality, were expressed (cf. section 2). 

Others in seventeenth-century Europe were less sanguine about the reliability 
of any natural language for discussing reality. These were the exponents of artifi- 
cial language schemes which flourished in the seventeenth century, amongst which 
Bishop John Wilkins's Essay on a Universal Character (1668) is the most elabor- 
ate and best-known (Subbiondo 1992; Lewis 2007). Recognizing the flaws in natu- 
ral languages — including homonymy and synonymy — which could lead to ambi- 
guity and lack of precision when talking about reality, Wilkins and others like him 
devised their own languages. Wilkins's language was a ‘philosophical’ language in 
the sense of a ‘scientific’ language, i.e. it rested in its foundations on a scientific ac- 
count of reality. His Essay accordingly began by dividing the world into an exhaus- 
tive list of categories and sub-categories, and then building up a language that ex- 
pressed these categories in a perfect one-to-one relationship between language and 
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reality. Others who published universal language schemes were the Germans Joa- 
chim Becher (1635-1682) and Athanasius Kircher (1602-1680), who published 
their efforts in 1661 and 1663 respectively (Strasser 1988: 155-195), and George 
Dalgarno (1626-1687) (Cram and Maat 2001). 

The syntax of natural languages was another avenue through which to explore 
the link between language, thought, and reality. The Latin grammarian Priscian 
had proposed the idea of a ‘natural’ word order, based ultimately on philosophical 
doctrine: the noun should precede the verb, a principle that also found pedagogical 
application in at least one medieval (9th-10th century) pedagogical text (from 
St Gall) (Luhtala 2000: 522). In the seventeenth century, the Grammaire générale 
et raisonée of Lancelot and Arnauld (1660, often called the Port-Royal grammar) 
greatly developed this idea. As the title suggests, the authors of this ‘general and 
rational' grammar were interested in those elements of language common to all 
languages, amongst which (like the medieval speculative grammarians) they in- 
cluded the parts of speech. Here too, as in speculative grammar, logic determined 
their grammatical theory, as a way of describing the relationship between language 
and reality, for one of the most important innovations of the Grammaire was the 
way it applied the framework of the logical proposition (‘X is Y"), to language, 
where a sentence was reduced to consisting of a subject, copula and predicate. 
It was assumed that the subject-copula-predicate order was the natural order: the 
order of logic and hence of thought. As readers may know from their own experi- 
ence, this analytical approach still underlay the grammar that schoolchildren en- 
countered in the late twentieth century. 

The heirs to the Port Royal tradition in the eighteenth century applied philo- 
sophical grammar to advance the cause of instruction in the mother tongue, which 
they equated with training in logical thinking. In France, Nicola Beauzée 
(1717-1789), who contributed numerous articles to the Encyclopédie of Diderot, 
published his own Grammaire générale ou exposition raisonée des éléments 
nécessaires dy langage pour servir de fondement a l'étude de toutes langues 
(1767). Like Lancelot and Arnauld, Beauzée saw the primary function of language 
as the expression of thought. Sylvestre de Sacy also produced a work titled Prin- 
cipes de Grammaire générale mis a la portée des enfans et propres à servir d'in- 
troduction de l'étitude de toutes les langues (1799), whose title specifically noted 
its target audience of schoolchildren and its purpose as an introduction to the study 
of languages. Exponents of general grammar in Germany were Johann Werner 
Meiner (1723-1789), Georg Michael Roth (1769-1817), August Ferdinand Bern- 
hardi (1769-1820), and, in England, James Beattie (1735-1803) and John Horne 
Tooke (1736-1812). Later general grammars increasingly included comparative- 
historical reflections on language, and this leads us to section 2. 
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2. The origin of language and languages, 
and the relationship between languages 


We have already seen that the participants in Plato's Cratylus dialogue discussed 
two competing views on the origin of language: whether language is natural (the 
physei-view of language), conventional (language as nomos ‘law’ or thesis ‘con- 
vention’), or both. The notion of an original name-giver was appealed to there, but 
if ancient Greeks had a myth about the origin of language, it has not survived. 
Plato's Protagoras (4th C BC) argued that sounds and words were the product of 
humans' technical wisdom, no less than buildings or clothing. Diodorus Siculus 
(1st C BC) thought that speech began as meaningless and confused vocal ex- 
pressions, but that people gradually agreed on a token for every object. This, he 
believed, happened in many times and in many different places, so that from the 
beginning there had been many different languages. 

From the early centuries AD until the seventeenth century, it was the Biblical 
account of Adam naming the creatures brought to him by God that provided Chris- 
tian Europe with the explanation for the origin of language. It was not until the En- 
lightenment that such mythical explanations were rejected and more anthropologi- 
cal enquiries began. The Italian Giambattista Vico (1668-1744, Scienzia nuova 
1725, and 2nd ed. 1744) is a relatively early example of these new speculations on 
the origin of language. Vico believed that people first expressed ideas by gestures 
or by using physical objects which had natural relations with ideas, such as three 
ears of corn to mean three years; writing began at the same time, with ideograms 
which relied in the same way on similarities between the symbols and the ideas 
they represented. Condillac (1714—1780) and Rousseau (1712-1788) argued in 
similar ways that language began as imitative gestures (Condillac wrote of a 
langage d'action) and natural cries, which then developed into human language. 
Condillac believed that people's capacity for thought developed in tandem with the 
availability of signs with which meanings could be associated; these signs were in 
turn perfected by advances in human thought. Rousseau emphasized how people 
came almost deliberately to agree on meanings in line with the social contract. 
Herder (1744—1803) — whose essay on the origin of language, published in 1772, 
won the Berlin Academy Prize on the subject (Neis 2003) - took up the idea of the 
interlocking of developments in language and thought advanced by Condillac. 
Herder believed that language emerged only once, and he sought analogies for its 
emergence in the ontogenetic development of language (i.e. in language acquisi- 
tion by the child). Given the short space of time that Herder and his contemporaries 
believed humans had been on the earth, Herder expected to find aspects of the first 
language still present in the "primitive" languages of his day, as did his British con- 
temporary James Burnett (Lord Monboddo, 1714-1799). Such primitive features 
included lack of abstract vocabulary and "inadequate" grammatical organization. 
Many also held (in part correctly) that inflexional and derivational elements of 
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words were the remains of independent words that had become reduced to forms 
added to the root word (a process we would now call grammaticalization). Though 
rational, these eighteenth-century accounts were also more or less fantastic — they 
relied on speculation, thought-experiments. Wilhelm v. Humboldt (1767—1835) is 
perhaps the last well-known exponent of this essentially eighteenth-century specu- 
lation on language origin, though in fact he did not so much explain the origin of 
language as accept that something about the essence of language and its origin re- 
mains beyond our grasp. He viewed language origin as a creative process, seeing 
the capacity for creating language (energeia) as an essential feature of the human 
mind. Individual languages reflected the thought patterns of their speakers in their 
basic structure, some aspects of which were common to all peoples, other aspects 
of which were semantic and grammatical peculiarities that reflected the unique 
spirit of each speech community. 

As for the question of kinship among languages, scholars were hamstrung for 
centuries by their adherence to the biblical explanation of the diversity of human 
languages, the ‘confusion of tongues’ at the tower of Babel (Genesis 11: 1-9). Exe- 
gesis of this passage and of the list of the tribes of Israel (in Genesis 10) had arrived 
in the second century AD at a total of 72 languages and — although some commen- 
tators preferred a total of 69, 70, 75 or 76 languages - this remained the accepted 
truth. It was accepted from very early on that these languages fell into language 
families. Noah's three sons, Japhet, Shem and Ham gave their names to three lan- 
guage families identified: the Japhetic or European, the Semitic, and Hamitic 
(north-African languages). Later, Joseph Justus Scaliger (1610) identified four 
*matrix' or cardinal languages within Europe: Latin (including the Romance lan- 
guages), Greek, Slavonic, and Germanic. Colonizing and missionizing from about 
1500 onwards in Asia and the Americas gave Renaissance scholars opportunities 
to study many ‘new’ non-European languages, and the notion of only 72 languages 
became increasingly difficult to maintain. Conrad Gessner (1555, cf. Colombat and 
Peters 2009) gave samples of 22 different languages, but gave information on a 
further hundred or so. Gessner reconciled this linguistic plurality with the Chris- 
tian interpretation of Genesis by saying that there were indeed 72 languages, and 
that the remaining ‘languages’ identified would come under the heading of an 
existing ‘genus’, which might include more than one dialect. 

While the Church Father Augustine pointed out that we could not know what 
language Adam spoke before the Fall and before Babel, many after him ignored 
him, and Theodor Bibliander (1504/9-1564) wrote with some justification that most 
scholars considered Hebrew to be the original language (1548: 4, 37—38). In the six- 
teenth and seventeenth centuries, the status of Hebrew as original language was 
re-assigned to an ancestor of Germanic by a whole band of enthusiastic German cul- 
tural patriots, who saw in the simplicity and abundance of Germanic monosyllabic 
‘rootwords’ the proof that their language was the oldest and most original of the 
European languages. Their posited ancestral language was variously called Japhetic 
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(after the descendant of Noah said to have brought the language to Europe), Celtic, 
Teutonic, Scythian or Cimbrian, all terms used as equivalents by Schottelius (1663). 

Meanwhile, the first awareness of similarities between European languages 
with other languages further east was gradually seeping in. In the late sixteenth 
century, Frangois Ravelenghien wrote to Justus Lipsius about correspondences 
he had detected between Persian and Flemish words for ‘tooth’, ‘moon’, ‘brother’, 
and others. (The idea, however, spread in wildly overstated form, and led to as- 
sumptions that German or Dutch and Persian were virtually mutually comprehen- 
sible.) By the end of the seventeenth century, there was a consensus about a parent 
language no longer spoken; and about the idea that languages could change and so 
split into dialects which in turn became languages; and there were a few rudimen- 
tary methodological precepts for determining borrowings or common ancestry. 
In the late eighteenth century (1786), William Jones made the crucial breakthrough 
when he reported in an address to the Asiatic Society on the similarities he observed 
between Sanskrit on the one hand and Latin, Greek, Persian, Gothic and Celtic on 
the other. With that, the foundation was laid for the birth of Indo-European com- 
parative-historical linguistics in the nineteenth century. 


3. The emergence and development of theoretical grammar 


It may be difficult for anyone today who has studied a foreign language to imagine 
a time before grammar existed. However, grammar in Europe only really began to 
emerge as a discipline in Rome of the early centuries AD. Ancient Greek philos- 
ophers had treated linguistic matters under logic, one of the sub-branches of phil- 
osophy (alongside ethics and physics). The Stoics did write treatises on speech, but 
none have survived. Nevertheless, logic had a major influence on language study 
over the centuries, as we saw in section 1. One long-lived Stoic influence on Euro- 
pean grammar was the equation of linguistic analogy (regularity, adherence to pat- 
terns), as an expression of reason, with naturalness (physis). In contrast, anomaly 
(irregularity, aberration from the system) was considered unnatural. Certainly 
this discussion influenced Marcus Terrentius Varro (116-27 BC), whose De lingua 
latina (composed 74—45 BC) is Europe's earliest surviving fundamental treatise on 
language. It consisted of 25 books, divided into three sections, although only six 
books survive (books V to X). Varro applied dichotomies (binary pairs of con- 
cepts) from the Pythagoreans to analyse language and so to determine its funda- 
mental nature. For example, the Pythagorean opposition of one to many Varro 
found expressed in the marking of singular and plural of noun and verb forms in 
Latin. Paying attention to the endings of words in this way (inflection, or declina- 
tio, to which one whole section out of the three in his work was devoted), Varro 
also divided words into those that were inflected (that took “endings’) and those 
that did not. He proposed four classes of words: 
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those which have case but not tense (e.g. nouns and adjectives) 
those with tense and not case (e.g. finite verbs) 
those with both case and tense (e.g. participles) 
those with neither case nor tense (e.g. adverbs) 


Pe NaS 


Varro, then, was interested in the form of language, much more so than his prede- 
cessors and contemporaries. A key topic to him was, as already noted, what he calls 
declinatio. Literally ‘declension’ or ‘inflection’, declinatio encompassed both in- 
flection and derivation (i.e. word-formation from existing elements), and for Varro 
it was a key motor in language. It enabled its speakers to make do with a limited 
stock of original words, extending their usability by grammatical inflection and by 
derivation. (The centrality of inflection to European theoretical grammar from the 
very beginning incidentally caused many headaches for the first grammarians of 
vernacular languages that did not inflect as richly as Latin.) 

The Stoic question of analogy and anomaly in language was central to Varro’s 
reflections on language. Since language is by nature rational, one would expect it 
to be subject to logical, regular patterns (analogy). Yet in many instances anomaly 
(irrational, irregular forms) had crept in, whether by error or by idiosyncratic nam- 
ing. Now, if an anomalous, irrational form had become established usage, or was 
used by authoritative writers, what should be done? Varro weighed arguments for 
and against the operation of analogy in language, and came to the conclusion that 
both analogy and anomaly had their place. Anomaly - irregular or even arbitrary 
language use — was acceptable in name-giving. Once such names were established, 
however (and this *name-giving' might include deriving new names or words from 
existing names or words), they then submitted to the same regular patterns of in- 
flection (i.e. analogy) as any other word. 

This problem - the potential conflict between custom and rules based on rea- 
son — remained a thread that ran through the history of European grammar. It was 
taken up by Marcus Fabianus Quintilian(us) (ca 35 AD — ca 100 AD) in his Insti- 
tutio oratoria, ‘Educating the orator’, which contained three chapters on grammar 
that give us some insight into the teaching of grammar in Rome in the early cen- 
turies AD. Like Varro, Quintilian pondered the issue of authority and usage in gram- 
mar, and identified four potentially competing factors: reason (including analogy), 
usage of ancient writers, authority of literary writers, and usage or custom. Both 
Varro and Quintilian saw analogia in (potential) opposition to the dictates of es- 
tablished authority, custom, and (in the case of Quintilian) antiquity. Both equated 
analogia with ratio, which was in turn, in Stoicist fashion, equated with nature. 
Quinitilian certainly did not argue against analogy as a whole, but allowed for more 
instances where analogy might not operate — he did not restrict anomaly to the area 
of name-giving alone, as Varro had done. 

Besides Quintilian, there were other grammarians of this period, but although 
we know the names of some, none of their works survive. We do, however, have 
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some fifteen “school grammars’ (Schulgrammatiken) from the early centuries AD, 
of which the most famous are the Ars minor and Ars maior of Aelius Donatus (ca 
350AD), which were reproduced in countless variations over the centuries and 
continued to be studied into the sixteenth century. Quintilian had addressed the 
sounds of speech, the parts of speech, and questions of orthography, as well as fig- 
ures of speech. But it is from Donatus's works that the basic structure of European 
grammar for many centuries was inherited. Donatus presented the eight parts of 
speech: noun (subsuming substantive and adjective), pronoun, verb, participle, 
adverb, preposition, conjunction and interjection, and dealt with each in turn. In so 
doing, he furnished both the metalanguage and the structure of grammars of Euro- 
pean languages from that day right into the twentieth century. 

The 'school grammars' like those of Donatus were characterized by their sys- 
tematic, strictly hierarchical structure, and by their logical organization into chapters 
and within chapters. In their content, they tended to emphasize semantic categories 
rather than formal ones. However, from about the fifth century onwards, the function 
of Latin grammars began to change. The learners were no longer native speakers of 
Latin, but were learning it as a foreign language, and that called for a new type of 
grammar that spelled out rules and word-forms far more explicitly than had been 
necessary for native speaker learners. The most influential of this new type of gram- 
mar, known as the Regulae (lit. ‘rules’), was Priscian's Institutio de nomine et 
pronomine et verbo (*instruction on the noun, pronoun and verb', sixth century AD), 
written for non-native (Greek) learners, who nevertheless had good Latin. Priscian 
classified nouns, pronouns, verbs and participles by their inflections, and so pro- 
vided a useful framework for giving a form-based overview of Latin of the sort that 
non-native speakers needed. Also worthy of note is Priscian's Institutiones gramma- 
tices, a much larger work devoting two whole books to syntax, a new development. 
From about the eleventh century onwards, commentators on the works of Donatus 
and Priscian, adapting these standard works to the needs of their particular target 
audience, often encouraged some limited contrastive analysis of Latin grammatical 
structures with those of the student's first language, by translating the Latin 
examples into the vernacular. They also began to develop some grammatical termi- 
nology in the vernacular. Aelfric's grammar for Old English (d.1036) and an anony- 
mous German school exercise from St. Gall are eleventh-century examples. 

A second major change to grammatical study in Europe after the Roman Empire 
was its Christianization. Quintilian had divided grammar into two branches: recte 
loquendi scientia, ‘the knowledge of how to write correctly’, and poetarum enarra- 
tio, ‘the interpretation of the poets’. The latter of these branches now received an 
entirely different focus: the interpretation of the Bible, rather than the pagan poets. 
The most famous instance is Bede's (673—735) revision of Donatus's Ars maior, 
substituting Biblical and Christian examples for those given by Donatus. 

Over the succeeding centuries, theoretical grammarians posed key questions 
about the nature of language, questions cast in a slightly different way by each suc- 


800 ` Nicola McLelland 


ceeding generation. One fundamental concern was the expectation of finding ratio 
*reason, rationality' in human language, and the question of what was understood 
to constitute ratio. Simplifying greatly, we can see a see-saw over the centuries be- 
tween, on the one hand, relying on logic and ratio as the basis for determining cor- 
rect language, and, on the other, turning to rhetorical criteria and usage as arbiters 
of correctness. But at each see-saw, the two opposing positions changed somewhat 
too. While the logic-based, universal speculative grammarians of the twelfth and 
thirteenth centuries (see section 1) found the rational essence of language in their 
logic-based categories, the very word ‘elegance’ in the title of Laurentius Valla's 
work De linguae Latinae elegantia (1540) was indicative of the Humanist shift in 
emphasis towards studying grammar by observing good usage - i.e. the elegant 
Latin to be found in the classical authors, not the barbarous Latin of the medieval 
scholars. J. C. Scaliger (1484-1558) swung the pendulum back the other way. The 
title of his De causis linguae Latinae (1540) proclaimed his allegiance to Aristote- 
lian logic, applying to language the *four causes' of Aristotle (a way of explaining 
the world by examining the factors that contribute to making it the way it is). The 
four kinds of 'causes', or 'explanatory factors' of Aristotle provide answers to 
four types of question one might ask about something, for example, about a man: 
1. ‘What is it made from?’ (material cause); 2. ‘What is its form or essence?’ (for- 
mal cause); 3. “What produced it?’ (efficient cause); 4. ‘For what purpose?’ (final 
cause). Scaliger presented custom as an irrational factor, an irritant that caused 
changes to a linguistic system that was originally rational. Petrus Ramus 
(1515-1572) backed away again somewhat from Aristotle — but not nearly as much 
as the famous title of his 1536 thesis ‘all Aristotle’s doctrines are false’ might sug- 
gest. Ramus still insisted on a logical approach to scientific method, and his gen- 
eral tenet that one must proceed in science — grammar included — from the general 
to the particular was still thoroughly Aristotelian. But Ramus aimed to keep logic 
and rhetoric properly distinct, each in their ‘proper’ domains. He favoured like 
Valla the teaching of Latin by literary rather than by logical methods. Sanctius’s 
Minerva. De causis linguae Latinae (1587) harked back in its title to J. C. Sca- 
liger’s Aristotelianism, though in practice it was a not always happy marriage of 
ideas from Scaliger and Ramus. Vossius (1635) once again made analogy the ulti- 
mate guarantor of ratio in language. We have already seen in section | the mental- 
ist rationalist approach of the Grammaire générale et raisonée (1660) and its im- 
pact on its successors. 


4. Studying the vernaculars of Europe and beyond 
We saw above that grammarians teaching Latin to non-native speakers quite soon 


began to incorporate the mother tongue to a greater or lesser extent, giving bi- 
lingual examples, glossing Latin terms with equivalents from the mother tongue. 
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By the late Middle Ages, there were quite elaborate ‘glossed’ versions of the Ars 
Minor in circulation (Ising 1966, 1970, Puff 1995), in part motivated by Humanist 
pedagogical reform. From about the sixteenth century onwards, this practical peda- 
gogical impetus was joined by other motivations to study the vernacular: the cult- 
ural-patriotic, Humanist desire to promote the vernacular as a worthy expression of 
one's own culture; the practical desire for language prescription and standardiz- 
ation (first felt in the area of orthography) in the face of increasing record-keeping 
and interregional communication in the vernacular in state and town bureaucracies, 
and the increased availability of texts thanks to the advent of printing. In the 
Germanic countries, the Reformation — whose proponents encouraged Christians to 
approach the word of God for themselves — was also an impetus to higher levels of 
literacy and so to a desire for certainty in using the written vernacular language. 

Following Dante's Latin De vulgari eloquentia (written 1303-1307, but 
printed in 1529), praises of the vernacular in the vernacular became established as 
*a Renaissance genre' (Burke 2004: 65). Examples include Sperone Speroni's de- 
fence of Italian in Italian in 1542, Dialogo delle lingue; Joachim Du Bellay's 
defence of French, Défense et illustration de la langue francaise (du Bellay, 
1549); and Simon Stevin's Dutch-language Uytspraeck Vande Werdicheyt der 
Duytsche Tael (1586), ‘Oration in Praise of the Worth of the Duytsch Language’. 
At around the same time came the first vernacular grammars. A grammar of Ice- 
landic, the so-called First Grammatical Treatise of ca. 1170, and one of Occitan, 
the Leys d’Amors (ca. 1332) for Italian and Catalonian troubadours, are early 'out- 
liers’ in this vernacular grammatical tradition. They were followed in the fifteenth 
and above all sixteenth centuries by grammars of the major European languages: 
Antonio de Nebrija's Spanish grammar of 1492, Grammatica de la lengua castel- 
lana, John Palsgrave's lesclarcissement de la langue francoyse published in Eng- 
land in 1530, followed a year later in France by Estienne (1531); the first German 
grammar by Albertus (1573), a Portuguese grammar Gramatica da Lingua Por- 
tuguesa by Fernao de Oliviera (1507-1581), the first Dutch one by Hendrick 
Spieghel, Twe-spraak vande Nederduitse Letterkunst;and the first English one by 
William Bullokar (1586), Pamphlet for Grammar. Grammars of Hungarian ap- 
peared in 1539 (Sylvester 1539), and of Czech in 1571 (the Gramatika Ceská of Jan 
Blahoslav (1523-1571). In some languages, grammars did not appear till later in 
the seventeenth century: Peder Syv's grammar of Danish from 1685, and the 
first Russian grammar later still, written by a German under Peter the Great, Hein- 
rich Wilhelm Ludolf's Grammatica russica (1696), in the same year as the first 
Swedish grammar, by Tiällmann (1696). The first grammars were often aimed 
at non-native speakers (thus for instance Palsgrave and Albertus), and closely fol- 
lowed Latin grammatical models, but they nevertheless laid the groundwork for 
autonomous grammatical traditions. Schottelius's German grammar (1641, later 
editions in 1653 and 1663) is an important landmark in European vernacular gram- 
matical tradition. He took the concept of analogy from Vossius's theoretical gram- 
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mar of 1635 (but ultimately going back to Varro; cf. section 3) and applied it to 
a vernacular language — German - for the first time, and in an original and coherent 
way. He thus laid the foundation for a theoretical linguistics of the vernaculars, 
and his innovation had a significant influence on Dutch, Swedish and Russian 
grammatical traditions too (McLelland 2011). In the 17th and 18th centuries, lexi- 
cography of the vernaculars also became established. The first monolingual dic- 
tionaries were rarer than — and emerged later than — bilingual or even polyglot 
works for language learners. For instance, the pedagogue Matthias Kramer's Ger- 
man-Italian dictionary (1676-1678) is the first full dictionary of German root- 
words, preceding Stieler's Sprachschatz of 1691. Meanwhile, the growth of inter- 
est in teaching and learning foreign vernaculars also allowed for the development 
of reflections on language teaching methods, of which the ideas of Jean Luis de 
Vives (1492-1540), William Bathe (1564-1614) and Jan Amos Comenius (b. 
1592) are the most important. 

Besides the development of grammatical traditions for the European lan- 
guages, the global expansion from about 1500 onwards of Catholicism to Asia and 
the Americas, followed by Protestant missions from 1600 onwards, opened Euro- 
pean eyes to the existence of other languages. The results were grammars and 
dictionaries of these newly encountered languages, serving the pressing needs of 
communicating God's word and of commerce. Early examples are a Chinese- 
Portuguese dictionary (1595) produced by the Jesuit Matteo Ricci (1552-1610) 
and Father Michele Ruggieri, a grammar of Tagalog (a language of the Philippines) 
by the Dominican Tomas Pinpin (1610), or Gabriel Sagard's dictionary of the 
Amerindian language Huron (1632). Roger Williams's (1643) work is a traveller's 
phrase-book of Eastern Algonquian terms arranged by topics of buying and selling, 
seeking lodging, eating and entertainment, etc. There was much translation work, 
too, of which John Eliot's (1604—1690) translation of the entire Bible into Massa- 
chusett (an Eastern Algonquian American Indian language) (1663) is an early land- 
mark. 

By the end of the eighteenth century, Latin had lost its position as the default 
object of language study. The study of vernacular grammars — both European and 
non-European — was well-established. As well as countless practically oriented 
works, there were also theoretically rigorous grammars of the vernaculars, whether 
in the universalist tradition of Port Royal (1660) or the analogist model of Schot- 
telius (1663). Reflection on language origin and history had broken free from the 
shackles of Biblical tradition, and had taken an anthropological turn. Europeans' 
encounters with languages on other continents had broadened their understanding 
of what languages could be like, and how languages might be related. The stage 
was set for what has been called ‘the birth of linguistics’ in the nineteenth century. 


See also the following chapters in this volume: 35 by Myhill, 36 by Grzega, 39 by 
Aschenberg, 44 by Wright and 46 by Swiggers. 
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1. The histor(iograph)y of 19th century linguistics: the ‘double helix’ 

2. Linguistics: a 19th century term, discipline, and fulfillment 

3. Historical-comparative linguistics: successive generations, attainments, criticisms 
4. 19th century linguistics: main currents and islands in the stream 

5: Linguistics in Europe and the languages of Europe 

6. 19th century linguistics in retrospect 

1. The histor(iograph)y of 19th century linguistics: the ‘double helix’ 


There are two sides to the history and historiography of 19th century linguistics. 
On the one hand, there is the history traced by the present-day historiographer: a 
reflection of the successive generations of scholars, of parallel or continuous lines 
of development in the approach of language(s) and in the elaboration of linguistic 
methods and practices, and of the evolution of linguistics as an institutionalized 
discipline (a 19th century achievement, cf. section 2). On the other hand, there is 
the history of 19th century linguistics as told, in the second half of the 19th century, 
by its practitioners at that time (e.g. Theodor Benfey, Geschichte der Sprach- 
wissenschaft und orientalischen Philologie in Deutschland seit dem Anfange des 
19. Jahrhunderts mit einem Rückblick auf die früheren Zeiten, 1869; Rudolf von 
Raumer, Geschichte der germanischen Philologie, 1870; Berthold Delbrück, Ein- 
leitung in das Sprachstudium. Ein Beitrag zur Geschichte und Methodik der ver- 
gleichenden Sprachforschung, 1880). The reader of the historiographical account 
that follows has to bear in mind that the “outer” and “inner” stories of 19th century 
linguistics have points in common, but differ, not only in perceptual distance, but 
also in the overall judgments on lines of development and in the foregrounding and 
relegation (or oblivion) of particular linguistic views held by language scholars in 
the 19th century. [For general surveys of 19th century linguistics, see Amsterdam- 
ska (1987), offering a sociohistorical analysis; Morpurgo Davies (1975, 1996), two 
well-documented accounts of language classification in the 19th century, and of 
the trends in historical-comparative, general and synchronic linguistics; Koerner 
(1975), a study of major paradigms in 19th and 20th century linguistics. Useful bio- 
bibliographical information can be found in Tagliavini (1963), Sebeok (ed. 1966) 
and Stammerjohann (ed. 2009). For selections of key texts, see Lehmann (ed. 
1967) and Christmann (ed. 1977). On 19th century linguistics read from within and 
from a distance, see Hoenigswald (1974, 1986).] 
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2. Linguistics: a 19th century term, discipline, and fulfillment 


The term linguistics, designating the field of (comparative) language study, is a 
19th century creation (cf. Swiggers 1996). The Latin(ate) term linguistica (a neuter 
plural referring to materials or books dealing with languages) is sporadically at- 
tested before the 19th century, but had no strict disciplinary connotation; the Ger- 
man term Linguistik, as referring to a (newly created) discipline with specific aims 
and techniques, is first attested in 1808 (in the title of the Allgemeines Archiv 
für Ethnographie und Linguistik edited by Johann Severin Vater [1771-1826] and 
Johann Bertuch [1746-1822]) [the French term linguistique is first attested in 
1812, the English term linguistic(s) in 1827]. Until the middle of the 19th century, 
the term linguistics, and its German (re)translations Sprach(en)kunde/Sprachwis- 
senschaft, would refer to a novel approach in which the comparison of languages 
stood central (as opposed to traditional grammar, which was centered on only one 
language at a time). The nature of this comparative approach was, however, not 
well defined, and remained imprecise for several decades: at least up to the late 
1830s, language comparison comprised a mixture of (proto)typological work, 
of studies belonging within historical-comparative grammar, glottogenetic specu- 
lations, and wide-ranging genealogical classifications. The English term com- 
parative philology, which survived into the 20th century, reflects the fluctuating 
contents and practices within the new comparative approach (the novelty of which 
was at times exaggerated or even unjustifiedly claimed). 

The writings of the major “general linguist” of the 19th century, Wilhelm von 
Humboldt [1767-1835], cristallize the fusion of language typology and “characte- 
rology", of historical-comparative studies, and of speculations on the nature and 
evolution of language (seen as a dynamic energeia). Humboldt, a philosopher, 
diplomat, writer and universal spirit, was familiar with the classical Indo-European 
languages, had studied Sanskrit, as well as various modern languages of Europe 
(including Basque), and was strongly interested in the Chinese language and writ- 
ing system. With the help of his brother Alexander and through his contacts with 
American philologists such as Peter Stephen Du Ponceau [1760-1844] and John 
Pickering [1777-1846], Humboldt gathered a unique collection of books, pamph- 
lets, and manuscripts on the American Indian languages of North, Middle and South 
America. He carefully read and rewrote grammars of exotic languages, as we know 
through the discovery of his manuscript materials in the Krakow university library 
at the end of the 20th century, which are now being edited. Humboldt’s interest was 
in language as the culture-bound expression of a world view (Weltanschauung); the 
latter had to be studied through the typological analysis of grammatical forms 
(grammatische Formen). Humboldt, moving beyond early 19th century classifi- 
cations of language types, elaborated a typology of grammatical forms (as express- 
ing the more or less achieved integration of "matter" and ‘form’), of which he exam- 
ined the correlation with cultural stages and with the nature of the writing system 
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used. In the 1820s Humboldt wrote extensively on the nature and status of gram- 
matical forms in Indo-European, Chinese, Basque, and American Indian languages. 
His more philosophically slanted opus magnum, on the Kawi language, appeared 
posthumously; it is introduced by a very long essay on the diverse ‘architecture’ 
(Sprachbau) of linguistic systems, on the construction of grammatical forms, and 
on their linguistic and cultural functioning (Uber die Kawi-Sprache auf der Insel 
Java nebst einer Einleitung über die Verschiedenheit des menschlichen Sprach- 
baues und ihren Einfluss auf die geistige Entwicklung des Menschengeschlechts, 
1836-1839). On Humboldt’s linguistic interests, see Mueller- Volmer (1993); on the 
philosophical foundations of Humboldt's general linguistics, see Borsche (1981) 
and Manchester (1985). On language comparison, and the reception of Humboldt's 
ideas in early 19th century American linguistics see Swiggers (1998). 

The ‘science of language’ which emerged at the beginning of the 19th century 
was strongly inspired by German Romanticism, as can be seen in the works of Frie- 
drich (von) Schlegel [1772-1829] and his brother August- Wilhelm [1767-1845]. 
Its general orientation was historical-comparative, and it was based on the recog- 
nition and empirical proof of genetic relationship (Sprachverwandtschaft). Such 
relationships had already been claimed in the 16th, 17th and 18th century by 
Humanist language scholars, and, in their lead, by Gottfried Wilhelm Leibniz 
[1646-1716]; the language families identified, in very broad terms, were called 
*Japhetic' (corresponding roughly to the languages of Europe) and ‘Semitic’ (as a 
properly linguistic term, Semitic was proposed in 1781 by August Ludwig von 
Schlózer [1735-1809]), but all these authors worked within the Biblical concep- 
tion of the dispersion of language families and ethnic groups, and the genetic rela- 
tionships claimed were (almost exclusively) based on lexical correspondences. 
Things changed at the turn of the 18th century. The growing information on the 
languages of India led to the recognition of the genetic affinity between idioms at- 
tested from India to the Germanic-speaking territories in Western Europe. This ex- 
plains why these languages were first called Indogermanisch (‘Indo-Germanic’ ); 
as a matter of fact, the now much more common term /ndo-European came later 
(1826), but would not completely replace the former term (still used today in Ger- 
man scholarly publications). It is worthwhile noting that long before the end of the 
18th century the attention of European scholars had been drawn to the importance 
of Sanskrit for linguistic comparison with European languages, but the writings of 
precursors such as Benjamin Schultze [active ca. 1725], Gaston-Laurent Coeur- 
doux [1691-1770] and Nathaniel Brassey Halhed [1751-1830] had hardly any im- 
pact. It was Sir William Jones [1746-1794] who in 1786 formulated the hypothesis 
of a common ancestor for Sanskrit and the (classical) languages of Europe, but 
without offering linguistic arguments; Jones's discourses for the Royal Asiatic(k) 
Society had, however, the positive effect of triggering linguistic comparative re- 
search. On precomparativism, and on the role of the ‘rediscovery’ of Sanskrit see 
De Mauro and Formigari (eds. 1990) and Mayrhofer (1983). 
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With respect to the emergence of Indo-European comparative linguistics, three 
facts should be noted: 


(1) The westernmost boundary was thought, at the beginning of the 19th century, 
to be constituted by the Germanic languages. The Celtic languages, at that 
time insufficiently studied by scholars on the continent, and often believed to 
be of Semitic descent (or related to Basque), were incorporated into the Indo- 
European family tree only in the second third of the 19th century (starting 
with Adolphe Pictet [1799-1875], De l'affinité des langues celtiques avec le 
sanskrit, 1837). The eastward extension of the geographical coverage of the 
Indo-European family would follow only at the end of the 19th century, with 
the discovery of Tocharian texts; their decipherment was achieved in the first 
decades of the 20th century. 

(2) As already noted, intuitions (and also explicit statements) concerning the 
genetic unity of several of the languages recognised as Indogermanisch go 
back to the proto-comparative work of Renaissance scholars and their fol- 
lowers. But the “discovery” of the (Indo-European) languages of India, as 
well as of the important Indian tradition of grammatical analysis, opened new 
perspectives and offered a tool for phonemic and morphological analysis. 

(3) Thebeginnings of Indo-European comparative linguistics are posterior to the 
first solid comparative works in which languages of the Finno-Ugric family 
are analysed in detail and shown to be related: in 1770 János Sajnovics 
[1733-1835] published his Demonstratio idioma Ungarorum et Lapponum 
idem esse, and his empirical proof was refined and extended by Sámuel 
Gyarmathi [1751—1830] in 1799 (Affinitas linguae Hungaricae cum linguis 
Fennicae originis grammatice demonstrata nec non vocabularia dialecto- 
rum Tataricae et Slavicarum cum Hungarica comparata). On the transition 
from precomparativism to historical-comparative grammar, and on the foun- 
dation of Indo-European comparative grammar see Zeller (1967), Neumann 
(1967), and Desmet and Swiggers (2000). 


3. Historical-comparative linguistics: 
successive generations, attainments, criticisms 


The insight (diffused by F. Schlegel; cf. section 2) that Sanskrit (and also Avestan) 
was related to Latin, Greek, and to most of the modern European languages stimu- 
lated scholars to look for linguistic proof in terms of structural correspondences 
between linguistic forms that had to be accounted for through a common origin. At 
first the approach was primarily comparative and based on the correspondence of 
morphological segments, as we can see in the works of Rasmus Rask [1787-1832] 
(Undersógelse om det gamle Nordiske eller Islandske Sprogs Oprindelse, 1818, in 


19th century linguistics: practice and theory 809 


which Sanskrit is not yet integrated in the comparative frame) and especially of 
Franz Bopp [1791—1867], who laid the foundations of Indo-European comparative 
grammar (Über das Conjugationssystem der Sanskritsprache in Vergleichung mit 
jenem der griechischen, lateinischen, persischen und germanischen Sprache, 
1816) and who produced, in the second third of the century, the first comparative 
grammar (Vergleichende Grammatik des Sanskrit, Zend, Griechischen, Lateini- 
schen, Litauischen, Gothischen und Deutschen, 1833-1852). By then linguistic 
comparison had developed its methodological core, viz. the comparative method, 
based on the triangulation of forms (phonemic or morphemic segments), i.e. the 
horizontal alignment of forms in the daughter languages, and the vertical connec- 
tion with ancestral forms, and the postulation of accountable changes in the evol- 
ution from the ancestor language (initially thought to be Sanskrit). The application 
of the comparative method to subgroups within Indo-European led to the “histori- 
cization” of comparative grammar: here Jacob Grimm's [1785-1863] work on 
Germanic (Deutsche Grammatik, 1819-1837) was foundational, and it demon- 
strated the importance of establishing laws for sound alternations. Phonic corre- 
spondences had to be established through a close analysis of the lexical material; in 
this way etymological research became a central pillar of comparative grammar. 
August Friedrich Pott [1802-1887], a student of Bopp, established Indo-European 
etymology as based on the principles of historical grammar (Etymologische 
Forschungen auf dem Gebiete der Indo-Germanischen Sprachen, 1833-1836); 
the first etymological dictionary of Indo-European was published by August Fick 
[1833-1916] in the second half of the century. The new science of historical-com- 
parative grammar found soon its way into the German university system, es- 
pecially in Prussia, which was witnessing an increase in number of students and 
institutions and which underwent a radical change of perspective (towards 
research-oriented locations). On the development of the comparative method, see 
Hoenigswald (1963); on Bopp's importance, see Sternemann (1984). 

The initial model of historical-comparative grammar, as exemplified in the 
work of Bopp and Grimm, was adopted and refined by Friedrich Diez [1794—1876] 
for Romance comparative linguistics. Diez corrected the unfounded views of 
scholars such as Frangois-Juste-Marie Raynouard [1761-1836], and replaced 
Lorenz Diefenbach's [1806-1883] typologically slanted approach of Romance 
(Ueber die jetzigen romanischen Schriftsprachen, 1831), with a truly historical- 
comparative study, both for the grammar and the lexicon of the Romance lan- 
guages (Grammatik der romanischen Sprachen, 1836-1844; Etymologisches Wör- 
terbuch der romanischen Sprachen, 1853). The view prevailing throughout the 
first half of the 19th century, viz. that the ancestor of the Romance languages 
was Classical Latin, was definitively corrected by Hugo Schuchardt [1842-1927] 
in 1866 (Der Vokalismus des Vulgärlateins, 1866-1868). In the Slavic field, for 
which Mixail Lomonosov [1711-1765] had laid the foundations in the 18th cen- 
tury, the first full comparative grammar was produced by Franz Miklosich/Franc 


810 Pierre Swiggers 


Miklosié [1813-1891] (Vergleichende Grammatik der slavischen Sprachen, 1852— 
1875). In 1853 Johann Kaspar Zeuss [1806-1856] established the comparative 
study of Celtic (Grammatica Celtica e monumentis vetustis tam Hibernicae lin- 
guae quam Britannicae dialecti Cornicae Armoricae nec non e Gallicae priscae 
reliquiis construxit J.C. Zeuss). These scholars all worked in the framework of the 
first generation of comparatists; their focus was on the written languages and their 
historical testimonies, and their view of linguistic comparison was primarily 
grounded in etymological work. They operated with sound laws and with analogy, 
without postulating absolute regularity. 

It was this type of linguistics that became part of university curricula (starting 
with Berlin, where Bopp was appointed in 1821, on the recommendation of W. von 
Humboldt). There were three main reasons for this academic recognition: (a) the 
historical orientation of this type of linguistics; (b) its claim of being "scientific"; 
(c) the fact that its principles were generally applicable, and could thus be taught. 

These three features also explain the exclusion of other linguistic approaches 
from the academic scene, during the larger part of the 19th century: 


(a) The study of the “modern languages" was seen as something purely practical, 
i.e. an art and not as a science, and something without explanatory power; 
only at the end of the century did modern language teaching gain academic 
recognition, albeit to a limited extent. 

(b The philosophically slanted approaches of languages, at times still fed by 18th 
century theories of language, were felt to be non-transmittable: this explains 
the marginalization of Humboldt’s linguistics, and also of Heymann Stein- 
thal [1823-1899] and Franz Misteli's [1841—1903] typological work, and of 
Steinthal and Moritz Lazarus’ [1824-1903] Völkerpsychologie. In 1883, 
Philipp Wegener [1848-1916], a student of Steinthal, published his Untersu- 
chungen über die Grundfragen des Sprachlebens, which foreshadows modern 
(semantico)pragmatic ideas; the book hardly attracted attention from histori- 
cal-comparative scholars and its importance was recognized only much later. 


Whereas the first generation of historical-comparative linguists (Bopp, Rask, 
Grimm, Diez) was concerned with the establishing of the new discipline, and with 
applying a still imperfect frame to the family of Indo-European languages and 
a number of its subgroups (Germanic, Romance), the second generation (inspired 
by the recent developments in the natural sciences, especially in evolutionary bi- 
ology and geology) was eager to define the field of Glottik (language study) as con- 
stituted by a more rigourously defined and practised comparativism (vergleichende 
Sprachwissenschaft). On the influence of biology and geology on 19th century 
linguistic thought, see Hoenigswald and Wiener (eds. 1987), and Neumann, Plank 
and Hofbauer (eds. 1992). 

A major concern was the separation between linguistic study (the study of lan- 
guage forms on themselves) and philology. Between the 1840s and 1870s histori- 
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cal-comparative linguists stressed their distance with regard to the work of philol- 
ogists (text editors and commentators; students of the literary history of languages; 
scholars interested in comparative mythology and religion). It was especially 
August Schleicher [1821-1868] who insisted on the opposition between Glottik 
(the study of organic language structures from an evolutionary point of view, 
comparable to botanic classification) and Philologie (an ‘entertaining occupation 
for text-lovers’). Schleicher's important contribution to language study, and to 
Indo-European comparative studies, does not lie, however, in his quarrel with phil- 
ologists, nor in his plea for an evolutionary, and more specifically Darwinian ap- 
proach in linguistics (Die Darwinsche Theorie und die Sprachwissenschaft, 1863; 
Über die Bedeutung der Sprache für die Naturgeschichte des Menschen, 1865). 
It rather lies in his refinement of the comparative method (which he consistently 
used for the reconstruction of common Indo-European), in his admirable work on 
Lithuanian (Handbuch der litauischen Sprachen 1856-1857; Litauische Märchen, 
Sprichwórte, Rdtsel und Lieder, 1857 — a gem of philological (!) work on an 
oral tradition), and especially in his authoring of the impressive Compendium der 
vergleichenden Grammatik der indogermanischen Sprachen (1861—1862; second 
to fourth edition 1866-1876), which was translated into various European lan- 
guages. To this second phase of historical-comparative studies, emblematically 
represented by Schleicher, also belong scholars such as Georg Curtius [1820- 
1885] and Johannes Schmidt [1843-1901]. Curtius was a specialist of the histori- 
cal grammar of Greek and Latin; his Griechische Schulgrammatik, translated into 
almost all of the modern European languages, testifies to the integration of com- 
parative grammar in language manuals for school use. Schmidt, a faithful student 
of Schleicher, revised the Compendium, and made his major theoretical contribu- 
tion by suggesting the wave-theory (Wellentheorie) for describing and explaining 
macroscopic changes in the evolution of languages springing from a common 
source. In his 1872 booklet Die Verwandtschaftsverhdltnisse der indogermani- 
schen Sprachen, Schmidt was the first to formulate in print this alternative to 
the family-tree theory (Stammbaumtheorie); Hugo Schuchardt, in a 1870 lecture, 
had expressed similar views (using ‘oceanographic metaphors’), with regard to 
linguistic irradiation within the Romance subgroup, which was published only 
in 1900 (Über die Klassifikation der romanischen Mundarten). On chronological 
grounds one should also assign to this phase the American Sanskritist and com- 
parative scholar William Dwight Whitney [1827-1894], who spoke forcefully 
against the comparative mythological work of Friedrich Max Müller [1823-1900], 
and who wrote on the principles of the ‘new linguistic science’ (historical-com- 
parative linguistics). But Whitney's interest in the links between language and 
society make him a figure apart (even with regard to the third generation of his- 
torical-comparative linguists in Europe), and force us to view him as a founding 
father of modern general linguistics as shaped by Ferdinand de Saussure and his 
students. 
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The second generation of comparative scholars soon came under fire. In the 
1870s a group of young Indo-Europeanists at Leipzig, inspired by the Balto-Slav- 
icist August Leskien [1840-1916] and headed by Hermann Osthoff [1847-1909], 
Karl Brugmann [1849-1919] and Berthold Delbrück [1842-1922], rejected the 
evolutionary views of Schleicher, strongly relativized the aims and usefulness of 
linguistic reconstruction, modified on various points Schleicher's description and 
explanation of changes within the Indo-European languages and, most importantly, 
radicalized the concept of regular linguistic (especially: phonetic) change. The 
Leipzig group, strengthened in its conviction and its (polemical) stand by the dis- 
covery, in the years 1875 through 1878, of anumber of important sound laws (Laut- 
gesetze) explaining several large-scale processes in the history and diversification 
of Indo-European, claimed their views to be scientifically superior to any previous 
or competing approach in the historical-comparative grammar of Indo-European 
(and, by extension, in historical-comparative practice in general). The Leipzig 
comparatists, designated by the term Junggrammatiker (‘Neogrammarians’ ), first 
used as a nickname, was remarkable for its strong doctrinal (and social) cohesion, 
for its innovative empirical work, and for its astonishing capacity of producing 
monumental syntheses. Three of its most prominent members produced multi- 
volume compendia: Brugmann (Grundriss der vergleichenden Grammatik der 
indogermanischen Sprachen, 1886-1893, 5 volumes; second edition with B. Del- 
brück, 1897-1916, 9 volumes); Delbrück (Syntaktische Forschungen, 1871-1888, 
5 volumes); and Hermann Paul (Deutsche Grammatik, 1916-1920, 5 volumes). 
The group had in its ranks a few scholars with theoretical ambitions. As a young 
scholar, Brugmann (with H. Osthoff approvingly co-signing) formulated the prin- 
ciple of the absolute regularity of sound change (if not perturbated by analogical 
forces). This principle, stated in the preface to the first volume of Brugmann and 
Osthoff's Morphologische Untersuchungen (1878), was the cornerstone of their 
uniformitarianist assumption that language evolution followed the same lines and 
principles throughout time, being universally subject to identical (structural) con- 
ditions. The study of linguistic change in vivo (as observed or thought to be observ- 
able in dialects) was hailed as the touchstone for explaining, by chronological 
‘retroprojection’, changes in the past. The Freiburg Germanic scholar Hermann 
Paul [1846-1921], the major theoretician of the Junggrammatiker, whose Principien 
[later: Prinzipien] der Sprachgeschichte (first edition, 1880; fourth edition 1920, 
the last edition revised by Paul himself) constituted the Neogrammarian theoretical 
synthesis for diachronic linguistics, stated that the *historical' (geschichtlich) ap- 
proach was the only scientific one in linguistics; he offered a theoretical frame 
for the explanation of language change, as caused by individuals and adopted 
by (part of) the community. Although the Neogrammarians admitted that the laws 
of the evolution of languages cannot be seen as ‘natural’ laws, and although they 
stressed that linguistics is a historical discipline, they never yielded to Steinhal's 
psychologically based view nor to Wilhelm Wundt’s [1832-1920] sociologically 
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based view of language and linguistic evolution. On the Neogrammarians, see Jan- 
kowsky (1972), focusing on their diachronic conceptions, Einhauser (1989), ana- 
lysing the cohesion of the group, and Koerner (1981), studying their impact; Brug- 
mann's crucial role is analysed by Morpurgo Davies (1986); for a survey of sound 
laws established by the Neogrammarians and their successors, see Collinge (1985). 

The Neogrammarians had the merit of stressing the need for phonetic accuracy, 
as a basic requirement for the adequate description of sound change (caused by 
the physical properties and processes of the articulatory apparatus). The second 
half of the 19th century witnessed a considerable increase of phonetic studies, and 
the rise of experimental phonetics. This phonetic work was at first at the service of 
(historical-)comparative linguistics (cf. Eduard Sievers [1850-1932], Grundzüge 
der Lautphysiologie zur Einführung in das Studium der Lautlehre der indogerma- 
nischen Sprachen, 1876), and historical linguistics (Henry Sweet [1845-1912]; 
Paul Passy [1859-1940]; Otto Jespersen [1860-1943]), and then extended to 
dialectology (cf. Pierre Rousselot [1846-1924]) and language teaching (Wilhelm 
Viétor [1850-1918]), while slowly gaining autonomy. In 1886 the “Phonetic 
Teachers’ Association" was founded, the forerunner of the “International Phonetic 
Association" (1897). 

The principle of ‘regular sound change’, which soon became a powerful heu- 
ristic tool, was in turn criticized by contemporaries of the Junggrammatiker, 
mainly from three angles: 


(1) scholars such as Georg Curtius and especially Graziadio Isaia(h) Ascoli 
[1829-1907] (an Italian 'glottologist who was at home in Indo-European as 
well as in Semitic, and who made decisive contributions to Romance dialec- 
tology) stressed the important role of conservative factors and of substratal 
influences in the history of languages; 

(2) students such as Whitney and the French Indo-Europeanist Michel Bréal 
[1832-1915] criticized the neglect of social and cultural factors by the Neo- 
grammarians in their study of language change; Bréal's socio-operative con- 
ception of language, rooted in the French tradition of language philosophy, 
which stressed the role of human intelligence, led to the establishment of 
the field of semantics, the study of the socio-functional side of language (see 
his La science des mots: Essai de sémantique, 1897; the term sémantique was 
coined by Bréal in 1883); 

(3) the most relevant, and harsh criticisms were aimed at the principle of regular 
sound change itself and its mechanistic nature. The main antagonist was 
Hugo Schuchardt. In his pamphlet Über die Lautgesetze. Gegen die Jung- 
grammatiker (1885), apart from stressing the fact that all phonetic processes 
are restricted in time and space, he showed that 
(i) there is no ‘pure linguistic community’, language mixture being the 

normal situation in a society; 
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(ii) there is no decisive, overall criterion to distinguish between the work- 
ings of a sound law and those of analogy; 

(iii) linguistic change cannot be mechanically explained, given the com- 
plexity (and changing variety) of conditional factors. [On the ‘sound 
law controversy', see Wilbur (ed. 1977). On Schuchardt's criticisms, 
and their embedding in an innovative socio-anthropological view of 
language, see Swiggers (2000).] 


Schuchardt's objections were confirmed on a large empirical scale by dialectologi- 
cal work (Ascoli; Georg Wenker; Heinrich Morf; Jules Gilliéron; Louis Gauchat) — 
an instance of history's irony, since the Neogrammarians had put strong hopes on 
the study of "living languages", where they expected to find confirmation of the 
sound laws they had postulated for past stages. The fine-grained work in dialectol- 
ogy (especially in Romance territories) would lead to the epistemological and me- 
thodological redefinition of linguistics (and the rejection of Neogrammarian theor- 
izing) by Ferdinand de Saussure [1857-1913] and Antoine Meller [1866-1936] at 
the beginning of the 20th century, and to the elaboration of a sociologically based 
theory of language in which the crucial distinction is that made between language 
system (langue) and concrete speech facts (parole). On the evolution of dialect 
studies in the 19th century, and their methodological importance, both internally 
(cf. the issue of dialect boundaries) and externally (with reference to topics of gen- 
eral linguistics, such as variation and change), see Desmet, Lauwers and Swiggers 
(1999). 

Saussure had studied in Leipzig and had written there, at age twenty-one, a truly 
innovative dissertation on the sound system of Indo-European (Mémoire sur le sys- 
téme primitif des voyelles indo-européennes, 1879 [December 1878]), in which 
vowel alternations are explained by different quantitative grades and by the colour- 
ing role of a “coefficient sonantique” (prefiguring the laryngeal theory, for which 
the discovery and decipherment of Hittite, after 1915, would provide evidence). 
But while remaining a lifelong practitioner of Indo-European comparative gram- 
mar, Saussure, who had never adhered to the theoretical principles of the Jung- 
grammatiker, engaged himself in linguistic theorizing and in the elaboration of 
a theory of linguistics (restructured, on the basis of lecture notes, by the editors of 
the posthumous Cours de linguistique générale, 1916). Saussure criticizes the 
Junggrammatiker for their theoretical insufficiency, and reacts even more critically 
against the French school of "linguistique naturaliste", which was very active in 
France between 1870 and 1910. This school, which comprised scholars mostly 
connected with the École d'Anthropologie in Paris, defended the evolutionary 
views of Schleicher (and Max Müller), and practised a typological-evolutionary 
approach to the study of languages (coupled with racial distinctions and develop- 
mental stages). This ‘naturalist’? school of linguistics (cf. Desmet 1996) was 
efficiently marginalized by the French Indo-Europeanists (Bréal, Abel Bergaigne 
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[1838-1888], Meillet, Louis Havet [1849—1925]) and Romance scholars (Gaston 
Paris [1839-1903], Paul Meyer [1840-1917) and its role ended with the extinction 
of its journal Revue de linguistique et de philologie comparée (1867—1916). 


4. 19th century linguistics: main currents and islands in the stream 


Looking with hindsight at the impressive mass of empirical and theoretical work 
produced by 19th century language scholars one can say that Indo-European lin- 
guistics, with its subdomains (Romance, Germanic, Slavic, ...), constitutes the 
bulk of the production. Principles of historical-comparative grammar also made 
their way into didactic grammar manuals (especially in France and Germany), in 
which the facts were assumed to have their (best) explanation in history, and not in 
synchronic functioning. It was also the work on Indo-European and its branches 
that underlay theoretically slanted works on (general) linguistics, on the history of 
languages, on the "life and growth" of a particular language. The model of Indo- 
European comparative grammar was applied in the field of Semitic, and in the first 
attempts at writing comparative grammars of "exotic" languages. 

The massive, strongly data-based production in Indo-European historical-com- 
parative grammar and the hegemony of this field within university curricula are the 
principal factors responsible for the slow recognition of modern language teaching 
(present-day “applied linguistics"), of language typology and “characterology”. 
Humboldt's general linguistics was altogether felt to be too speculative and ro- 
mantically flavoured. The increasingly positivistic and mechanistic stand of com- 
parative linguists led to the rejection of evolutionary linguistics in the wake of 
Schleicher, and of musings on the origin of language (a topic excluded from the 
agenda of the Société de Linguistique de Paris in 1866). 

However, a panoramic view of 19th century linguistics reveals the presence of 
a number of research topics, with enduring success, although these did not enjoy a 
widespread academic interest. Among these, two important research issues should 
be singled out: 


(a) The study of the origin, nature and function of creole languages. Here the key 
figure was, once again, Hugo Schuchardt (cf. section 3), who between 1882 
and 1914 published extensively on creole languages all over the world. 

(b) Work on an international (auxiliary) language. There was a proliferation 
of international language projects (with Esperanto and Volapük being most 
successful) between 1875 and 1910, in which a number of professional 
linguists (Hugo Schuchardt, Jan Baudouin de Courtenay [1845-1929], Otto 
Jespersen) and logicians (Giuseppe Peano [1858-1932], Louis Couturat 
[1868-1914]) participated. This topic must be viewed in the light of political 
and socio-economic events that occurred in the 19th century (unification of 


816 Pierre Swiggers 


political states; rise of nationalism; ethnicity conflicts; social revolts; mili- 
tary escalation; the industrial revolution and the spread of socialism), and 
in the context of newly arisen needs such as worldwide political and social 
negotiations, and the communication between scientific scholars; in short, 
‘living floor’ conditions which were radically different from those that had 
characterized the Ancien Régime. For an analysis of 19th century projects of 
an international auxiliary language as part of the larger history of the search 
for a perfect/universal language, see Eco (1995, chapter 16); for an assess- 
ment by one of the leading figures in the international language movement at 
the beginning of the 20th century, see Couturat and Leau (1907). 


5. Linguistics in Europe and the languages of Europe 


The 19th century saw the emergence, development and institutional implemen- 
tation of Indo-European comparative linguistics, the comparative approach having 
its origin in the romanticist interest in the past and in cultural heritage (cf. sec- 
tion 2). Nationalist feelings were a strong factor in the institutionalization of the 
discipline in German universities. Also, from the beginning, racial considerations 
played a role, given the then too easily established link between language and race; 
in the case of French naturalist linguistics (cf. section 3) one can even speak of rac- 
ist preconceptions. 

How did the languages of Europe fit into this picture? Being inflectional lan- 
guages, they received a prominent place in Friedrich Schlegel's typology (Ueber die 
Sprache und Weisheit der Indier, 1808, and the elaboration of grammatical forms as 
found in Latin and especially in Greek led Humboldt (cf. section 2) to claim greater 
perfection in the classical Indo-European languages (as compared, for example, with 
Basque or the American Indian languages). As a general rule, Indo-Europeanists 
did not show a specific interest in the history of Europe's languages, partly because 
of the ancestry of Sanskrit and Persian. A number of historical-comparative scholars 
(e.g., Wilhelm Scherer [1841-1886], H Paul, O. Jespersen) engaged in the writing of 
language histories (of German, of English), a genre which in the Romance-speaking 
countries was most often cultivated by philologists, lexicographers or literary auth- 
ors. One author, however, stands out: August Schleicher, the key figure of the second 
phase of comparative studies (cf. section 3). In 1850 Schleicher published a less tech- 
nical work Die Sprachen Europas in systematischer Uebersicht, which was trans- 
lated into a number of European languages. The book is remarkable not only for its 
treatment of Europe's languages on their own, but also for its linguistic contents: as a 
matter of fact, the work is not a detailed account of the development of the languages 
of Europe, but is intended as a history of linguistic types as these are present on the 
European continent. Later, in 1860, Schleicher published a general work on the ori- 
gin and evolution of the German language (Die deutsche Sprache). 
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6. 19th century linguistics in retrospect 


Historical-comparative grammar was the institutional(ized) face of linguistics in 
the 19th century. This approach was primarily applied to the Indo-European family 
and to its subgroups, especially to Indo-Iranian, Hellenic, Italic and, soon a separ- 
ate field of study, Romance, Slavic, and Baltic. (The Celtic languages became a 
specific domain of research after Zeuss's comparative grammar; Armenian gained 
its place through the work of Heinrich Hübschmann [1848-1908] in the 1870s.) 
For other language families, comparative studies either further developed a tradi- 
tion existing since the end of the 18th century (e.g., for Finno-Ugric; cf. section 2), 
or created a new field of study, though not always recognized at the academic level 
(cf. Mattias A. Castrén [1813-1852], De affixis personalibus linguarum Altaica- 
rum, 1850; Robert Caldwell [1814—1891], A Comparative Grammar of the Dravi- 
dian or South-Indian Family of Languages, 1856; Wilhelm H.I. Bleek [1827-1875], 
A Comparative Grammar of South African Languages, 1862-1869; Hans Conon 
von der Gabelentz [1807-1874], Die melanesischen Sprachen, 1860-1873; Carl 
Meinhof [1857-1944], Grundriss einer Lautlehre der Bantu Sprachen, 1899). The 
comparative study of the Semitic languages, which gained momentum in the sec- 
ond half of the 19th century, received a strong impetus after the decipherment, in 
1857, of the Akkadian cuneiform (the writing system of the Assyrian and Babylo- 
nian language). 

The work in historical-comparative linguistics was truly impressive, and re- 
mains foundational for today's practice. It also provided the impetus for a scientific 
approach to the study of dialects and of place names (toponymy became the first 
scientific branch of onomastics), and criticisms raised against the (too) mechanistic 
approach of the Neogrammarians proved beneficial to the rise of sociological 
and psychologistic (‘idealistic’) views in linguistics, leading eventually to various 
prestructuralist approaches. In the United States the end of the 19th century saw 
the introduction of anthropological linguistics. The massive presence of historical- 
comparative linguistics left little place for general linguistics and language typol- 
ogy, two research interests of the early 19th century which converged much later in 
the writings of Georg von der Gabelentz [1840-1893] (see his Die Sprachwissen- 
schaft: ihre Aufgaben, Methoden und bisherige Ergebnisse, 1891). A fortiori, there 
was no place in linguistics then for the ingenious work in mathematical and formal 
logic of George Boole [1815-1864] and Gottlob Frege [1848-1925]: an intellec- 
tual injustice the 20th century would repair. 


See also the following chapters in this volume: 26 by Auer, 27 by Woolhiser, 28 by 
Murelli and Kortmann, 45 by McLelland, 47 by Albrecht, 48 by Sornicola and 49 
by Reuland and Everaert. 
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Jórn Albrecht 
1. The prehistory of structural thought 
2. What does “structure” mean? 
3. Ferdinand de Saussure as one of the “founding fathers” of European Structuralism 
3.1. Langue — parole 
3.2. Synchronie — diachronie 
3.3. Signifiant — signifié [signifier — signified] 
3.4. Syntagmatique — associatif (paradigmatique) 
4. The most important schools of European Structuralism (and their methods) 


4.1. The Prague School 

4.2. The Geneva School 

4.3. The Copenhagen School 
4.4. The Russian School 

4.5. The London School 


4.6. Scholars between the schools 
5. European versus American Structuralism 
6. Concluding remarks 


Although considered today an “outdated paradigm”, European Structuralism is one 
of the most influential models of linguistic analysis and description in the history of 
modern linguistics. It has left numerous “conceptual traces” even in those linguistic 
approaches that may seem antagonistic to structural thought. In contrast to former 
and later trends in linguistics, structuralism treats the language system as an auton- 
omous object which can and should be abstracted from adjacent factors such as the 
history of the language, the anatomic and neurological requirements for the faculty 
of speech, the cognitive and social conditions of verbal communication and, last 
but not least, the practical purposes of speech acts. European Structuralism never 
manifested itself as a unified “school” but rather in the form of a geographically 
scattered group of scholars with widely varying interests. No neat boundaries can 
be drawn between structuralism and functionalism. 


1. The prehistory of structural thought 


If we take “linguistic structure” to be any kind of pattern occurring regularly 
in speech, then linguistic structuralism is as old as the study of language itself. The 
champions of the “comparative and historical” paradigm in the nineteenth century 
are generally considered to have had little truck with structural thought, but their 
discovery and description of phenomena like the Great Vowel Shift in English or 
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the first and the second Germanic Sound Shift is an impressive manifestation of a 
structural approach ante litteram in linguistics. Hermann Paul (1846-1921), the 
leading theorist of the third generation of so-called Neogrammarians, was much 
nearer to the structural approach than Saussure realised. In 1909 Saussure said to 
his student Albert Riedlinger “Il faudrait reprendre, pour le réfuter, tout ce que Paul 
et les modernes ont écrit là-dessus (i.e. la linguistique synchronique).” [One should 
re-examine, in order to refute it, everything that Paul and other modern linguists 
have written on this subject] (Koerner 1972: 274). In fact some important ideas ad- 
vanced by Saussure — at least the ideas of the “apocryphal” Saussure in the Course 
in General Linguistics (CLG) — had been expressed previously in the works of Her- 
mann Paul, though not always very prominently. Lack of space forces me to restrict 
myself to one example. In his Deutsches Wörterbuch (61966: 280b) Paul observes: 


Zu beachten ist, daß der Verwendung von h/aben] weitgehend die von bekommen 
(kriegen) und behalten entspricht, die sich zu h[aben] verhalten wie werden und bleiben 
zu sein. 

[Note that the use of haben ‘to have’ largely corresponds to that of bekommen (kriegen) 
‘to get’ and behalten ‘to keep’ which relate in the same way to haben ‘to have’ as 
werden ‘to become’ and bleiben ‘to remain, stay’ relate to sein ‘to be’.] 


With the help of some exegetic additions, the proportional series bekommen 
(kriegen) : haben : behalten = werden : sein: bleiben [get : have : keep = become : 
be : stay] can be represented in Figure 1 (for more details see Albrecht 1994: 393; 
Albrecht 2007: 147-148.): 


entstehend bestehend fortbestehend 

[emerging] [existing] [continuing] 

bekommen haben behalten 
Relation [get, obtain] [have] [keep] 
[relation] 

werden sein bleiben 

[become] [be] [stay] 


Figure 1: Hermann Paul as a forerunner of so-called “structural semantics” 


The equation (a: b = c: d) is the formal expression of analogous thinking, and a 
pattern of analogous relations holding between the components of a complex set of 
phenomena is commonly called a “structure”. Hermann Paul deserves to be con- 
sidered a late successor of the School of Alexandria (see Robins 1967, ch. 2.) and 
an early forerunner of linguistic structuralism stricto sensu, especially of so-called 
Structural Semantics. 

However, the foundations of the Structuralist paradigm (or “disciplinary ma- 
trix") were not only prepared at the level of linguistic data arrangement but also on 
the higher plane of "ideological" argumentation. There was a general reaction 
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against the basic assumptions about language and the epistemological convictions 
propounded by the representatives of comparative grammar (or “comparative phil- 
ology", Lyons 1969: 22), especially against the so-called Neogrammarians (Jung- 
grammatiker). 

In contrast to so-called “American structuralism” (the members of this school 
preferred the labels “descriptivism” and “distributionalism”), European Struc- 
turalism — with the exception of the “London School” — can be regarded as a reac- 
tion against positivism, more specifically against the traditional positivism of Au- 
guste Comte and his followers, which is not to be confused with neo-positivism or 
logical empirism. The forerunners of European Structuralism, i.e. philosophers 
like Franz Brentano (1838-1917) and Edmund Husserl (1859-1938), psycho- 
logists like Oswald Külpe and Christian von Ehrenfels, the founders of the gestalt 
psychology and linguists like Baudouin de Courtenay (1845-1929), W. D. Whitney 
(1827-1894) or Hugo Schuchardt (1845-1927) opposed the following four prin- 
ciples of positivism, principles that had never been explicitly formulated. (1) 
Against the principle of *atomism", i.e. the conviction that only single objects or 
facts can be accepted as the starting point for any kind of investigation and that in- 
ductive generalization is the prime source of truth, they argued that it is not single 
fact that should be studied in linguistics, but rather the connections and relations 
that can be ascertained between those facts (relations being considered as "facts" 
sui generis). (2) Against the principle of substantialist thinking, they affirmed that 
in their view what kind of matter an object was made of was less important than the 
characteristic shape by which it can be distinguished from other objects and the 
functions they may serve. (3) Against the principle of evolution, i.e. the belief that 
we know nothing about a phenomenon if we ignore its origins, they argued that 
knowing a phenomenon means determining the place it occupies within a network 
of other phenomena. (4) Against the principle of naturalism, i.e. the conviction that 
any phenomenon susceptible to scientific investigation must be a natural object or 
at least a phenomenon reduced to a natural object (for instance, meaning reduced to 
behaviour), they argued that linguistic facts belong to the sphere of "symbolic 
forms" and that “symbols are no part of our physical world" (Cassirer 1945: 114) 
but rather phenomena that cannot be reduced to natural objects. (For more details, 
see Coseriu 1988: ch. 2-3; Albrecht 2007: 14-15.) 


2. What does “structure” mean? 


"Structuralism" is a vague collective term that probably makes its first appearance 
in the early writings of Roman Jakobson. It was applied post festum to a number of 
linguistic schools and approaches that, at first glance, seem to share certain basic 
assumptions but in fact differ widely in their methods and ideas (Matthews 2001: 
48—51). They have only two (perhaps three) highly generic features in common. 
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(1) They all concur in attributing high importance to synchronic descriptions; (2) they 
all assign a secondary role to the behavior and the psychological make-up of the lan- 
guage-using individual. (A. Bühler 1992: 722) 


Probably the most distinctive feature of the structuralist method is the emphasis it gives 
to wholes, to totalities. (Lane 1970: 14)! 


Nevertheless, it may be useful for the aims of the present chapter to examine the 
meanings that the term structure can take on in different linguistic schools. The 
lexeme structure (derived from Lat. structura « struo 'to build") appears in the 
context of language and literature at a very early stage. Cicero uses the expression 
verborum quasi structura (‘the structure of the words, as it were’) and Ovid speaks 
of structura mei carminis (*the structure of my poem") (Lepschy 1981: 43). But 
in German Romanticism, linguists like August Wilhelm Schlegel or Wilhelm von 
Humboldt used the metaphor “organism” when they wanted to emphasize that lan- 
guage is not a conglomeration of single elements but a system made up of parts that 
are dependent on each other. Forerunners of European Structuralism like Georg 
von der Gabelentz or Ferdinand de Saussure used the term system, not structure 
(Rensch 1967: 78—79).? But they did not use the term "instead" of structure, as 
some influential accounts of the history of linguistics suggest; they employ it rather 
as a kind of generic term: a system is an entity whose characteristic property is that 
of being structured, of possessing a structure. This also applies to the term Gestalt, 
referred to above in connection with gestalt psychology, which frequently is trans- 
lated into English as ‘structure’ or ‘configuration’. 


According to gestalt theorists complex perceptions cannot adequately be accounted for 
simply as the sum of the sensations that go to make up those perceptions. The mind acts 
to organize or structure perceptions in such a way that the resultant structure of parts in 
relation has characteristic properties which are not subsumed in its constituent ele- 
ments. (Lane 1970: 23) 


Some of the definitions of structure in the Oxford English Dictionary fit in rather 
well with (at least the intuitive) conceptions of the representatives of linguistic 
Structuralism. The relevant definitions also reflect the development of the meaning 
(“glissement sémantique") of structure from an entity to the property of an entity. 


(7) An organized body or combination of mutually connected and dependent 
parts or elements. Chiefly in Biol. applied to component parts of an animal or 
vegetable organism. 


(4) The coexistence in a whole of distinct parts having a definite manner of 
agreement. 


(3) The mutual relation of the constituent parts or elements of a whole as deter- 
mining its peculiar nature or character; make, frame? 
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However, the most important distinctive feature of the various persuasions within 
the structuralist approach to language does not consist in the specific understand- 
ing of the notion of structure but in that of function. There are two radically differ- 
ent conceptions of function to be found in the different schools of Structuralism: 
(1) Function as a relation between the elements of a system and “the world out- 
side", for instance: "The function of the distinctiveness of the phoneme consists in 
guaranteeing perfect communication” (= fj). (2) Function as a relation between 
two or more elements within the system. For example: “‘Solidarity’ is the function 
which holds between content and expression", or “The function of form is that of 
selecting substance” (=f,). Function in the sense of f, is characteristic of all so- 
called functionalist approaches, e.g. the Prague School, the work of André Marti- 
net and, at least partly, the London School, which was strongly influenced by Bro- 
nislaw Malinowski's contextualism. Function in the sense of f, is characteristic of 
the Copenhagen School, more especially of its “hard core", so-called glossematics. 
In order to avoid misunderstandings, it should be stressed that neither European 
nor American structuralism is concerned with formalization in the mathematical 
sense of the term, although the complicated diagrams to be found in the writings of 
structuralist linguists are likely to make a rather formalistic impression on the un- 
initiated reader. 


3. Ferdinand de Saussure as one of the “founding fathers" 
of European Structuralism 


There is a kind of communis opinio in the historiography of modern linguistics that 
the Swiss linguist Ferdinand de Saussure was one of the founding fathers of Euro- 
pean Structuralism, perhaps indeed the most important of them. John Lyons af- 
firms: “ ... as we shall see, it is Ferdinand de Saussure who is generally regarded as 
the founder of modern structural linguistics".(1969: 6) Indeed, apart from 
H. A. Gleason's Introduction to Descriptive Linguistics and Charles F. Hockett's 
Course in Modern Linguistics, there has been no important introduction to lin- 
guistics since 1930 that does not refer to the legendary scholar from Geneva. While 
it is not completely unjustified to attribute to Saussure an important role in the gen- 
esis of European Structuralism, some major reservations need to be voiced. First, 
Saussure died long before the emergence of linguistic Structuralism stricto sensu; 
he was, if anything, “un structuraliste sans le savoir", (Mounin 1971), a struc- 
turalist unaware of the fact. Second, it was not Saussure himself who exerted such 
influence on the later Structuralist schools, but a book that appeared three years 
after his death, a compendium of three lectures on general linguistics held at 
Geneva and put together from lecture notes by Charles Bally and Albert Seche- 
haye, two colleagues who never attended his courses. Today it is generally ac- 
cepted that the Course in General Linguistics (CLG) reflects the opinions of the 
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early Geneva School rather than Saussure’s own thought. (See among many others 
Martinet 1993: 237; Bouquet 1997: iv; Milner 2002: ch. 1). Third, later Struc- 
turalist schools have had an influence on the understanding of the CLG itself, con- 
temptuously called the vulgate by Saussureologists. Unbiased interpretation even 
of the “apocryphal” Saussure seems impossible. Today the CLG is read - if at all — 
with "structuralist" preconceptions that imperceptibly modify the original ideas 
(Jäger 1984: 25; Milner 2002: 17). Be that as it may, for the aims of this chapter we 
need to trace back some important ideas in the writings of Structuralist linguists to 
the vulgate of Saussure’s writings. 


To what extent the published text of the Cours faithfully reflects Saussure’s thinking 
about language is a matter of some controversy. That controversy, however, is irrelevant 
to the subsequent chapter in the history of ideas which opens with the Cours. For, 
whether Saussure would have ‘authorized’ it or not, the text as published became the 
cornerstone of modern linguistic theory ... (Harris and Taylor 1989: 178). 


The “real” Saussure was completely unknown during the whole period in which 
Structuralism flourished in Europe. For reasons of space we will focus our atten- 
tion here on the four famous dichotomies, often wrongly called antinomies. 
The “real” Saussure seems to have preferred dualité, the French equivalent of di- 
chotomy.^ 


3.1. Langue — parole 


The distinction between langue (language as a system) and parole (language in 
use) is the bestknown and also the least original of the four dichotomies. One does 
not need to be a linguist to distinguish between “He speaks English” (“He is able to 
understand and use the English language" = langue) and “He is speaking English” 
(“He is speaking in English"). The French terms do not correspond exactly to the 
everyday meanings of the corresponding French lexemes, parole, in particular is 
often misunderstood as referring to "spoken language". In the way it is introduced 
by the editors of the CLG, the distinction involves some serious conceptual prob- 
lems. 


En séparant la langue de la parole on sépare du méme coup: 1° ce qui est 
social de ce qui est individuel; 2° ce qui est essentiel de ce qui est accessoire 
et plus ou moins individuel. 


By distinguishing between the language itself and speech we distinguish at 
the same time: (1) what is social from what is individual; (2) what is essential 
from what is ancillary and more or less accidental. (CLG: 30) 


The matter is more complicated than the editors assume. In fact, it is a classical 
case of “cross-classification”: 
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essential accidental 


social langue? 


individual langue, 


Figure 2: Conceptual problems of the langue-parole dichotomy 


We cannot distinguish between the social and the individual and between the es- 
sential and the accidental "at the same time". The two distinctions do not apply to 
the same domains. Alongside la langue “social and essential” and la parole “indi- 
vidual and accidental" (the framed terms in the diagram), there is also a place for 
what is social and accidental (= langue, — grosso modo Eugenio Coseriu's “norm”; 
Coseriu 1967 [1952]) and for what is essential and individual (= langue: the indi- 
vidual speaker's internalized knowledge of the language system, i.e. roughly 
Chomsky's competence"). (For more details see Albrecht 2007: 27-36.) 

One of the toughest problems related to the Jangue-parole dichotomy is that of 
the status of syntax. The Cours says that the sentence belongs to /a parole and not 
to la langue: “La phrase est le type par excellence du syntagme. Mais elle appar- 
tient à la parole, non à la langue" (CLG: 172). This dogmatic affirmation — surely 
made by the editors — has been interpreted in quite different ways. René Amacker 
(2001: 1745), one of the guardians of the "pure doctrine" of the Geneva School, as- 
serts that this has to be understood literally, and the French linguist Frangois Ras- 
tier (2006: 7) holds the view that even the “lexies” (for instance, machine à écrire 
"typewriter") do not belong to the langue. By contrast, another Swiss Saussure 
specialist, Peter Wunderli (1981: 75—92), holds that this notorious affirmation in 
the Cours refers only to the sentence as a concrete utterance (token), not to the syn- 
tactic pattern “behind” the actual utterance (type). This seems to be a plausible in- 
terpretation. Neither the “apocryphal” nor the “real” Saussure appears to have pro- 
vided a place for the text (or discourse) at a level between langue and parole, 
governed by traditional rules that are less coercive than the rules of a language, for 
instance the elocutionis genera of classical rhetoric. 


3:2. Synchronie — diachronie 


Today, the distinction between two dimensions of linguistic investigation, the de- 
scription of the "state" of a language at a given moment, on the one hand, and the 
study of a language from the point of view of its historical development, on the 
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other hand, appears so natural as to render any explanation superfluous. Saussure, 
however, did not regard this distinction as self-evident. He attached great signifi- 
cance to it for two reasons. (1) In his day, the diachronic approach was the prevail- 
ing paradigm in the field of linguistic investigation. The description of the state of 
a given language was generally considered to be of merely practical interest. (2) 
The emphasis he placed on the priority of the synchronic over the diachronic was 
closely related to his conception of the language as a system of coexisting units and 
relations, as "une série de differences de sons combinées avec une série de differ- 
ences d'idées" [a series of phonetic differences matched with a series of conceptual 
differences] (CLG: 166). Accordingly, he introduced the somewhat artificial 
terms synchronique ‘together at the same time’ and diachronique ‘through time’ 
(the former figuring in Littré’s famous dictionary of the French language) in order 
to stress the incompatibility of the two approaches: “L’ opposition entre les deux 
points de vue — synchronique et diachronique — est absolue et ne souffre pas de 
compromis” [The difference between the two points of view — synchronic and 
diachronic — is absolute and admits no compromise] (CLG: 119). Roman Jakob- 
son’s “diachronic phonology” and Eugenio Coseriu’s “diachronic structural seman- 
tics” have shown that Saussure was wrong in this point. A “systemic” approach to 
problems of linguistic change is perfectly feasible (see Jakobson 1975 [1931]; 
Coseriu 1978 [1964]; Albrecht 2007: 164—175). 

At the end of this section two popular misunderstandings must be rejected: 
(1) Synchronic investigation is not necessarily limited to the modern stage of a lan- 
guage; a synchronic description of Old English is entirely possible; (2) It was not 
Saussure’s intention to “abolish” historical linguistics. Some important chapters in 
the Cours deal precisely with this subject. 


3.3. Signifiant — signifié [signifier — signified] 


At first sight, Saussure’s distinction between “signifier” and “signified” seems to 
be more or less trivial. The French terms are adaptations of the classical medieval 
expressions significans and significatum, probably derived by Augustine from the 
Stoic terms semainon and semainömenon. (Milner 2002: 41—42, note 13); Albrecht 
2007: 45) This might lead us to assume that we are dealing here with a mere refor- 
mulation of the medieval explanation of the sign: aliquid stat pro aliquo, which 
means that a sign (in the sense of "signifier") "stands for" or "represents" an object 
referred to, a "real" thing or fact. In reality, the relationship between Saussure's 
signifiant and signifié cannot be considered one of "representation" but rather of 
"mutual implication". In addition, this relation between the two components of the 
linguistic sign is "arbitrary", i. e. “conventional”. There is no "natural" reason why, 
for example, the “sound image" (image acoustique) l'hevn/ should be associated 
with the concept of “heaven” and not with some other concept.> The relationship 
between signifier and signified constitutes the linguistic sign, while the rapports 
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between the different signs in a language system determine the “value” (valeur) 
each of them has. As both, signifier and signified, are units at the level of /a langue, 
they are psychical in nature. The signifiant is not simply a series of sounds, but a 
series of sound conceptions, a “sound image" [image acoustique]; and the signifié 
is not a thing or a fact that can be found in the "real world" but a concept (in the 
logical sense of the term, which does not coincide with “percepts” or internal rep- 
resentations, for instance the “ideas” of John Locke). But there is more to this di- 
chotomy. While in traditional semantics the vocabulary of a language was con- 
sidered a kind of nomenclature designed to give names to the well-ordered 
phenomena of the “world”, for Saussure its function is quite different. In his 
opinion, the network of linguistic signs — i.e. those inseparable unities of “sound 
images" and “concepts” — does not “name” previously identified phenomena in 
extra-linguistic reality, but imposes a "structure" on the malleable material of 
sound, on the one hand, and the "swirling cloud" [nébuleuse], the nebulous world 
of thought, on the other hand. 


The characteristic role of a language in relation to thought is not to supply the material 
phonetic means by which ideas may be expressed. It is to act as an intermediary between 
thought and sound, in such a way that the combination of both necessarily produces a 
mutually complementary delimitation of units. (CLG: 156) 


So two phenomena like ‘heaven’ and ‘sky’ are not “named” but in a certain sense 
“created” by the English language. 

This leads us to the conclusion that Saussure’s concept of the sign is not “dyadic” 
as has been often maintained, but triadic like in most of the traditional semiotic 
models: e.g. that of the Stoics, the Christian idea of the trinity, the triad vox — con- 
ceptus — res in scholastic philosophy, or the so-called “semiotic triangle” of Ogden 
and Richards. But unlike the traditional semiotic models, Saussure almost com- 
pletely neglects the third component, the referent or “thing meant", which remains 
more or less virtual in the Cours and in the rest of his writings. He was primarily in- 
terested in language as a "form", not in its relation to extra-linguistic reality. 


3.4. Syntagmatique — associatif (paradigmatique) 


The distinction between the syntagmatic and the “associative” dimension of lan- 
guage corresponds roughly to the difference between the best-known tropes of 
ancient rhetoric: metonymy and metaphor. Syntagmatic means “the axis of poss- 
ible combinations (in praesentia)", associative (paradigmatic) “the axis of poss- 
ible substitutions (in absentia)". In his contribution to the 4" International Con- 
gress of Linguists, Louis Hjelmslev proposed replacing the term associatif, as it 
involved too many psychological connotations, by paradigmatique. His proposal 
was successful, but the paradigmatic axis of language can be interpreted in two dif- 
ferent ways (contrast (a) and (b) in Figure 3): 
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the axis of possible combinations (in praesentia) 


— — — 


pretty 
(a) the fair-haired girl 
the axis whimsical 
of 
possible 
substitutions 


fair-haired 


(b) the auburn girl 
black-haired 


(in absentia) 


(a) The choice options at a given point in the syntagma are not mutually exclusive: 
“the pretty, fair-haired and whimsical girl > “distribution class” 
(American Structuralism) 

(b) The choice options are mutually exclusive: “* the fair-haired, auburn and 
black-haired girl” > “paradigm” (European structuralism; see section 5 below) 


Figure 3: The syntagmatic and paradigmatic axes in language 


The literature on this dichotomy is anything but extensive, for the simple reason 
that apart from the corresponding remarks in the Cours (pp. 170—175) next to no- 
thing has been found in Saussure's unpublished notes. Nevertheless, this dicho- 
tomy had a very important influence on Hjelmslev (system vs. process [forlgb]) 
and the London School, especially on Halliday (system vs. structure, see also sec- 
tion 4 below). 


4. The most important schools of European Structuralism 
(and their methods) 


Linguistic structuralism has never manifested itself as a unified doctrine but 
rather in the form of “schools” differing considerably from each other in their the- 
oretical orientation. The most important centres were located in Czechoslovakia, 
Denmark and Switzerland (more precisely Geneva, the home town of Ferdinand 
de Saussure). Less important “schools” (lato sensu), which only can be mentioned 
briefly in this article, were to be found in the former Soviet Union and in Great 
Britain. In the historiography of linguistics, Germany and Italy are generally 
considered "countries without structuralism". A recent investigation has shown, 
however, that, even during the Nazi period, Germany was less hostile to struc- 
turalist thought than has been often maintained. The review Acta Linguistica 
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(Hafniensia), one of the official publications of the Copenhagen School, was 
supported by the German administration during the occupation of Denmark (see 
Ehlers 2005) 


4.1. The Prague School 


The Prague School (Cercle linguistique de Prague) emerged in the second half 
of the 1920s (cf. also chapter 48 by Sornicola, this volume). Its Czechoslovakian 
members, for instance Vilém Mathesius (1882-1945), Bohumil Trnka (1895- 
1985), Josef Vachek and Bohuslav Havránek (1893-1985), are less well-known — 
at least today — than two important external members of Russian origin: Prince 
Nikolaj Sergeevié Trubeckoj (Trubetzkoy) (1890-1938) and Roman Jakobson 
(1896-1982), who concluded his career at the Massachusetts Institute of Technol- 
ogy as a colleague of Noam Chomsky. There is no doubt that some members of 
the circle were deeply influenced by the Cours. Sergej (Serge) Karcevskij 
(1884-1955), a disciple of Saussure's and a member of the Prague School, who 
later returned to Switzerland, was something like a "living link" between Prague 
and Geneva. However, at least with regard to two arguments, the members of the 
Prague circle adopted a critical attitude towards Saussure. First of all, they did not 
share Saussure's concept of language as “pure form". For them, language was 
"formed substance" — in the best Aristotelian tradition. Secondly, Roman Jakobson 
and other members of the Prague group rejected the somewhat dogmatic assertion 
in the Cours that the contrast between the synchronic and the diachronic point of 
view admits no compromise. 

Although the term structuralism first appears in the so-called manifesto of the 
Prague School and in the early writings of Roman Jakobson, most of its members 
preferred the epithet functionalist to structuralist. Unlike other schools notable 
which were known primarily for their highly theoretical speculations, the Prague 
circle did a lot of fruitful practical work in language analysis and description. This 
is why today — decades after the “death of structuralism" — most of the important 
theoretical concepts of the School are still common knowledge among students of 
linguistics. This is true for at least the first part of Trubetzkoy's Principles of Pho- 
nology (1939), the definitions of the phoneme as “the minimal unit in the sound 
system of a language”, or of the morpheme as “the minimal unit of a language that 
is recurrent and meaningful". Accordingly, it seems preferable to refer to a less 
familiar example of the linguistic activities of the Prague circle, instead of repeat- 
ing well-known concepts that can be found in every introduction to linguistics. 

In his Principles of Historical Phonology (1975 [1931]) Jakobson turns his 
attention not to isolated sound changes without consequence for the phonemic sys- 
tem of the language, but to those cases where a sound change induces a change 
in the whole sound system. The most important examples of this kind are the fol- 
lowing: 
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(a) The loss of a phonemic distinction by the fusion of two formerly distinct 
phonemes: Middle High German /z/ « Germanic /t/ vs. Middle High German 
Isl: gewizzen ‘knowledge, savoir vivre’ vs. gewisse 'sure(ly)' — Present-day 
German Gewissen ‘conscience’ vs. gewiss ‘sure(ly)’. 


(b) The emergence of two phonemes from the combinatory variants of one pho- 
neme: In Classical Latin, the sound difference between the initial consonant 
of campus /k/ and civitas /k’/, later /č/ was only combinatory, not distinctive. 
The difference between the corresponding Italian words campo ‘field’ and 
città “town, city’ has become distinctive because of the emergence of a new 
/k/ from the Latin /kw/: chi ‘who’ vs. ci ‘there’. 


(c) The expression of a phonemic distinction by different phonetic means: The 
difference between /&/ and /&/ in Classical Latin is maintained in Italian, 
but instead of the quantity of the vowel it is now its quality ([e] vs. [e]) that 
makes the difference: legge ‘he reads’ and legge ‘law’. 


The German occupation of Bohemia (the Czech part of Czechoslovakia) rudely 
interrupted the work of the Prague circle. The so-called New Prague School that 
emerged in the 1950s was primarily concerned with stylistics, sociolinguistics and 
discourse pragmatics (figuring at the time under the label functional sentence per- 
spective). Not only for political reasons it moved further and further from the struc- 
turalist paradigm. 


4.2. The Geneva School 


From a formal point of view, the Geneva school is closest to the Saussurean tradi- 
tion. Charles Bally (1865-1947) was Saussure’s immediate successor to the chair 
of General Linguistics at Geneva, followed by Albert Sechehaye (1870-1946), the 
second editor of the Cours to hold the chair. Bally's works on stylistics and com- 
parative linguistics (Bally 1909; 1965 [1932]) had a notable influence on the de- 
velopment of linguistically informed translation studies. Henri Frei (1899-1980) 
caused quite a stir with his Grammaire des fautes [Grammar of mistakes] (1929), 
an attempt to "predict" the future development of the French language on the basis 
of current errors made by uneducated speakers. 

Outside Geneva the school did not meet with a wide response in the world of 
linguistics. Some introductions to the history of linguistics make no mention of it 
at all. But after the publication of Robert Godel's (1902-1984) Les sources manu- 
scrites du Cours de linguistique générale de F. de Saussure (1957) (the original 
students' notes on which the Cours is based), the Geneva School changed its orien- 
tation. The main focus of its activities shifted more and more from general lin- 
guistics to an attempt for preserving the “pure” Saussurean doctrine and promoting 
our knowledge of the “real” thoughts of the founding father of structuralism. The 
years from 1967 to 1974 saw the appearance Rudolph Englers so-called "critical 
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edition" of the Cours (Saussure and Engler 1967 ff.), a synoptical edition of the stu- 
dents' notes containing even more texts than Godel's Sources manuscrites. In 
the course of the last decades of the 20th century many critical investigations and 
editions of recently found notes were published by members of the school (among 
many others Bouquet 1997; Bouquet and Engler 2002; for more details see AI- 
brecht 2007: 65-70.) 


4.3. The Copenhagen School 


The Cercle linguistique de Copenhagen (CLC) was founded in 1931; its name was 
derived from the Cercle linguistique de Prague. However, unlike its "sister organ- 
ization” (Rischel 2001: 1790) the CLC insisted on the epithet "structuralist" and 
considered itself the legitimate heir of Saussure's linguistic theory (for more de- 
tails see Siertsema 1955: ch. 1). The Copenhagen school is often identified with 
glossematics, a highly specific theoretical approach conceived by Louis Hjelmslev 
(1899-1965) and his close associate Hans-Jørgen Uldall (1907-1957). But in fact, 
the Copenhagen circle was designed as a forum "for a wide variety of linguistic 
and philological topics [...] many of which had nothing to do with glossematics 
as such" (Rischel 2001: 1791). One of its most important members, who disagreed 
at least partly with Hjelmslev's approach, was Viggo Brondal (1887-1957). Other 
important members were Paul Diderichsen, Eli-Fischer-Jórgensen and Knud To- 
geby. The "school internal" contributions of the members were published in the 
series Travaux du Cercle linguistique de Prague and in the journal Acta Linguis- 
tica (Hafniensia) referred to above. 

Though the Copenhagen school cannot be reduced to the theories of Hjelmslev 
and Uldall, glossematics (the name is derived from Greek glossa ‘language’ ) is in- 
deed the approach that the world tends to associate with it. Hjelmslev and Uldall, 
the creators of this theory, never managed to elaborate a comprehensive presenta- 
tion of their ideas, because the two scholars “had been geographically separated for 
many years and had drifted apart in their views" (Rischel 2001: 1792). Our main 
source of information today is Hjelmslev's introductory sketch Omkring Sprogteo- 
riens Grundleggelse (1943). The English translation by Francis J. Whitfield Prole- 
gomena to a Theory of Language (2nd edition 1961) initiated a world-wide recep- 
tion of glossematic theory, though not without some distortions, for Whitfield's 
translation, though authorized by Hjelmslev himself, has many of the character- 
istics of a “belle infidéle". For reasons of space we are obliged to limit ourselves 
to a short schematic illustration of two crucial points in glossematics: the theory of 
functions and the twofold dichotomies content vs. expression and form vs. sub- 
stance. 

Hjelmslev uses the term function exclusively in the sense of f, above (sec- 
tion 2), i.e function as a relation between two or more elements within a system, 
and in a slightly different way as it is used in mathematics. Instead of the usual for- 
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mula y = f (x) “y is a function of x”, for him the function is a relation between the 
two "functives" y and x. The leading Danish figure in theoretical linguistics distin- 
guishes between three general types of relations, more precisely "dependencies 
and combinations" [afhaengigheder eller forbindelselinier |: unilateral depend- 
ency = determination; mutual dependency = interdependence; free combination 
without dependency = constellation. The cover term for determination and inter- 
dependence is cohesion, that for interdependence and constellation reciprocity (see 
Figure 4 below). Every relation that fulfils the condition for an analysis is called a 
function. The three generic functions are given different names, depending on 
whether they refer to the syntagmatic dimension (= process [forløb]; et-relations in 
praesentia) or the paradigmatic dimension (2 system; aut-functions in absentia) 
(see above, section 3.4). In the first case they are called selection, solidarity and 
combination, in the second their names are specification, complementarity and au- 
tonomy. (Only the first type is illustrated by examples in the Prolegomena). The 
following diagram presents a schematic survey of the different kinds of functions: 


syntagmatic: paradigmatic: 
relation correlation 
(connexion) (equivalence) 
function: et (‘and’)-function: aut (‘or’)-function: 
determination selection specification 
cohesion solidarity complementarity 
interdependence 
reciprocity Er 
constellation combination autonomy 


Figure 4: Hjelmslev system of relations in language (adapted from Hjelmslev 1961: 41) 


In ordinary language the dichotomies expression vs. content and form vs. sub- 
stance are used more or less synonymously. In the Prolegomena, however, the use 
of these terms is entirely different and has given rise to misunderstandings and 
confusions. The relation that holds between expression and content is that of soli- 
darity, the et-form of interdependence as can be seen in Figure 4. This use does 
not match very well with the usual meanings of the two words. An expression is 
normally taken to express a content, Le at least intuitively one has the idea of a 
unidirectional relation. For Hjelmslev, however, there is no difference, generally 
speaking, between "expressing content" and "containing expression". A relation of 
mutual implication holds between the two planes, exactly like between Saussure's 
signifiant and signifié. Only by taking into consideration the dichotomy form vs. 
substance, subordinated to that of expression vs. content, does the latter distinction 
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undergo a specification. Expression-form and content-form both purely refer to the 
intrinsic (language-internal) aspects of language, whereas expression-substance 
and content-substance relate to the extrinsic aspects of language. The following 
diagram takes into account Hjelmslev's theory of stratification, published more 
than ten years after the Prolegomena (Hjelmslev 1954). 


expression content 
ius ` 6 Kr <> = solidarity 
substan orm rm —> substance à 
id 9 — = selection 
f kenematics plerematics : 
phonetics semantics 


(from Greek (from Greek 


(ancillary : : _ - «nap. | (ancillary 
Science) kenos ‘empty’) | pleres ‘full’) TIAN 
TE, 
glossematics 


Figure 5: Hjelmslev's "stratification of language" (adapted from Apresjan 1972 [1966]: 62) 


Figure 5 indicates that the relation that holds between form and substance is 
unidirectional. The form "selects" its (optional) substance. Since according to 
Hjelmslev (just like for Saussure) language is “pure form", only the central planes, 
those of “expression-form” and “content-form” are considered to be the subject 
matter of glossematics. “Expression-substance” (an amorphous matter of “sounds 
that can be articulated") and “content-substance” (an amorphous matter of “phe- 
nomena that may be conceived in form of ideas") can only claim a minor status in 
the study of language. What counts is the pure structure, the grid of distinctions or 
more precisely “oppositions” projected onto the two domains. 

The most important methodological operation in glossematic analysis is the so- 
called commutation test (corresponding roughly to the "substitution" of the Prague 
Circle). Any change on the expression plane is linguistically significant only when 
entailing a change on the content plane, and vice versa. This is a completely “sound” 
principle. Unfortunately it is not respected by the representatives of many branches 
of present-day linguistics. (For more details see Siertsema 1955; Rischel 2001; AI- 
brecht 2007: 70-76; 138-147, and Fischer Jørgensen 1979 [1966], the first critical 
account from an “insider”). 


4.4. The Russian School 


If it were purely for geographical and political reasons, speaking of a “Russian” 
school would not be appropriate. Not all linguists working in the tsarist empire or 
in the Soviet Union were Russians, and "schools" with a precise program did not 
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exist. But since the early beginnings of European Structuralism go back to the tsar- 
ist era, several important scholars who had some influence on the development of 
the structuralist paradigm should at least be mentioned here: Filipp Fedorovié For- 
tunatov (1844-1914), founder and head of the Moscow Linguistic School; Evgenij 
Dimitrievié Polivanov (1891-1938), a student of Baudouin de Courtenay (see sec- 
tion 1 above), who paid for his sharp criticism of N. I Marr’s “new theory of lan- 
guage” (see below) with his own life, and Lev Vladimirovič Ščerba (1880-1944), 
the founder of the St. Petersburg Linguistic School and one of the pioneers of pho- 
nology. In the 1920s and 1930s, the absolute sway rule of Nikolaj Jakovlevič 
Marr's pseudo-Marxist language theory, which survived its founder, made any 
serious research in linguistics extremely difficult, if not impossible. Only after the 
intervention of Jossif Wissarionovié Džugašvili alias Stalin in some Pravda ar- 
ticles in 1950 was it possible to overcome the isolation of Soviet linguistics from 
the rest of the world. Leading Russian linguists like Sebastian K. Saumjan, Juri D. 
Apresjan and Igor Mel čuk took part in the international linguistic discussions. But 
in spite of Stalin's interventions, which denied the class character of language and 
assigned to it a place between Marx's "basis" and "superstructure", the general 
ideological climate remained hostile towards any kind of “formalism”. As a con- 
sequence, many Russian linguists emigrated to western countries. 


4.5. The London School 


The expression "London School" is commonly used, but on closer inspection it is 
as inadequate as the label *Russian School" in the previous section. The founder of 
the school, John Rupert Firth (1890—1960) did indeed work in London. From 1944 
to 1956 he held the first chair of general linguistics in Great Britain. His followers, 
however, especially the so-called “Neo-Firthians”, were scattered all over Great 
Britain and some other English speaking countries. Only some of the most import- 
ant representatives can be mentioned here: David Abercrombie, a phonetician; 
John C. Catford, who developed a highly original theory of translation closely re- 
lated to Malinowski's and Firth's theory of language (see below); John Lyons, who 
held chairs of general linguistics at the Universities of Edinburgh, Sussex and 
Cambridge (a rather critical disciple of his master); Robert H. Robins, a specialist 
in the historiography of linguistics; Frank Palmer, a linguist whose main interest 
was devoted to the theory of grammar; and, last but not least Michael Alexander 
Kirkwood Halliday (see below). 

In his earlier years Firth was in close contact with Hjelmslev, but it was un- 
doubtedly the anthropologist Bronislaw Malinowski who had the greatest in- 
fluence on his linguistic thought, at the centre of which stood the concept of “con- 
text in situation". Firth refused to grant language too much autonomy, i.e. when 
isolated from its social and cultural context. Rather he “approached language as a 
complex social phenomenon rather than as a mental code" (Butt 2001: 1813). In a 
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stimulating article entitled “Atlantic linguistics" (19582) Firth characterized his 
approach as a kind of "link" between European and American Structuralism. In 
fact, he shared with Bloomfield and his followers the conviction that, as an entity 
distinct from the concrete linguistic activities of the speaker-hearer, Saussure's 
langue was a metaphysical concept like Descartes’ “mind” and its opposition 
"body". On the other hand, Firth disagreed with some methodological principles of 
the Bloomfield School, for instance the segregation of descriptions of different lan- 
guage strata like phonology, morphology, syntax and so on. Far from strictly ex- 
cluding meaning as far as possible from linguistic investigation, he considered 
meaning as the central linguistic category. On closer inspection, however, the dif- 
ference between the two approaches is perhaps less pronounced than is generally 
maintained. The concept of meaning that Bloomfield refused to accept as source of 
information was mentalistic, while Firth’s concept was not mentalistic, at all. Like 
Malinowski he regarded language as a “mode of action, not as a counter-sign to 
thought” (Butt 2001: 1810). Firth did not attach much importance to lexical mean- 
ing as an isolated phenomenon. Not only the “context of situation”, but also the 
"co-text", i.e. the purely linguistic part of the context, was the decisive factor for 
him. Today, his conception of “meaning by collocation’, in which “collocation” is 
defined as “mutual expectancies of words and sentences” (Firth 1958b: 195) be- 
longs to the standard textbook knowledge of any student of linguistics, albeit 
usually without the knowledge of the theoretical background. One of Firth's most 
innovative fields of work was prosodic phonology (suprasegmental in American 
terminology). For him prosody was a linguistic, not a “paralinguistic” phenom- 
enon. His research in this domain convinced him of the fact that there was a strong 
connection between phonology and grammar and that this should be respected by 
linguistic analysis. (For more details see Langendoen 1968 and Monaghan 1979.) 

Firth never published an extensive account of his linguistic theory. He pre- 
ferred to make stimulating proposals in form of short articles, leaving it to his dis- 
ciples to take them up and to develop them. Probably the most original and system- 
atic mind among his followers is M.A.K. Halliday. His “systemic functional 
grammar” shows some striking analogies with glossematics (structure for the syn- 
tagmatic, system for the paradigmatic axis of language). Given its strong ties with 
the concept of “context of situation” it belongs rather to the functionalist than to 
the structuralist paradigm. 


4.6. Scholars between the schools 


There is a large number of scholars who made some contribution to the paradigm 
of European Structuralism, but who cannot be assigned to any specific school. We 
can briefly mention only a few of them. 

The first of them is Antoine Meillet (1866-1936). Though a student of Saus- 
sure’s, he has never been suspected of being a structuralist. However, he is the 
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author of a sentence, which has always been attributed to Saussure: “La langue est 
un systéme oü tout se tient" [Language involves a system where everything hangs 
together.] 

Roman Jakobson has been presented here as an external member of the Prague 
School, but it should also be mentioned that after escaping from the German occu- 
pation, he spent some years in Scandinavia. There he had intensive contacts with 
Hjelmslev and other members of the Copenhagen Circle. Later he went to the 
United States, where he contributed to the development of generative phonology. 

A leading figure in both the structuralist and the functionalist paradigm de- 
serves particular mention: the French linguist André Martinet (1908-1999). He 
maintained close contacts with members of the Prague and the Copenhagen 
Circles. After the Second World War he held the chair of general linguistics at the 
Columbia University in New York for seven years, and it was there where he ac- 
quired an intimate knowledge of American Structuralism. His Eléments de linguis- 
tique générale [Elements of General Linguistics] (1960) can still be recommended 
as an excellent introduction to the theory of language. Apart from some technical 
details it is not at all “outdated”. Everybody knows (at least intuitively) that lan- 
guage can be broken down into smaller elements on two levels, first into meaning- 
ful units called morphemes (or monémes in Martinet's terminology), and second 
into distinctive elements called phonemes. But only few are able to formulate this 
insight as clearly and concisely as Martinet did in the short section of his book en- 
titled La double articulation du langage [The Double Articulation of Language] 
(1960: ch. 1.8). His strong rejection of Bloomfield's methodological principle to 
largely exclude meaning from linguistic analysis is reminiscent of Chomsky's fa- 
mous review of Skinner's Verbal Behavior (Martinet 1960: ch. 2.7). 

In Spain, a group of scholars (not a "school" stricto sensu) with traditional 
philological training, Emilio Alarcos Llorach, M. Sánchez Ruipérez, F. Rodriguez 
Adrados, L. Michelena, F. Lázaro Carreter and Diego Catalan made remarkable ef- 
forts at preserving the heritage of the Prague and the Copenhagen Schools. They 
applied the methods of these schools to the analysis and description of the Iberian 
and the classical languages (Latin, Classical Greek). 

In concluding this section, one scholar deserves mention who has frequently 
been considered a structuralist, although his ideas went far beyond the structuralist 
paradigm. Eugenio Coseriu (1921—2002) devoted a good deal of his scholarly work 
to the task of transferring the methods developed in the domain of phonology to the 
lexicon of a language. "Structural Semantics" was established by Coseriu and some 
of his colleagues, for instance Algirdas Greimas and Bernard Pottier. This approach 
is presently considered an “outdated paradigm", one that has been displaced by the 
cognitivist approach to lexical semantics. This is unjustified. “Structuralist” and 
"cognitivist" approaches are more complementary than antagonistic. It is only that 
they are concerned with different kinds of phenomena; prelogical “percepts” or “in- 
ternal representations", on the one hand, and logical “concepts”, on the other hand. 
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5. European versus American Structuralism 


In many introductions to linguistics or the history of linguistics, especially in those 
written in English or German, the term structuralism is used as if it were a gen- 
erally recognized paradigm in linguistics. But on closer inspection it becomes clear 
that it refers exclusively to American Structuralism. This has led to many and di- 
verse misunderstandings. Since the present article deals with European, not with 
American, Structuralism, it seems useful to point to some fundamental differences 
between these two research paradigms. All of them can be interpreted as corol- 
laries of the different status of meaning in the two approaches. Bloomfield was 
deeply influenced by the radical empiricist psychology of his time, therefore he 
could not admit of introspection as a valid source of information. Meaning could 
only be accepted as data for investigation once it had been "translated" into observ- 
able behavior. At the phonemic level, behavior means “distribution”, i.e. the lin- 
guistic contexts or environments in which a unit can occur. A phoneme is not con- 
ceived as a bundle of distinctive features as in Trubetzkoy's phonology but as a 
unit with a characteristic distribution. Trubetzkoy's “neutralization”, i.e. the loss of 
the distinction between two phonemes in a particular environment (for instance 
German voiceless and voiced plosives in final position, Rad ‘wheel’ and Rat "ad. 
vice, council’, both pronounced [ra:t]) was rejected as a typically “mentalistic” cat- 
egory by the Bloomfieldians, who preferred to speak of “defective distribution” in 
such cases. Chomsky’s polemic against the predominantly “taxonomic” approach 
of structuralist linguists using so-called discovery procedures, i.e. techniques that 
can be mechanically applied to a sample of language, was directed against Ameri- 
can Structuralism; European Structuralism was not the target of Chomsky’s criti- 
cism. It goes without saying that Bloomfieldians could not accept Saussure’s 
langue. When conceived as an autonomous system where everything hangs to- 
gether, it qualified as a “metaphysical” concept; when understood as an internal- 
ized system of rules observed by an “ideal speaker/hearer", it was a mentalistic 
concept. Distributionalists worked exclusively with “extendable corpora", i.e. cor- 
pora that could be extended until the point at which no new phenomenon appeared. 
The present upswing of corpus linguistics continues the tradition of American 
Structuralism. (See also section 3.4 above for the difference between paradigm and 
distribution class. For more details see Albrecht 2007: ch. 5) 


6. Concluding remarks 


Since the 1970s the “death of structuralism" has been repeatedly proclaimed. But 
this has nothing to do with linguistic structuralism. The reference is to the fashion- 
able structuralism of the Parisian intellectual elite, a current of thought that has 
only very loose relations with linguistics. In the monumental Histoire du structu- 
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ralisme [History of Structuralism] by Frangois Dosse (two volumes and more than 
1000 pages] linguistics plays only a minor role (Dosse 1991-1992). 

No one can deny that in linguistics, too, structuralism has been marginalized by 
more recent trends. There are several characteristics of European Structuralism in 
particular, which have no place in the theoretical frameworks of present-day lin- 
guistics and deviate from the practical goals of mainstream linguistics. Structuralism 


e considers the language system to be an autonomous object of investigation, 
separated as far as possible from the psychological, social and practical condi- 
tions of language use; 

e is interested only in the analysis and description of the language system, not in 
algorithms “modeling” the production of sentences and texts; 

€ takes meaning into account only as a criterion for communication, i.e. as a fac- 
tor related to the inter-subjective dimension of language, while neglecting the 
objective dimension (truth and reference, on the one hand, cognition or “inter- 
nal representation", on the other hand). 


However, as an approach focussing chiefly on recurrent regular patterns of language, 
structuralism in the broadest sense of the term will never be completely "outdated", 
because it corresponds to the methods employed by every human being thinking 
about their language with spontaneous curiosity. Structuralism today may be perhaps 
intellectually less attractive now than it was some decades ago, but its methods will 
survive, at least in practical disciplines like lexicography and the writing of grammars. 

But in conclusion we should point to an argument of even greater importance. 
European Structuralism emphasizes more than any other approach - even than the 
so-called “traditional grammar" — the specific characteristics of each individual 
language and the differences holding between different languages. Anyone setting 
out to describe a specific language completely unknown to his readers cannot help 
but go about it in structuralist terms. 


Notes 


1 For the arguments put forward against any kind of “holism” by philosophers like Moritz 
Schlick or Karl Popper, see Albrecht (2007: 221—222). 

2 This is true of the “apocryphal” Saussure of the CLG, not for the "real" Saussure: In the 
Ecrits de linguistique générale (= Bouquet and Engler 2002) the term occurs at least 
twice in highly significant contexts (pp. 25; 114). 

3 The Oxford English Dictionary, second edition, Oxford 1989, tome XVI, 958-959. The 
numbers are those of the different definitions in the dictionary. For more technical details, 
especially with regard to “structure” in mathematics, see Albrecht (2007: 216-217). 

4 Bouquet and Engler (2002: 17-19; 298-299); John Lyons (1973: 11-15) gives a quite dif- 
ferent list of the dichotomies, which can neither be accepted nor discussed here. 

5 For the history of the notion “T arbitraire du signe" see Coseriu (19672). 
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1. Functionalism in twentieth-century Europe 

2. Problems in defining “European functionalism” 

3. Aspects of functionalism 

3.1. Syntax 

3.2. Phonology 

3.3. Representations of synchrony and diachrony 

3.4. Typology vs. characterology 

4. Perspectives of current functionalist research 

1. Functionalism in twentieth-century Europe 


The traditions of linguistic functionalism are manifold. In the course of the twen- 
tieth century they branched off in various directions from the mainstream of lin- 
guistic disciplines that had developed in Europe in the years spanning the end of 
the nineteenth and beginning of the twentieth century. Sketching their broad out- 
lines is no easy task, as they evolved in different countries and at different times, 
often with distinctive features that reflected the more general ideological and cul- 
tural characteristics of the contexts in which they arose. The traditions themselves, 
moreover, have not always shown a high level of awareness of common intellec- 
tual bases. This has been the case especially in recent decades, in which functional- 
ism has become a general theoretico-methodological approach definable as “anti- 
reductionist”, often lacking the teleological and metaphysical implications of some 
of the initial theories. This approach has inevitably been interpreted in many dif- 
ferent ways by individual scholars. 

The history of functionalism reached a crucial juncture in the third decade 
of the twentieth century with the founding, in 1926, of the Prague Linguistic 
Circle. It was then that the contributions of numerous individuals from different 
countries and scholarly backgrounds were welded into a unified programme 
of research, summarised in the Theses presented by members of the Circle at 
the First International Congress of Linguists in the Hague in 1928. This pro- 
gramme would continue to serve as a point of reference in the development of 
many later functionalist models in both Europe and North America. But the fif- 
teen years of the Circle’s activity that preceded the outbreak of the Second World 
War in many ways represented a unique period in general linguistics, brought 
about by the convergence of numerous cultural and historico-political develop- 
ments. 
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The starting-point for a discussion of these developments must be the state of 
linguistics in the first two decades of the twentieth century and its relationship to 
the more general intellectual and cultural climate of the time. New directions of re- 
search had, from the end of the nineteenth century, revealed anxieties and dissatis- 
factions created by the realisation that there was as yet no general theoretico-me- 
thodological underpinning for the vast body of philologico-linguistic knowledge 
that had accumulated over the preceding decades. An awareness of the need for 
a "refounding" of the linguistic sciences may be seen, with various inflections, in 
the work of a number of protagonists of early twentieth-century linguistics, the 
attempts associated with that awareness also reflecting changes in the general pan- 
orama of ideas. It should not be forgotten that this same period, from 1900 to 1920, 
saw the emergence of quantum mechanics and the theory of relativity, and that the 
concepts of relativism, pluralism and experimentalism became central to both phil- 
osophy and literature. 

Also to be taken into account are the extraordinary historico-political up- 
heavals witnessed by eastern Europe in this period, notably the Russian Revolution 
and the end of the Austro-Hungarian Empire. The emergence of the state of Cze- 
choslovakia from the ashes of the Empire and the immigration of groups of Rus- 
sian intellectuals in the wake of the Revolution combined to create in Prague a very 
particular atmosphere. The optimism and ethical commitment to the common good 
that characterised the élites of the young Central European republic fused with the 
Russian intelligentsia's Romantic view of research as perpetual experimentation 
and their fervent faith in collective action. 

The events that were to mark the history of functionalist ideas and traditions 
after the 1920s are no less remarkable. Once again, collective and individual des- 
tinies are seen to have been intertwined. The tragic death of Trubetzkoy, a leading 
member of the Prague Circle, in Vienna in 1938, the Nazi invasion of Czechoslova- 
kia in 1939, Jakobson's emigration to North America (an experience shared by many 
intellectuals of the *Old World"), the difficulties of survival and the isolation, both 
during and after the war, of the linguists of Prague, Brno and other universities of 
eastern Europe by comparison with their colleagues in the west — all are elements of 
the dramatic history of Europe in the twentieth century. After the war, linguistic re- 
search continued in a number of eastern European universities, with the functionalist 
tradition maintaining a central role. The isolation in which the exponents of func- 
tionalism worked in eastern Europe favoured the preservation and independent de- 
velopment of various ideas that had been fundamental to the first Prague period, es- 
pecially those deriving ultimately from Mathesius, definable as empirico-historicist 
and text-oriented. Albeit often from a distance, the scholars of the so-called Second 
Prague School — Danes, Firbas, Hajicová, Sgall, Svoboda, Vachek and others — con- 
tributed to the development, consolidation and circulation in western Europe of the 
functionalist ideas of the early twentieth century. Prague and Brno thus continued to 
serve as important centres of research, even after the Second World War. 
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The end of the 1950s and above all the 1960s saw the emergence in western Eu- 
rope of a broad network of research centres that developed the conceptual heritage 
of Prague in different ways and in many new directions. Its protagonists included 
linguists of a generation that had been directly associated with the activities of the 
Prague Circle, such as A. W. De Groot in the Netherlands and André Martinet and 
Lucien Tesniére in France, as well as younger scholars who had come into contact 
with those activities only indirectly, such as Simon Dik, again in the Netherlands, 
and Michael Halliday in Great Britain. The models devised differ from one another 
significantly, showing the influence of individual training and the cultural diversity 
of the countries in question. Interest in the formalisation of functionalist concepts 
becomes more marked from the 1960s (in the thinking of Halliday and above all 
of Dik, but also in that of Hajicová and Sgall), perhaps also under the influence of 
North American linguistics, which was becoming increasingly formalistic in orien- 
tation. 

In conclusion, then, the history of functionalism is the history of individuals, 
no less than of cultural and ideological contexts. It should also be remembered 
that two important lines of linguistic research of the second half of the twentieth 
century — textual linguistics, originating in the German cultural sphere, and later, 
from the 1980s, functional-typological linguistics, whose leading representative 
has been the British Slavist Bernard Comrie — developed models that were per- 
meated in various ways by the Prague functionalist principles. 


2. Problems in defining “European functionalism” 


A discussion of “European functionalism" must begin by tackling a number of ter- 
minological and conceptual questions. The first and most obvious of these relates 
to what might be meant by the term functionalism, given that notions of “function”, 
“functionality” and “functioning” have formed a part of theories and procedures 
of linguistic analysis that are not only different from one another, but sometimes, 
in certain historical phases, even explicitly contraposed. The contention of André 
Martinet (2006: 677) that the concept of “function” should be limited to its “most 
usual meaning of ‘adapted to achieve some end’” has the obvious advantage of 
drawing attention to a key idea, one that profoundly influenced the pioneering 
environment of the Prague Circle, with which the term functionalism has been pri- 
marily associated in many studies. If the definition is restricted in this way, how- 
ever, it becomes difficult to see the connections between the various senses of the 
term and their historical origins, which might in some ways be less diverse than is 
sometimes supposed. The complex evolution of functionalist ideas is not merely of 
historiographical interest, but has implications for the study of contemporary the- 
ories, methodologies and analytical practices and for a deeper understanding of the 
problems they pose. 
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A second question pertains to chronology. If the true manifestos of functional 
linguistics were formulated in the late 1920s, in the Theses of the Prague Circle and 
in the article “Funkční lingvistika” (“Functional linguistics") by Mathesius, the 
Circle's first president, the core elements of the ideas they presented had had a much 
longer gestation. Indeed, these take us back a number of decades, at least to the 
1870s and 1880s, when in various spheres of European linguistics a constellation of 
concepts opposed to organicism had taken shape, one that pointed to new epistemo- 


29 66 


logical horizons, with its ideas of “the centrality of the speaker", “the centrality of 
meaning", *the centrality of the observation of a present reality". At the other end of 
the chronological spectrum, as has already been noted, the use of the term func- 
tionalism becomes more generalised after the Second World War and above all in 
the last decades of the twentieth century, losing some of its initial characteristics. 

A third and final question concerns spatial delimitation. As various historians 
have observed, all conceptions of Europe are ideologically rather than geographi- 
cally determined, and therefore reflect the cultural and political orientations of a 
particular period and are subject to relative changes. The subject of “European 
functionalism" provides a particularly interesting example of this historiographical 
problem. Long-term historico-cultural factors certainly play a part in its definition, 
such as the notable differences between the various centres of the Slavic area, par- 
ticularly as regards their attitude towards western Burope. (Warsaw and Prague al- 
ways played an active part in a Central European cultural bloc, while St Petersburg 
and Moscow were capitals of a distant empire, whose destiny had always followed 
a different course from that of western European countries.) The picture has never- 
theless been complicated by a number of contingent historical factors that con- 
tributed to the convergence and divergence of ideas, examples including the flight 
of Russian intellectuals to Vienna, Prague and Paris after the October Revolution, 
and, as mentioned above, the isolation of the countries of eastern Europe after the 
Second World War and the emigration to North America of some of functionalism's 
leading exponents. The Europe of the linguists who continued to work, in the 
Prague tradition, on the other side of the Iron Curtain was a Europe of the imagin- 
ation, no less than that which Roman Jakobson took with him into the universities 
of the United States. After the Prague years, Jakobson's personal conception of 
functionalism, which had been profoundly influenced by the Moscow and St Pe- 
tersburg circles in which he had been trained, took a new turn, marked by his intro- 
duction to the formalising climate of North America and his exposure to the emerg- 
ing models of the mathematical theory of communication. But in the centres of 
Slavonic and general linguistic research at Harvard and Bloomington, to which 
Jakobson made an important contribution, some of the functionalist ideas of the 
Old World continued to exist under a different guise. 

The “inter-Slavic” character of the semiotic tradition has often been underlined 
(see Toman 2006: 104-133). But the Prague community was less unified than 
is usually supposed. The unanimity of the stance taken by the Prague Circle in its 
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early years (seen, in particular, in the collective character of the Theses) concealed, 
below the surface, a number of differences in orientation, interests and "style" 
among its members, in which it is possible to discern characteristic differences in 
cultural background and theoretico-methodological approach (see Toman 2006: 
7-69). Already evident in the founding phase of the Circle, when the work of the 
Prague scholars seems most intense and unified, such rifts would persist and in- 
deed deepen in the Prague Circle's long period of crisis and dispersal, which began 
with the Second World War. 


3. Aspects of functionalism 
3.1. Syntax 


3.1.1. The primary ideas of Prague syntax 


Syntax, and in particular the theory of the sentence, constitutes one of the oldest 
and most fruitful areas of study within the functionalist traditions. It is already in 
evidence in Prague before the formation of the Circle, in Mathesius's many works 
on subjects such as word order in English, non-verbal predication, nominal con- 
structions and passive constructions in modern English, attributive sentences, the 
pronominal Subject in colloquial Czech, and the function of the Subject in modern 
English. But it is above all in two important works of the 1920s that Mathesius 
(1924, 1929) reflects more broadly on the general theory of the sentence, tackling a 
number of problems that the linguistics of the last decades of the nineteenth cen- 
tury had bequeathed to the twentieth, such as the definition of syntax, the definition 
of the sentence, the nature of the relation between Subject and Predicate, and the 
different types of predication, thetic and categorical (Graffi 2001: 73-132), 
Mathesius's writings show a profound knowledge of the grammatical and syn- 
tactic research undertaken in various European centres in the years spanning the 
end of the nineteenth and the beginning of the twentieth century. Despite the many 
influences in evidence, traceable to Wegener, Sweet, Jespersen, Brentano and 
Marty (all duly acknowledged), it is possible to discern in these writings an orig- 
inal conception, in which a number of ideas fundamental to the most important 
functional theories of syntax of the latter part of the twentieth century are already 
seen to be taking shape (see below). The key points may be summarised as follows: 


i. The study of the sentence is central not only to syntactic theory but to linguistics 
as a whole. This viewpoint was still far from common in the 1920s, even if we 
should note the interesting similarity to positions taken by Meillet and Bloomfield 
(see Sornicola 1992: 37). It would be reaffirmed in various ways by the syntactic 
theories that developed from the 1950s onwards. 
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ii. The logical conception of the sentence is rejected and superseded. The sentence 
does not correspond to a logical judgement (Mathesius 1924: 4). This point of view 
was not new and it would be reaffirmed within a different conceptual framework by 
Hjelmslev. The way in which it is argued is nevertheless interesting, making refer- 
ence to the complex factors that determine the “fluency” of speakers. 


iii. The psychological conception of the sentence is also subjected to rigorous 
criticism. Mathesius (1924: 1-2) regards as insufficient the definitions of the sen- 
tence given by Wundt and Paul, which are in various ways psychologically based. 
Indeed, it is not necessary to ask through which mental processes the sentence is 
created, since these "are very difficult to grasp and [...] are moreover very unstable 
during speech" (Mathesius 1924: 2).! Seeing the communicative act as a “chain of 
three rings", consisting of the whole situation, the linguistic conception of this situ- 
ation, and the expression of that linguistic conception in articulated sounds, Mathe- 
sius emphasises that for linguistics it is the linguistic conception that is of greatest 
importance, “the middle element in the chain". This involves a dual procedure, the 
first part analytical, in which the situation is selectively broken down into “name- 
able elements", and the second synthetic, in which these elements are correlated in 
the act of the formation of the sentence. It is worth noting that, as for the rejection 
of the sentence understood as a logical judgement, this anti-psychologism finds an 
echo in Hjelmslev, seeming to point to the existence of areas of common ground 
within nascent structuralism. Both anti-logicism and anti-psychologism, more- 
over, are standpoints whose fortunes fluctuated in twentieth-century thinking about 
syntax. 


iv. From the objections expressed under (ii) and (iii) above, it follows that the sen- 
tence must be defined in purely linguistic terms. Indeed, the primary aim of the 
sentence is not to represent concepts, but to establish communication between 
speaker and listener. The functional definition of the sentence begins with its com- 
municative objectives: according to Mathesius (1924: 4—5), “The sentence is an 
elementary communicative utterance, with which the speaker, actively, and in such 
a way as to leave an impression of familiarity and subjective completeness in formal 
terms, approaches a particular reality." In this definition the sentence is assigned 
three constitutive components: (a) communicative character, (b) the moment of ac- 
tuality (see v), and (c) completeness understood in the formal, subjective sense, in 
which intonation plays a crucial part. 


v. The *moment of actuality" is what "creates the sentence from the words". It is 
related to the principle of the “actual articulation of the sentence" (aktuální členění 
vétné ‘real articulation of the sentence’ = Functional Sentence Perspective). This 
principle concerns the observation of the way “the sentence is inserted into the real 
context from which it has emerged”, and is in opposition to the principle of “formal 
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articulation", which concerns "the composition of the sentence in grammatical 
elements" (Mathesius 1939: 171). There is in evidence here a dualism between 
"dynamic" and “static” syntax that has its roots in nineteenth-century linguistic 
thinking, in the Humboldtian antinomy between energeia, the incessant linguistic 
activity of the speaker, and ergon, the finished, petrified product of this activity, 
between the actual "functioning" of language in speech, activated by the individual 
speaker, and consideration of the abstract logical relations between the compo- 
nents of the sentence. This polarisation derives from the well-known Romantic 
opposition (found in philosophy before linguistics) between “what is living" and 
"what is dead", an opposition that had been given expression by Bréal and We- 
gener (see Sornicola 1992: 45-48, Nerlich 1990). In formal articulation the consti- 
tutive elements are the grammatical Subject and Predicate, whereas in actual ar- 
ticulation they are the “point of departure" of the utterance, or the “theme” - i.e. 
that which is known in a particular situation and from which the speaker starts out — 
and the "nucleus" of the utterance — i.e. that which the speaker asserts in relation to 
the point of departure (Mathesius 1939: 171). The components of formal articu- 
lation do not necessarily correspond to those of actual articulation. This potential 
discrepancy between the functions of Subject and Predicate, on the one hand, and 
of point of departure and nucleus of the utterance, on the other hand, had already 
been discussed by Hermann Paul, who differentiates, respectively, between “gram- 
matical Subject and Predicate” and “psychological Subject and Predicate” (Paul 
[1880] 1920: 124—125). 


3.1.2. Developments in functional syntax 


The above-mentioned discrepancy raises the important theoretical question of how 
to represent the relations between functions that pertain to different types of articu- 
lation. Debated at length in the principal grammatical theories of the twentieth cen- 
tury, the question was tackled by various exponents of functionalism through the 
development of models that have become classics, such as those by Danes (1964), 
Halliday (1967-1968), and Dik (1989). These works present multilayered repre- 
sentations of grammar that postulate the existence of interdependent but auton- 
omous levels — syntactic, semantic and pragmatic (this last corresponding to Func- 
tional Sentence Perspective) — for each of which the relative functions are defined. 
Especially problematic and controversial is the definition of the pragmatic func- 
tions of Topic (or in some models, Theme). Multilayered functional represen- 
tations were widely employed in research on syntactic typology in the last decades 
of the twentieth century. They provided the theoretical frame of reference for many 
comparative studies of the typological properties of languages, which made refer- 
ence to Mathesius’s theory (1924) of the non-alignment of the syntactic, semantic 
and pragmatic functions, enriching it with an imposing mass of empirical data and 
developing it into more refined forms. And it is worth noting, finally, that from the 
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1980s multilayered functional models came to be integrated into formal grammati- 
cal theories (for example, in generative grammar), which originally regarded the 
concept of function as deriving from that of structure. 

The actual articulation of the sentence is closely related to the “processes” of 
temporal linguistic organisation, one of the principal manifestations of these pro- 
cesses being the order of constituent elements (see Mathesius 1941-1942). These 
concerns also constitute one of the leitmotifs of functional syntax in the so-called 
"Second Prague School". Particularly important in this respect are the works of 
Firbas, who reformulates and develops ideas about the actual articulation of the 
sentence through the concepts of “communicative dynamism” and of “theme” and 
"rheme" (Firbas 1964, 1992). These concepts would be utilised by other func- 
tionalist theories, such as that of Halliday, and in text linguistics. They have been 
applied in the study of various typologies of written and spoken texts, which has 
often made use of statistical methods (see the many examples provided by the 
Prague Studies in Mathematical Linguistics). An idea very characteristic of the 
Prague tradition is that of the sentence (utterance) as a "field of tension" (Span- 
nungsfeld) between functions of distinct though interdependent levels (see Sorni- 
cola 1992: 50). This tension, seen in the many stylistic and/or typological options 
for the articulation of the sentence, ultimately reveals the unresolved conflict, to 
which all speakers are subject, between the structural possibilities of a system and 
the functioning of language in concrete linguistic activity. This conflict is clearly 
evident in the thinking of another functionalist, Tesniére, who speaks in his syn- 
tactic theory of an “antinomie entre l'ordre structural, qui est à plusieurs dimen- 
sions, et l'ordre linéaire, qui est à une dimension”. For Tesniére (1959: 21), “cette 
antinomie est la “quadrature du cercle’ du langage. Sa resolution est la condition 
sine qua non de la parole." The problem of the representation of the sentence as 
a process - that is, as the linear projection of its elements — has also been tackled by 
the Dutch functional tradition within a different theoretical framework. It is one of 
the key ideas of functional syntax. 

It should be noted, finally, that the Prague functionalist models, and models that 
were inspired by Prague, are essentially models of dependence and in their various 
ways regard properties of constituency as secondary. The models of syntactic func- 
tionalism devised by Martinet (1985) and Halliday (1985) nevertheless attempt to 
reconcile the two types of representation. 


3:2: Phonology 


Phonology is probably the level of analysis whose development in Europe has been 
most influenced by Prague models. Although various scholars of the Prague Circle 
contributed to the identification of its fundamental principles, its definition and its 
presentation as a systematic theory are attributable to Nikolai Trubetzkoy. It could 
be argued that, at least in the decades following the Second World War, the creation 
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of a theoretical framework for phonology was of central importance for the devel- 
opment of structuralist and functionalist theories. The reasons for the pre-emi- 
nence of phonology in this period are multiple and complex, and can be only 
briefly summarised here. Some relate to the fact that thinking about the science of 
sounds, about morphology and about syntax in the decades before and after the war 
was not equally advanced. Anderson (1985: 316) argues that "structuralist lin- 
guistics simply didn’t have a serious ‘theory of syntax", and that this was a deter- 
mining factor in the development of generative syntax, a thesis that has been 
widely embraced in the United States. While this view might seem rather one- 
sided, it is clear that there was less interest in the theory of syntax than in phonol- 
ogy in the first half of the twentieth century. Trubetzkoy's work in this area, 
brought together in his Grundzüge der Phonologie (published posthumously in 
1939), is remarkable for its analytical and logical rigour, and it is easy to under- 
stand the appeal it must have had in phases of linguistic research that were in- 
fluenced by operationalism. But this cannot be the whole explanation. Also to be 
taken into account are the profound changes brought about by the war itself, with 
the difficulties of reorganisation that were experienced by European research 
centres and the emergence of North America as a new international hub of re- 
search. In this transition Jakobson functions as a bridge between the cultural 
heritage of Russia and Prague and the new scholarly stimuli of the New World. The 
fact that it was through him that the Prague theories were disseminated in the 
United States was not without consequence, in terms of both the emphasis on pho- 
nology and the development of new binary phonological models influenced by the 
theory of information. Phonology and more generally the aspect of the signifier 
had always featured prominently among Jakobson's own interests, many and wide- 
ranging as they were. And as noted above, his tendency towards experimentalism 
would give rise in his American period to research that was ever further removed 
from the original conceptual centre of the Prague Circle. Jakobson at any rate se- 
lected only those aspects of the complex thinking of Prague that suited him, pres- 
enting that thinking in a form that was as brilliant as it was personal, and that in 
many contexts would come to be regarded as the most authoritative. 

Another interesting question is that of the close relationship between structure 
and function, a relationship that finds expression in phonological models, to the ex- 
tent that one might ask whether, at least as far as European phonology is concerned, 
it is legitimate to distinguish between functional structuralism and other kinds of 
structuralism. In various works of the late 1920s the idea of there being a differ- 
ence between phonetics, a discipline that studies sound “comme fait physique 
objectif", and phonology, which is concerned with sound “comme élément du sys- 
tème fonctionnel", is clearly articulated (Theses: 37). Objective facts “n’ont qu'un 
rapport indirect avec la linguistique". They correspond to speakers’ subjective 
acoustic and proprioceptive representations, which in turn “ne sont des éléments 
d'un systeme linguistique que dans la mesure où elles remplissent, dans ce sys- 
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téme, une fonction différenciatrice de significations" (Theses: 37). Later, in the 
Grundzüge der Phonologie, Trubetzkoy would observe that the phonologist con- 
siders “only that aspect of sound which fulfills a specific function in the system of 
language [author's emphasis]", and that the emphasis on function is thus what 
clearly distinguishes phonology from phonetics (1939a: 11). Here too the develop- 
ment of functional ideas has important antecedents in the thinking of the last dec- 
ades of the nineteenth century, especially in the distinction drawn by Baudouin de 
Courtenay between “anthropophonetics”, which is concerned with the “conditions 
of pronunciation and of the phonational-auditory production of language”, and 
“psychophonetics”, which studies only what exists as psychological representation 
of sounds, and in the definition of “phoneme” as “the psychological equivalent of 
physical sound" (see Baudouin de Courtenay 1904: 246, 1927: 278—280)). The 
close interweaving of functional and structural analysis is clearly evident in the 
definition of various scholars: phonology is concerned not only with the function 
that sounds have of differentiating meaning, but also with the systemic relation- 
ships between the units that bear such functions, and with the rules that govern 
their combination, which vary from language to language. 

In phonology, moreover, we encounter the whole spectrum of different senses 
of the term function. Function as "functional meaning" appears in Mathesius 
(1929: 129, “phonology studies phonemes, i. e. sounds endowed with functional 
meaning"), while a wider range of senses may be found in the Grundziige der Pho- 
nologie, some of which place greater emphasis on the relational interpretation. The 
concept of "external function" is seen as comprising the triad expressive function, 
function of “appeal” and representational function, is regarded by Trubetzkoy as 
essential to defining the foundation of phonology in speech. Speech always con- 
sists of a triad of components — a speaker, a listener and a topic — and thus every ut- 
terance has three aspects: a manifestation (or expression) of the speaker, an appeal 
to the listener, and a representation of the topic. These functions are associated with 
distinct characteristics of acoustic impressions, characteristics that belong to three 
different levels (Trubetzkoy 1939a: 14—15). The representational level falls firmly 
within the bounds of phonology, while the levels of expression and of appeal be- 
long to the related field of phonostylistics (1939a: 14—26). Trubetzkoy thus pres- 
ents an integrated view of the linguistic *values" (Sprachwerte), which belong to 
all three functional levels and come together in the linguistic system. Even the le- 
vels of analysis are seen as being integrated: “The content of an observed sentence 
can be understood only if its constituent words are related to the lexical and gram- 
matical elements of the system of language; and the signifier aspect of these el- 
ements necessarily consists of phonological units" (1939a: 15). 

Like Mathesius, Trubetzkoy maintains that phonology should ask which differ- 
ences in sounds are related to differences in meaning in a given language, and that 
such correspondences form part of the characteristic functional dimension of pho- 
nology. But this dimension is not constituted only by aspects of semantic function. 
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Indeed, “the signifier of the system of language consists of a number of elements 
whose essential function is to distinguish themselves from each other. Each word 
must distinguish itself by some element from all other words of the same language” 
(Trubetzkoy 1939a: 11). This differentiating function of the units of the signifier 
is realised by a limited number of differentiating elements, whose various combi- 
nations make it possible to distinguish all the words in a language’s vocabulary 
(Trubetzkoy 1939a: 11). Later, the French scholar André Martinet (1960) would 
present the relationship between phonemes and units of a higher order according to 
the model of the double articulation of language, which distinguishes units of the 
first articulation, monemes, endowed with phonetic form and meaning, from pho- 
nemes, units of the second articulation, endowed with differentiating function but 
in themselves devoid of meaning. 

The dual representation of differentiating function, at the level of the signified 
and the level of the signifier, shows the influence of the Saussurian theory of the 
sign. Moreover, the abstract representation of phonemes as “relations, oppositions, 
etc., quite intangible things, which can be neither perceived nor studied with the 
aid of the sense of hearing or touch” (Trubetzkoy 1939a: 13) is strikingly similar 
to the Saussurian theory of “linguistic value” (1914: 136ff.). It differs from earlier 
representations in two important respects: it is founded not on psychology, nor on 
the correspondences between the phoneme and its phonetic variants (which had 
anyway been lucidly examined at the methodological level, with the fine-tuning of 
rules for the definition of phonemes), but on Karl Bühler’s notion of abstraktive 
Relevanz (‘abstract relevance’) (Trubetzkoy 1939a: 42-43), to which should also 
be attributed the well-known definition of a phoneme as “the sum of all the features 
that are phonologically relevant” (Trubetzkoy 1939a: 36). This notion of abstract 
relevance is fundamental to Trubetzkoy's functional conception of phonology, giv- 
ing it a marked theoretico-speculative character that was criticised from various 
positions within American structuralist and generativist scholarship (see Anderson 
1985: 278-303). 

An interest in abstract relations represented according to formal schemes is 
also clearly in evidence in the identification of structural features of the system, 
such as types of opposition (bilateral, constant, equipollent, gradual, etc.) and of 
correlation (of tension, intensity, sonority, etc.) and bundles of correlation, and in 
the development of the theory of markedness (Trubetzkoy 1939a: 90-169), which 
would later be reformulated and re-applied in many different contexts. 


3.3. Representations of synchrony and diachrony 


The relationship between synchrony and diachrony is one of the key points of 
Prague linguistic theory. It is conceived dialectically, without erecting insurmount- 
able barriers between the two dimensions. Diachrony can in fact include areas of 
stability comparable to synchronic states. At the same time, synchrony can incor- 
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porate phenomena proper to the diachronic dimension, for two reasons: it contains 
myriad oscillations in the way in which linguistic phenomena are realised by 
speakers; moreover, speakers have an awareness of elements belonging to different 
temporal layers. The idea of the existence of micro-oscillations was developed by 
Mathesius (1911), and is the basis of his model of the “potentiality” of the phenom- 
ena that characterise a language. As the dvvautc contained in all its states, poten- 
tiality is the force that keeps every language in constant motion, that more or less 
visibly determines its changes. It is interesting that in the very years in which Saus- 
sure was propounding the opposition between synchrony and diachrony in the lec- 
tures of his Cours de linguistique générale (in terms that were far more complex 
and problematic than was suggested by Bally and Sechehaye in the posthumous 
edition of the Cours of 1914), Mathesius was conceiving a distinction between the 
“statics” and the “dynamics” of language, which emphasised not the temporal fac- 
tor but the stability or instability with which phenomena are realised. In synchronic 
statics there thus exist both stable or constant phenomena and oscillating or vari- 
able phenomena. Like linguistic change over time, the latter belong to a dynamic 
dimension. A key point is the centrality of the observation of "individual lan- 
guages” in the present. Mathesius (1911: 30—31 passim) believed that “the pro- 
cedure leading from the static to the dynamic issues is the safest in linguistics", and 
that “the dynamic issues can only be solved after a more thorough research in in- 
dividual languages has firmly established which phenomena can have been re- 
garded in them, at a given time, as constant and which as potential". Indeed, “only 
then will one be in a position to ask how long a potential phenomenon a can still 
have been regarded as basically the same phenomenon, only slightly affected by 
a shift of its potentiality, and when one must have already admitted the existence of 
a new phenomenon f, replacing a”. 

The primacy of the synchronic method over the diachronic is also put forward 
as one of the fundamental principles of modern linguistics in the Theses of 1929 
(33-34), which had by this stage adopted Saussurian terminology. Yet a further 
principle is affirmed here, regarding the interchangeability of the diachronic and 
synchronic laws themselves. In opposition to the Geneva School, the Prague 
scholars maintain that the representation of the langue as a functional system should 
be applied to both synchrony and diachrony. They emphasise the need for historical 
linguistics to move away from a conception of facts that, even if regular, are pro- 
duced arbitrarily and casually, towards a nomogenetic conception of “enchainement 
selon les lois des faits évolutifs" (Theses: 36). Thus linguistic change would not be a 
destructive force, determined by chance and unsystematically, but would tend to 
move in the direction of the system, towards its stabilisation and reconstruction 
(Theses: 34). This idea presupposes that the foundation of both synchronic and 
diachronic linguistic movements is the speaker understood as participant in and pro- 
tagonist of the functioning of language. It is an idea that has had considerable suc- 
cess in recent models of typology and grammaticalisation, which have proposed 
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structural cycles (or laws) of transformation that are clearly indebted conceptually 
to Prague thinking. Yet the nomogenetic model of diachrony is not free from theor- 
etical difficulty. The actions of speakers in synchronic functioning and the diach- 
ronic transformations that pertain to language as a whole relate to dimensions that 
are incommensurable, the former occurring on a small scale and the latter on a large 
scale. It is difficult to believe that Mathesius, who saw this difference very clearly, 
was not aware of the problems entailed in the nomogenetic approach. Here as else- 
where the statements of the Theses reveal an imperfect fusion of different ideas. 


3.4. Typology vs. characterology 


A final dichotomy discernible within the Prague Circle pertains to the theory and 
methods of linguistic comparison. Against the approach that seeks to delineate the 
structural taxonomies of linguistic families, represented by the works of Trubetz- 
koy (1939b) and later Kurytowicz (1964), may be set the “characterological” con- 
ception, concerned with the fundamental "characters" of individual languages and 
represented by Mathesius (1928). The first approach may be seen as a continuation 
along structuralist lines of a nineteenth-century tradition that had been influenced 
by organicism in biology, while the second draws inspiration from historico-em- 
pirical descriptivist traditions, which may be traced back as far as certain gram- 
maticographic endeavours of the seventeenth and eighteenth centuries (see Sorni- 
cola 2001: 68-69). The principle affirmed in the Theses (7) whereby “la meilleure 
facon de connaitre l'essence et le caractére d'une langue, c'est l'analyse synchro- 
nique des faits actuels, qui offrent seuls des matériaux complets et dont on peut 
avoir le sentiment direct" is taken up again by Mathesius (1928: 59), who defines 
characterology as the structural study of a language, founded on the notion of 
“value” and on synchronic relations. 

While there is general agreement on this point within the Circle, opinions differ 
as to the aims and methods of comparative study. In the Theses (8) it is argued that 
such study should not be directed solely towards genetic problems, but should aim 
more broadly, so as to “découvrir les lois de structure des systémes linguistiques 
et de l'évolution de ceux-ci". This objective may be realised not only through study 
of languages that are genetically and structurally different, but also by considering 
languages of the same family, such as the Slavonic languages, and in such cases 
the comparative method can reveal the divergences and convergences that arise in 
relation to a sizable stock of common structures. In line with assumptions about 
diachrony, the phenomena of divergence and convergence are seen not as being ac- 
cidental, but as connected by common laws. It is precisely this that makes it pos- 
sible to assign to linguistic evolution a typology, or “le groupement d'une série de 
faits mutuellement solidaires en un seul tout" (Theses: 8). 

The holistic conception in evidence here is one from which characterology is 
distinguished in a number of ways. The latter aims to provide a more scientific 
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analysis of a single language, while "all attempts at a systematic linguistic typol- 
ogy are, at the present stage of our knowledge, premature and lead therefore to un- 
necessary complications of problems only" (Mathesius 1928: 59). Such scepticism 
seems to reflect a phase in the history of linguistics in which a new awareness 
of the difficulties and limitations of classification — both genetic and morphologi- 
cal — was emerging (Sornicola 2001). There are, moreover, important links be- 
tween the characterological viewpoint and stylistics. Mathesius (1911: 22) holds 
that stylistics and linguistics differ in their aims, but not in their object of study. 
The relationship between the two disciplines may be defined in terms of the theory 
of potentiality. Both must take into consideration the phenomena relative to the 
speech of single individuals, because it is these that reveal “the full extent of the 
potentiality of the concerned language [sic]" (Mathesius 1911: 22), and therefore 
its character and its style, terms that are more clearly analogous in Mathesius 
(1928). In the study of the characters of a language, the epistemological complex 
related to “thought”, “concept” and “meaning” is abandoned in favour of the new 
cognitive constellation associated with "function". This represents a significant 
break from the nineteenth-century typological tradition. 

Characterology thus defined allows for comparative procedures that give rise 
to a model of typology involving comparison of those functional properties of 
individual languages that are regarded as idiosyncratic (Mathesius 1928: 59). Of 
relevance here is the debate about the function of the grammatical Subject, of the 
Subject as agent, and of the thematic Subject in various European languages. By 
comparison with modern German and the Slavonic languages, modern English 
tends to make the grammatical Subject correspond to the thematic Subject, and 
shows a preference for the definite Subject over the indefinite. English tends more- 
over to keep the same Theme unaltered for relatively long portions of text, unlike 
German and Czech (Mathesius 1928: 62). The tendencies identified demonstrate 
the importance of synchronic relations in linguistics, in that they are correlated 
with other characteristic phenomena of English, such as an abundance of passive 
constructions, through which a thematic Subject may be maintained. The systemic 
(correlational) view is thus also present in characterology, but it is arrived at 
through historico-empirical and textual analytical procedures. The resulting mod- 
els are thus inevitably relative and provisional. 


4. Perspectives of current functionalist research 


Tracing the individual branches of European functionalism to the present day in 
any detail is beyond the scope of the present chapter. Such an undertaking would in 
any case be difficult, for a number a reasons. First of all, we would need to ask what 
it was, from the standpoint of the history of ideas, that actually linked the various 
lines of thought. While there certainly existed, from the end of the nineteenth cen- 
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tury and first decades of the twentieth, a nucleus of principles and methods char- 
acteristic of functionalism, it would be difficult to speak in terms of a single, uni- 
fied theory. One need only look at the way the conceptual nucleus has survived 
throughout different historical phases, shaped in various ways by the environments 
and personalities by which it has been adopted, as well as its application to fields as 
different as phonology, morphology, syntax, Indo-European linguistics, dialectol- 
ogy and sociolinguistics, and also to both synchrony and diachrony. A possible 
explanation for this adaptability is functionalism's fundamentally historico-em- 
piricist orientation. Indeed, one might extend to all of its manifestations Vardar's 
definition (1989: 52-53) of Martinet's functionalism: “cette linguistique soucieuse 
des réalités", whose methodological perspective provides for a "saisie réaliste des 
phénoménes", which excludes “tout apriorisme et tout logicisme ainsi que toute 
finalité métaphysique", and which “s’interroge sur les langues connues et non sur 
l'universalité des langues possibles, à partir de procedures de découverte qui don- 
nent le primat à l'observation et à l'interpretation correcte des faits". Another 
obvious reason for the applicability of functionalism to so many different spheres 
is the use of the term function to embrace a whole range of fundamental linguistic 
concepts. Whatever the explanation, the complexity of the picture does not facili- 
tate a linear examination of the history of functionalism in the last century or a con- 
cise summary of the current situation. A second difficulty lies in distinguishing 
functionalism from other theoretico-methodological approaches and areas of lin- 
guistic research. There is, indeed, the question of its relationship with structuralism, 
given that in many European research contexts of both the first and second halves 
of the twentieth century — the Prague Circle, the French groups influenced by Mar- 
tinet, the Dutch scholars of the journal Lingua, the London School of Firth and his 
students — functionalism and structuralism appear to be inextricably intertwined, to 
a point where they could be seen as two sides of the same coin. In the last decades 
of the twentieth century, when classical structuralism is in crisis and disciplines 
such as sociolinguistics and pragmatics come more sharply into focus, the ideas 
constituting the nucleus of functionalism are absorbed into the latter's armoury of 
principles and methods, offering further proof of their remarkable vitality. 

After the Second World War developments in functionalism in the various Euro- 
pean contexts mentioned above begin to acquire distinctive and independent char- 
acteristics, which would need to be analysed in light of the linguistic and cultural 
traditions of the countries concerned. Here we can sketch only their broad outlines. 

The British tradition represented by Malinowski, Gardiner and Firth has at- 
tributed particular importance to the notion of “context of situation". Firth's poly- 
systemic conception of language was developed by Halliday in his theory of struc- 
ture and system, which appeared first in the model of "scale and category 
grammar” and then in that of “systemic functional grammar". This second model 
has played an important part in clinical practice, in speech analysis, in pedagogic 
theory and in computational analysis of natural languages. 
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A conception of the linguistic system that is anti-logical and open to the world 
of empirical reality also characterises Dutch research in the post-war period, which 
becomes centred on the journal Lingua. Functionalist concerns with a psychologi- 
cal slant had already been expressed in the first decades of the twentieth century 
by the Dutch linguist Van Ginneken, professor at the Catholic University in 
Nijmegen, who held feeling to be “the innermost essence of language" (Noorde- 
graaf 1996). Refusal to conceive of the system as an abstract set of rules, enclosed 
within itself and disconnected from the rich extralinguistic universe, is a recurring 
leitmotif in various essays by Dutch linguists of the post-war years. Reichling 
([1947] 1948: 22) defines the system as "the whole of ordered impulses [...] ac- 
cording to which we act in the use and building of our language products", a view 
clearly inspired by pragmatic principles. It thus “has not the absolute validity of a 
logically constructed system, it only has the value of a practical rule of treatment" 
([1947] 1948: 22). An interest in speech and utterance is also prominent in articles 
published in Lingua in the first decades after the war. Criticising the Saussurian 
idea whereby language is an object of study that may be directly accessed in the 
consciousness of speakers, Reichling ([1947] 1948: 15-16 passim) observes that 
"the only positive datum at which the linguist can direct his research is speech 
itself, and the observation of the linguistic forms in the written texts", and that 
therefore “the system must be found inductively and that [sic] by a correct analysis 
of speech, of that which de Saussure calls ‘la parole". This viewpoint forms part of 
a broader conception that tends to reduce the role of psychology within linguistics. 
The importance of considering utterances in speech and of their analysis in relation 
to semantics and context is underlined by Uhlenbeck (1963: 17), who in this con- 
nection articulates from an anti-logicist standpoint one of the objections to the 
Chomsky of Syntactic Structures. Evident in these positions is a continuity with the 
anti-psychologism and anti-logicism of the functionalist thinking of the first dec- 
ades of the twentieth century. Another functionalist theme reappears in the interest 
shown by De Groot (co-editor of Lingua with Reichling and Uhlenbeck) in reflec- 
tions of subjectivity in linguistic structure and use. Also characteristic of the Dutch 
context is an interest in grammatical formalisation, which is to be developed, it is 
argued, along different lines from those of generative grammar (which are useful 
rather for constructing a model for electronic engineers), in such a way that the 
characteristics of natural languages are respected (see Reichling 1961: 17). This 
requirement was the point of departure for the functionally based formalising ap- 
proach of Simon Dik. 

In France, an important part has been played by the Société Internationale de 
Linguistique Fonctionnelle (SILF), founded in 1974, in which André Martinet had 
a central role as both scholar and organiser. The explicit aim of the Société has 
been to promote regular exchanges between linguists who have “la méme concep- 
tion de l'étude du fonctionnement des langues", founded on “une analyse fonction- 
nelle des langues dans leur développement et leur continuel changement dus aux 
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besoins de la communication liés à l'évolution de la société dont elles sont le véhi- 
cule vivant" (Statute of the Société). Martinet's many students and followers have 
produced works relating to a wide range of fields and subjects, including phonol- 
ogy, morphology, syntax, contact between languages, and the problems of bi- and 
multilingualism.The research of Tabouret-Keller, one of the presidents of the So- 
ciété, and of her group of collaborators represents an important step forward, at 
both the scholarly and the political level, in the study of multilingualism in Europe. 

Is it possible, at the beginning of the twenty-first century, to see what connects 
the different traditions of functionalism that have developed since the Second 
World War, besides a common if variously manifested debt to the old nucleus of 
functionalist ideas? A connection may perhaps be discerned in a general anti-re- 
ductionist approach, which may be seen to lie behind certain theoretico-methodo- 
logical choices in the construction of linguistic models. Some of these choices 
have long histories, continuing the old functionalist tradition, examples including 
the empirical commitment to the observation of phenomena, the idea that grammar 
should be a dynamic representation of the collective linguistic abilities of the 
speakers of a language, and the assumption that grammar is not a modular system — 
that is, a collection of autonomous components — but an integrated system of inter- 
acting parts. Others have developed in relation to theories by which linguistics has 
been influenced in the last few decades, an example being the prototypical, non- 
discrete conception of linguistic categories, developed in relation to the theory of 
prototypes. A marked interest in metaphorical lexical developments and the pro- 
cesses of grammaticalisation may also be attributed to a more recent period. 

What is clear is that the fundamental principles of the early twentieth century 
have been reabsorbed and reoriented within new formulations, which need to be 
analysed in terms of the dualism of formal and functional paradigms that emerged 
in the second half of the twentieth century. Within this new framework for re- 
search, functionalism has increasingly come to be defined as the antithesis of a col- 
lection of theoretico-methodological conceptions determined above all by the 
American structuralist tradition and subsequently by generative grammar, such as 
the idea of the independence of grammatical levels and that of the arbitrariness and 
discreteness of linguistic categories, and the representations of grammar as a static 
system and of diachrony as a sequence of synchronic states. The fact that func- 
tionalism has been contraposed to conceptions characteristic of generative gram- 
mar would seem to account especially for its redefinition as an inductive paradigm 
that explicitly rejects the dichotomy between grammatical knowledge (compet- 
ence) and use ability (performance) and is oriented towards the quest for cognitive 
universals that are more general and more profound than those of linguistics. These 
characteristics are indeed defined in polar opposition to those of the formal para- 
digm of generative grammar, which is deductive, explicitly embraces the compet- 
ence/performance dichotomy, and is oriented towards the study of a universal 
grammar. 
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In some of its new manifestations, such as the theory of linguistic prototypes and 
the quest for cognitive universals that determine linguistic universals, as indeed in 
the cognitivist reformulation of old ideas about metaphorical developments of the 
lexicon, and in the recourse to the theory of iconism, functionalism has moved a long 
way away from the anti-psychologistic and anti-metaphysical stance of the func- 
tional structuralism of the early twentieth century. Looking beyond theoretico-me- 
thodological choices, it is possible to discern in many European functionalist trends 
of our own day a surviving interest in understanding society and history and a sense 
of social responsibility. All these concerns and values are characteristic of early 
twentieth-century Europe and form part of the common heritage of the Old World. 


Note 


1 At the same time, Mathesius does not altogether rule out recourse to psychology in lin- 
guistics, but regards the former as a tool that is subordinate or auxiliary to the latter. 
He was critical of Paul's definition of the sentence (see Paul [1880] 1920: 121), consider- 
ing it to be too oriented towards the mental processes from which the sentence emerges 
(see Mathesius 1924: 2). 
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1. How it started 


One of the fascinating issues in the development of a field is how new ideas spread.! 
A naive observer might attribute the success of a scientific approach just to the 
persuasive character of its results. However, whether we like it or not, factors 
extraneous to such an inherent approach play an important role as well. They vary 
from the personal qualities of the protagonists to the social and political conditions 
in the societies in which itis received. The history of generative grammar? in Europe 
provides intriguing illustrations of the role played by such extraneous conditions. 

Generative grammar is first and foremost linked to Noam Chomsky, whose 
ideas on the study of language shaped the field. Chomsky’s ideas were made more 
widely accessible for the first time in his 1956 paper Three models for the Descrip- 
tion of Language. This paper was followed by his 1957 book Syntactic Structures, 
which made an impact on a far broader audience. The question is then, when and 
how his ideas came to be known. One can establish that, for instance, in the Soviet 
Union, Czechoslovakia, the German Democratic Republic, the Federal Republic of 
Germany, and The Netherlands, Chomsky’s program started to receive attention 
fairly soon. 

In this contribution we will sketch the development of generative grammar in 
Europe in the following way. In section 2 we will give a detailed case study of the 
Netherlands and we will sketch the rise of generative grammar in the countries just 
mentioned, and give some brief sketches of generative theorizing in some other 
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European countries. In section 3 we will describe the role that the society GLOW 
played in the development of generative grammar in Europe, and we conclude the 
paper with a few remarks on subsequent developments. 


De The rise of generative grammar (late 1950s until early 1970): 
Some case studies 


2.1. The Netherlands 


In the Netherlands Chomsky’s ideas drew attention almost immediately. Already in 
July 1957, H. Schultink (then working in Leiden) wrote a review of Syntactic 
Structures in the Nieuwe Rotterdamsche Courant, one of the leading Dutch news- 
papers, in which he brought out the merits he saw in Chomsky’s approach (see 
Trommelen 2008). As Trommelen notes, in his positive appraisal Schultink was 
virtually alone (as a linguist) at that time. 

Dutch general linguistics was essentially dominated by the linguistics chairs in 
the universities of Leiden, Amsterdam and Utrecht, and to a lesser extent by the 
chairs in Dutch linguistics at all six universities. The chairs in Amsterdam and 
Leiden were held by A. Reichling and E.M. Uhlenbeck, respectively. Uhlenbeck 
had also been Schultink’s thesis supervisor. The chair in Utrecht was held by A.W. 
de Groot. He worked in the tradition of the Prague School structural linguistics. 
Reichling and Uhlenbeck also worked in the tradition of structuralism, but had de- 
veloped more personal approaches. Both immediately became open opponents to 
Chomsky’s approach. De Groot was critical, too, but did not enter the academic de- 
bate to the same extent. The majority of the chairs of Dutch linguistics were 
negative in their appraisal of generative grammar as well, as were the chairs in the 
other language departments. Given the fact that in the Dutch academic system only 
full professors have the right to approve of a dissertation as a step for granting the 
doctorate, no new direction in a field can become academically established without 
at least one full professor being willing to carry the responsibility. 

Important for the further developments was that Reichling took serious the 
issue of teaching generative grammar. His criticism was published in Reichling 
(1961), coining the term ‘cryptanalytical formalism’ to express his negative ap- 
praisal of formal theories. Despite, or perhaps because of, his criticism, he awa- 
kened the interest of the students in his audience. But even before these publi- 
cations came out, a group of students had formed a reading group studying 
Chomsky’s work, and started corresponding with Chomsky (Noordegraaf 2000). 
The group included A. Kraak, W.G. Klooster and P.A.M. Seuren. One of their 
activities was reading and re-typing Chomsky’s then still unpublished dissertation 
Logical structure of linguistic theory, at the time only available on micro-film. The 
project was never completed. 
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Uhlenbeck published his criticisms in his Lingua article of 1963. An effect of 
this was that Chomsky and Chomskyanism became a broader issue. Chomsky 
responded to both Reichling and Uhlenbeck. However, the impact of Chomsky's 
approach had already become apparent before these responses, not just to the small 
circle of linguists in the major centres in the Netherlands, but also in other univer- 
sities. Crucially, however, none of the members of the younger generation were in 
any position of formal power in the 1960's. There was no infra-structure in the 
form of conferences or accessible journals, no professional association to organize 
meetings, etc. It was very hard to carry out dissertation research in a generative 
framework and to find a thesis supervisor willing to take the responsibility of ad- 
mitting a generative thesis to public defence. 

The first generative thesis in the Netherlands was defended by Kraak. The 
events surrounding Kraak's defence highly contributed to the politicisation of this 
clash of paradigms. Its subject was a generative analysis of negative sentences. The 
intended thesis supervisor was the Professor of Dutch Linguistics (Hellinga) but he 
declared himself unwilling to continue the supervision. This responsibility was 
then taken over by the philosopher J.F. Staal, who held one of the philosophy chairs 
in Amsterdam. Staal was well acquainted with Chomsky's work, and convinced of 
its importance (e.g. Staal 1962). He had been involved in the Euratom project led 
by the logician E.W. Beth (1908-1964), aimed at constructing logical machines for 
proof automatisation (see Van Ulsen 2000). Beth was very much interested in the 
connection between language and (mathematical) logic that Chomsky explored, 
and expressed that explicitly in the Annals of the Royal Academy and in a letter to 
Reichling. The project had a linguistic component based on Chomsky's work on 
formal languages. It is interesting to see that for many the formal aspect of Choms- 
ky’s work provided the initial attraction. 

Because of his willingness to accept generative grammar, many linguists con- 
sidered Staal a key factor in the future of Dutch linguistics. However, soon Staal 
left Amsterdam to become a visiting professor at MIT, later accepting a position as 
Professor of Philosophy and South Asian languages in Berkeley, thereby ending 
his role in the Netherlands at this stage of the history. 

In view of this situation it may seem highly surprising that only a few years 
later the situation drastically changed. How did this come about? There is a saying, 
quoted in many forms, that old theories don't die, just their proponents do. In this 
case, the proponents retired. Successors had to be appointed, and that within a 
relatively short period of time. 

De Groot retired in 1962 and was succeeded by Schultink. Schultink's role 
turned out to be crucial. Although in his own 1962 dissertation he had not yet 
adopted a generative perspective, he subsequently pursued the line set out in his 
1957 review of Syntactic Structures. In 1967 he published an introduction to gen- 
erative grammar in De Nieuwe Taalgids, a journal widely read in the community of 
scholars in Dutch language and linguistics. Even more importantly, being a full 
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professor, it was to him that the new generation of linguists turned. In 1968 the 
South African linguist R.P. Botha defended a dissertation on generative grammar 
from a methodological perspective, in 1969 Seuren defended his dissertation 
Operators and Nucleus. A contribution to the theory of grammar. Both disser- 
tations had a significant impact on the general perception of generative grammar as 
a well-established grammatical theory. In 1971 Klooster and H.J. Verkuyl de- 
fended their dissertations with Schultink as thesis supervisor. Both became profes- 
sors of Dutch linguistics, in Amsterdam (1972) and Utrecht (1979), respectively. 
These were the first of a series of important dissertations in generative linguistics 
to be prepared and defended under Schultink's supervision. 

In a similar way the situation in Amsterdam underwent a significant change. 
Reichling retired in 1968. The events surrounding his succession provide an illus- 
tration of the extent in which linguistic issues had become politicised. Reichling's 
envisaged successor was S.C. Dik, who had just defended his dissertation. Dik's 
dissertation contained extensive criticisms of generative grammar and placed him 
in the functionalist tradition, setting out to develop a functional alternative to gen- 
erative grammar. In his style of doing linguistics he was far closer to the generative 
linguists of his time than to his teacher Reichling. But he was perceived as belong- 
ing to the *old order'. The other candidate was Seuren, who was seen as being 
clearly in the generative ‘camp’. The conflicts around this selection procedure ran 
so high that the polemics carried over to the general press, with a well-known 
weekly even taking sides with the progressive candidate Seuren. In any case, lin- 
guistics never had a higher public profile than in this period. Eventually, Dik was 
appointed. Although firmly committed to a functional perspective he turned out to 
be very open-minded. For example, in 1978 Dik was supervisor of Henk van 
Riemsdijk's dissertation and together with the generativist Jan Kooij he wrote an 
influential introduction to general linguistics, Beginselen van de Algemene Taalwe- 
tenschap (1970). 

Two further crucial appointments took place in Nijmegen and Leiden. The 
chair of general linguistics in Nijmegen became vacant in 1970 and Kraak was ap- 
pointed, becoming the first member of the new 'generative' generation to hold a 
chair. In 1972 J.G. Kooij was appointed to the Leiden chair of Dutch linguistics, 
and later became professor of general linguistics. Kooij was a pupil of Reichling's, 
but in his inaugural lecture declared himself a generativist. 

In Groningen, however, student and staff protests again directly influenced the 
course of events. Whereas many of the faculty members understood the necessity 
for change, the chairs of Dutch linguistics (Van Es) and general linguistics (Ver- 
burg) were opposed to any real concessions, which led to a student occupation of 
the building of the Faculty of Arts and a deadlock. In 1970 the university admin- 
istration decided to resolve the issue and appointed A. Sassen as professor of Dutch 
linguistics. Sassen had been trained as a dialectologist, and did not adhere to any 
theoretical school. He was open-minded, however, and saw the need for modern- 
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ization of the program. Hence he started to attract young promising staff members 
who had been trained in generative theory. Another source of change was the Ger- 
man department where in 1971 W. Abraham was appointed chair of German lin- 
guistics and set out to create a favourable climate for linguistic research. The lin- 
guistics department was virtually dormant until F.W. Heny was appointed in 1976. 

Thus, as of 1972 all universities in the Netherlands had at least one program in 
which generative grammar was offered, and at least one chair to turn to for a dis- 
sertation in the field of generative grammar. A Dutch textbook on generative gram- 
mar (Kraak and Klooster 1968) had been written. On the same footing also a re- 
search infrastructure was developed. In 1970 the Algemene Vereniging voor 
Taalwetenschap (General Association for Linguistics) was revived by Schultink, 
Dik and Kraak, and on the initiative of Dik annual meetings were organized to cre- 
ate a forum for all linguists in the Netherlands. The Dutch linguistics journals were 
open to generative contributions. As a consequence, the “emancipation” of gener- 
ative grammar in Dutch academia was complete. 


For the history of generative grammar in the Netherlands quite a few sources are 
available and easily accessible. We will now discuss the situation in the USSR, 
Czechoslovakia and Germany, where sources are more limited, or even scarce. 


2:9 USSR/Russia 


In Russia, generative grammar for a long time played a marginal role. Given the 
absence of noticeable activities through the eighties and nineties, one would think 
that generative grammar never really made it into the USSR in the early years. 
However, nothing would be further from the truth. Right from the start Chomsky's 
work drew the attention of S.K. Shaumyan, one of the USSR's most prominent lin- 
guists at the time. Influenced by Chomsky, Shaumyan developed his model of ap- 
plikativnaya grammatika, a formal generative model of natural language. His main 
works includes his Osnovaniya porozhdayushchey grammatiki Russkogo yazyka 
(Principles of a generative grammar of the Russian language), with P.A. Soboleva, 
Moscow 1968. Shaumyan did not work in a vacuum. He was working in the rich 
tradition of Russian structuralism. Furthermore, as reported by Segal (1974), in 
1956 a cooperation arose at Moscow university between a number of young lin- 
guists and a group of mathematicians (including I.I. Revzin). The aim was to study 
the possibility for applying linguistic research to machine translation and auto- 
matic data processing. The group studied works by Zellig S. Harris and Y. Bar-Hil- 
lel. Thus, as in the initial stages of the development of generative grammar in the 
Netherlands, the primary source of attraction and inspiration in the USSR was the 
formal character of generative grammar. 

Just like Chomsky's model, Shaumyan's applied model uses formalisms from 
mathematical logic. Shaumyan's approach differs in two important respects from 
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Chomsky's. First, Shaumyan is a typical representative of structuralism in that he 
emphasizes the independent character of the study of language. He does not com- 
mit himself to the interpretation of linguistics as a branch of cognitive psychology. 
His goal is to develop a two-stage model of natural language: "every generative 
grammar of a concrete language must be derived from universal generative 
grammar". Second, his model is based on a different type of formalism. Whereas 
Chomsky's fundamental idea of ‘generation’ is based on Emil Post's work on pro- 
duction systems in mathematical logic, Shaumyan uses the combinatorial logic of 
Haskell Curry. It is interesting that Shaumyan's conception of the relation between 
universal grammar and individual grammars is in fact quite close to the minimalist 
conception of grammar (Chomsky 1995). So, from the current perspective, the 
main contrast between Chomsky and Shaumyan resides in the conception of lin- 
guistics as a cognitive science. 

These differences notwithstanding, it is an interesting question why ultimately 
no version of generative grammar effectively spread. True, many researchers in the 
USSR worked in relative isolation. Chomsky's texts were written in English, and 
English was not widely mastered. However, according to the bibliography of Sgall 
(1967), Russian translations of Chomsky's early work were available. The most 
reasonable explanation for the slow acceptance of generative grammar lies in the 
political climate. The general climate was not conducive to conceptual innovation 
in general. It was more driven by application than by theory formation. Only those 
aspects of Chomsky's work were well received that fitted in best with the interests 
of the established researchers without requiring a paradigm shift. That is, the 
mathematical perspective was appreciated, rather than the psychological one. Fur- 
thermore, it may not be coincidental that during the period of the late sixties and 
early seventies, when in the Netherlands generative grammar became firmly estab- 
lished, the USSR went through a period of increased tensions with the West (the 
end of the Prague Spring by the invasion of Czechoslovakia in 1968). Chomsky's 
political views became more pronounced and publicized, and they were, of course, 
entirely unpalatable to the Soviet political leaders. This, no doubt, contributed to 
the fact that the gap between Soviet structural linguistics and generative linguistics 
never got really bridged. 

In the end things got resolved in the opposite way of what happened in the Ne- 
therlands: the key players left. In 1969 Shaumyan went on a visit to the University 
of Edinburgh for a year, and in 1975 he left the USSR for a position at Yale. In 1976 
LA. Mel'Cuk, who had also been a member of the Moscow group, emigrated to Ca- 
nada. Although Mel’ čuk did not adhere to generative grammar, he was certainly 
one of the forces that contributed to the attempts of opening up Soviet linguistics to 
what was going on in the outside world. It seems fair to say that with Shaumyan 
and Mel'éuk having left, generative grammar was prevented from further develop- 
ing. 
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2.3. Czechoslovakia 


The dominant paradigm in Czechoslovakia was structuralism in the Prague School 
tradition as it had been developed in the late twenties and during the thirties by a 
group of linguists including N. Trubetzkoy, R. Jakobson (who left Czechoslovakia 
in 1939), V. Mathesius, B. Havranek, and others (cf. Toman 1995; see chapter 48 
by Sornicola). When Chomsky’s ideas came to be known around the beginning of 
the sixties, they were initially received with interest. Just like in the USSR their 
main attraction was the application of the tools of mathematical logic to the de- 
scription of natural language. In 1966 a Czech translation of Syntactic Structures 
appeared. The main representatives of this new line of research were P. Sgall and 
E. Hajicová. In 1967 Sgall published his Generative description of language and 
the Czech declination. In this book he proposed his own elaboration of the frame- 
work of generative grammar, with a critical discussion of Chomsky’s work and em- 
phasis on the formal properties of language. Linguistics as a branch of cognitive 
psychology was regarded as controversial. But, crucially, Chomsky’s and Sgall’s 
perspectives were presented as part of the same enterprise. 

The Soviet invasion ending the Prague Spring had a crucial impact on this de- 
velopment. Contacts with the West became more difficult, but most importantly 
public intellectual activities and the room for debate became highly restricted. One 
may expect that the perception by the communist administration of Chomsky’s 
political work hardly contributed to the ‘popularity’ of his linguistic work in those 
circles. The conditions for Sgall and his co-workers to continue their work became 
very difficult. To justify their work they increasingly had to focus on the politically 
neutral mathematical aspects of their enterprise. They were no longer allowed 
to do their work in the philosophical faculty, the home of linguistics, but in their 
Laboratory of Algebraic Linguistics, which had been moved to an isolated position 
in the mathematical faculty. Sgall was no longer allowed to teach. Consequently, 
despite the initial interaction between Czech linguistics and generative grammar, 
developments came to a halt. In the mid eighties, the regime somewhat relaxed its 
grip and Sgall and his group could resume their work. 


2.4. Germany 


Writing about the early history of generative grammar in Germany comes with its 
own challenges. In the area of generative grammar, more went on in Germany in 
the sixties and early seventies than in the USSR. Although we cannot be absolutely 
sure about what went on in the USSR, we do know that it had little impact on sub- 
sequent events. In the case of Germany we are less confident that we can see or 
reconstruct all developments that were relevant. Furthermore, during the period 
under discussion Germany was divided into the Bundesrepublik Deutschland 
(BRD) and the Deutsche Demokratische Republik (DDR). Although there was 
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considerable mutual influence at a scientific level, interaction was difficult.? As one 
may expect, the precise way in which interaction took place deserves more atten- 
tion than can be given here. 

In general, compared with what is known about what went on in the Netherlands, 
it appears that not as much ‘preparatory’ work has been carried out. Although there 
is Work on the history of ideas in which generative grammar in Germany is being 
discussed (cf. Gardt 1999), it does not give the same kind of detail of the actual de- 
velopments, as is available in the sources available for the Netherlands. For in- 
stance, we have no information about the immediate reception of Chomsky (1957), 
neither in the Federal Republic of Germany (FRG) nor in the German Democratic 
Republic (GDR). 

There is a further issue of scale. Whereas the number of universities in the Ne- 
therlands is quite limited, the number of universities in Germany was around 30 in 
the FRG, and around 5 in the GDR. At this moment (2009) the total number of uni- 
versities in Germany is 114 (Statistisches Bundesamt Deutschland). Consequently, 
we will have to content ourselves with sketching the broader picture which our 
limited resources do allow us to reconstruct. 

In the available sources, the influence of generative grammar first becomes vis- 
ible in the work of M. Bierwisch at the Arbeitsstelle für Strukturelle Grammatik of 
the Deutsche Akademie der Wissenschaften in East Berlin, as witnessed by his in- 
fluential 1963 book. Subsequently, W. Motsch published a series of publications on 
adjectives (Motsch 1964). In the FRG H. Schnelle shows an early interest in 
Chomsky's work, initially inspired by Chomsky's formal work. In 1967 Schnelle 
was appointed Professor of Linguistics at the Technische Universität Berlin. The 
earliest work from a linguist in the FRG we have seen referred to is Baumgartner 
(1964). 

One may say that the work in the Arbeitsstelle für Strukturelle Grammatik was 
by a few years ahead of the work in the FRG. Moreover, Germany as a whole was 
ahead of the Netherlands. Of course, this statement does not take into account the 
difference in scale. What one can observe at the end of the sixties is a situation 
in Germany that is comparable in volume to the situation in the Netherlands, but, 
crucially, not in the proportion of the total volume of linguistic research. 

Just like in the Netherlands, the end of the sixties resulted in a confrontation 
between the younger generation and the established faculty (Boeder et al. 1998). 
This revolt included a revolt against the traditional philological conception of 
linguistics, in some universities. However, unlike in the Netherlands, the students’ 
revolt of the sixties did not deeply affect the power structure in the German univer- 
sities. As it is often remarked, in the Netherlands the professors kept their robes, 
but lost their power, in Germany it was the other way round. 

Another difference resides in the requirements for an academic career. The 
Dutch system has the doctorate as the main precondition. The German system has 
two degrees, the doctorate and the Habilitation. This entails that it takes longer be- 
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fore the qualifications for a senior faculty position are met. It is our impression that 
the two factors together contributed to a relative slow-down in development. 

Nevertheless, a movement went on its way to establish modern theoretical lin- 
guistics in the universities’ curricula.‘ In order to achieve this, it was important to 
create a forum. At the end of the sixties J. Bechert and K.H. Wagner founded an an- 
nual conference, the Linguistisches Kolloquium. Although primarily intended for 
the German research community, it was also open for participants from other coun- 
tries (and in fact it was occasionally organized abroad). Another important step in 
this direction was the textbook they wrote with Daniéle Clément and Wolf Thüm- 
mel (Bechert et al. 1970), which made it effectively possible to teach generative 
grammar in university curricula. Furthermore, in 1969 a German linguistics jour- 
nal was founded, Linguistische Berichte, not limited to, but certainly open for work 
in generative grammar. Its founding editors were Peter Hartmann and Arnim von 
Stechow, then of the University of Konstanz. The University of Konstanz was the 
first of a range of new universities to be established in the FRG. It was founded in 
1966, and one of the first universities, perhaps the first one, where modern lin- 
guistics got a prominent position.? 

At the turn of the decade, generative grammar appeared to be ready for increas- 
ing its role in Germany. The old curricula were under pressure, and there was a 
thriving linguistics community that had the necessary fora. Yet, as compared to the 
Netherlands, developments appear to have to come to a halt at the beginning of the 
seventies. If one considers the programs of major linguistics conferences as an in- 
dicator (for instance, GLOW to be discussed in the next section), contributions 
from Germany stayed far behind contributions from other countries. The question 
1s then, what caused this. There are the institutional factors mentioned above, but 
also a rather different factor played a role. 

High expectations had developed concerning the possible role of generative 
grammar in language teaching. With hindsight one can say that these expectations 
were overly optimistic, and that there was little theoretical reason to entertain 
them. And, indeed, soon this approach to foreign language instruction met with 
fundamental criticism. Interestingly, this criticism did not only come from the 
older generation, but also from within the younger generation itself. More than in 
the Netherlands, linguists in Germany were influenced by the critical theory of the 
Frankfurter Schule. Their credo was that theories should not just describe some 
part of reality, but be used to change it. It is from this perspective that what was 
perceived as the failure of generative grammar to deliver became highlighted. Utz 
Maas (1973: 8) refers to the “Skandal einer Linguistik, die blind für die gesell- 
schaftlichen Aufgaben ist, und doch dem gesellschaftlichen Aufgabenbereich 'Er- 
ziehungssektor’ aufgezwungen wird” (‘the scandal of a linguistics that is blind 
to societal demands and yet is forced upon the educational sector’). This criticism 
is elaborated in Maas and Wunderlich (1974). It is under the influence of this criti- 
cism that generative grammar in Germany lost its status of being the hallmark of 
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“young and progressive", and was superseded in that role by forms of linguistics 
inspired by critical theory. 

In order to understand the subsequent developments, one must distinguish be- 
tween generative grammar, on the one hand, as a program demanding explicitness 
and the use of formal methods in linguistics and, on the other hand, as a program 
providing a particular conception of its object of inquiry. It seems fair to say that in 
its former capacity it influenced linguistics in Germany, also in the later seventies 
and eighties. In its latter capacity generative grammar did not take off. In a sense, 
“critical linguistics” took over that role, leaving behind what one may call a “scep- 
tical generation" when critical linguistics eventually subsided. 

Thus, many linguists of the generation of the sixties took routes leading away 
from what was perceived as controversial in mainstream generative grammar. 
Dieter Wunderlich, who gave an enthusiastic report on new developments in gen- 
erative semantics at the Regensburg Kolloqium, received a professorship for Ger- 
manic Linguistics at the Freie Universität Berlin in 1970, and in 1973 the chair 
in General Linguistics of the Heinrich-Heine-Universität in Düsseldorf. But his 
important work in the seventies and eighties mostly involved issues in discourse 
theory and lexical decomposition. Ekkehard Kónig, appointed chair at the English 
department of the Universitàt Hannover in 1973, initially worked within main- 
stream generative grammar, too. But his important contributions to linguistics are 
in linguistic typology, to which he switched his attention. Arnim von Stechow, who 
was appointed chair of the linguistics department of the University of Konstanz in 
1972 (and later moved to Tübingen), started working on formal semantics and its 
interfaces with syntax, where his work became extremely influential.6 

It is only after the Wende, when the universities in the former GDR were re- 
structured and revitalized, that a significant number of positions became available 
to be filled by linguists that come after this "sceptical generation", but that leads us 
beyond this sketch of how it started. 


2.5. Some observations on other European countries 


As explained above, it will not be possible to give a thorough description of devel- 
opments in all European countries. In this section we make some remarks on 
France, Hungary, Italy, Portugal, and Spain, primarily based on reports in the 
GLOW newsletters, supplemented with observations from Graffi (1990), Haider, 
Prinzhorn and Van Riemsdijk (1987) and Pica and Rooryck (1994). 


2.5.1. France 


In France, generative grammar became first institutionally visible in Paris, es- 
pecially at the Université de Vincennes à Saint-Denis (Paris VIII). In this univer- 
sity, founded in 1969 as a direct response to the student revolt of May 1968, a small 
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group of linguists established a program of general linguistics which was very dif- 
ferent from what was available elsewhere. As Pica and Rooryck (1994) write, this 
group, which included M. Gross, J. Dubois and J.-Cl. Chevalier, was concentrated 
around N. Ruwet, who had been a visiting scholar at MIT in 1966. In the same 
period a small group of generativists was also formed at the University of Paris X 
(Nanterre) around J. Dubois, J.-C. Milner, A. Delaveau and F. Kerleroux. 

Ruwet and Milner deserve to be mentioned separately. In his linguistic career 
Ruwet was influenced by the work of Jakobson, and later by Chomsky. Much of his 
early influence in France came through his book Introduction à la grammaire gén- 
érative, which was published in 1967. Milner did part of his studies at MIT, which 
led to his French translation of Chomky's Aspects in 1971, contributing to the re- 
ception of this theory in France. 

The Vincennes program was strengthened by new staff members, including R. 
Kayne, R. Carter, A. Rouveret, and F. Dell, and in the English Department by J. 
Guéron. At the Université Paris VII-Denis Diderot, generative grammar also de- 
veloped in the research group directed by Milner, including linguists like H. Huot, 
C. Dobrovie-Sorin, D. Godard, and J. Lowenstamm. Later CNRS research groups 
were established devoted to generative grammar (among others under the direction 
of H.-G. Obenauer). 

Pica and Rooryck (1994) comment on the present situation of Generative 
Grammar in France when they state that “most French generativists who were 
often part of the elite (most of them were alumni of École Polytechnique or of the 
École Normale Supérieure) never really succeeded in spreading (or were even not 
willing to spread) the main ideas of the field to the vast majority of institution[s], 
such as the universities located outside Paris, the French C.N.R.S, or more ‘periph- 
eral’ institutions, such as ‘l’École Pratique des Hautes Etudes’. Not surprisingly, 
generative grammar is barely represented in these institutions.” 


2.5.2. Hungary 


According to A. Szabolsci (1985), in the early sixties younger linguists in Hungary 
were very much interested in new ways of approaching linguistics, given that the 
university curricula were strongly dominated by descriptive, historical-comparative 
work. The publication of the journal Altaldnos Nyelvészetio Tanulmányok (ANyT; 
‘Studies in general linguistics’) gave room to these new approaches, including gen- 
erative grammar. A group headed by Zs. Telegdi and S. Károly worked on a Hun- 
garian grammar project, from which the results were published in a 1969 volume in 
ÄNYT. Several linguists, among them Gy. Szépe and E Kiefer, visited MIT, which 
resulted in generatively oriented publications. It seems that, like in Germany, the in- 
terest for generative linguistics faded away. According to Szabolsci, the reason for 
that may be that the prevailing analyses in generative grammar were based on Eng- 
lish and therefore, did not at first hand seem to offer many tools for the analysis of 
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Hungarian, “a language at least superficially vastly different". The fact that hardly 
any 'technical literature had been translated into Hungarian also didn't help. 

Two projects deserve to be mentioned, though: the Mother-tongue Education 
project (led by Gy. Szépe, 1973-1978) and the Hungarian Syntax Project (led by 
F. Kiefer, 1979—1990), both projects in the institute of linguistics of the Hungarian 
Academy of Sciences. They brought forth a younger and very influential gener- 
ation of generative linguists, such Z. Bánréti, I. Kenesei, A. Komlósy, K.E Kiss 
and A. Szabolsci. 


2.5.3. Italy 


The academic establishment in Italy was generally hostile to generative linguistics 
(or to any kind of theoretical linguistics). The linguistics that was taught until the 
early sixties, under the name of “glottologia”, was mainly historical-comparative, 
and mainly done in the language departments. While in other countries struc- 
turalism was an obstacle to generative grammar, in Italy the interest for struc- 
turalism might have prepared the way for generative grammar (Graffi 1990): the 
period 1967—1971 saw (first) translations of the works by de Saussure, Hjelmslev, 
Sapir and Chomsky. 

At the end of the 1960s, the only real group of generative linguists was to be 
found at the Institute of Psychology of the Italian Research Council (Consiglio Na- 
zionale delle Ricerche) in Rome; this group was formed by some young re- 
searchers led by D. Parisi. These researchers associated themselves with A. Pug- 
lielli, who graduated in Rome and obtained a PhD. at Cornell University. This 
group was very active, and in 1969 organized a conference explicitly devoted to 
“Italian transformational grammar”. Later, other small groups of young scholars 
interested in generative grammar began to gather in other places as well: Padua 
(Renzi), the Scuola Normale Superiore of Pisa (Stussi). But as Graffi (1990) ob- 
serves, despite the presence of many Italian linguists in the generative enterprise, 
generative grammar in Italy is still not particularly strong, or well-established. 


2.5.4. Spain 


Demonte, García-Billido, Mascaró (1982) observe that in Spain the civil war of 
1936-1939 marks the end of an important philological and neogrammarian tradi- 
tion. Spanish universities and other research institutions became strongly centra- 
lised and bureaucratised. It meant that, with some exceptions, appointments and 
curricula were controlled by the Ministry of Education. Only very few scholars 
worked within generative grammar, and most students interested in generative 
grammar looked abroad (especially to the US) for their education. 

Cultural and linguistic oppression might have had a positive effect on the 
development of generative grammar in Catalonia. Demonte et al. claim that in the 
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late sixties/early seventies students at the University of Barcelona got interested in 
generative grammar and that Gabriel Ferrater (mainly known as a poet; he died in 
1972) played an important role in its developments. Although he did not have a 
university degree “he was accepted by the academic system and began teaching 
courses in both Catalan universities, more or less at the same time that most of the 
new group of generative grammarians were absorbed by the recently [1968] cre- 
ated Universidad Autonoma." [1982:12] 

In 1968, a diverse group of students in Madrid participated in a seminar *Math- 
ematical Linguistics’ of the Universidad Complutense, organized by the philos- 
opher S. de Zavala. He was interested in the development of formal grammars for 
natural languages (like what was described for Russia, Czechoslovakia and the Ne- 
therlands). The main purpose of this seminar was “to discuss current issues in gen- 
erative grammar." These workshops were active for almost four years and later a 
new seminar was organized, but “these sporadic and isolated attempts to keep up 
with current developments of the theory were clearly insufficient because the activ- 
ities were not integrated in the general educational or research system." [1982: 13] 


3. Subsequent developments (mid-70s and later) 


3:1: GLOW 


The spread of generative linguistics in Europe, and the role of European linguists 
in the generative research programme was very much strengthened by the initiative 
of H. van Riemsdijk to start a series of conferences. The conferences were organ- 
ized by GLOW - Generative Linguists in the Old World — an international organ- 
ization founded in 1977 and based in Europe. Its goal was to “to further the study of 
generative grammar". It did so by organizing an annual conference, later by organ- 
izing, or funding, periodical summer schools and publishing a newsletter. 

The idea to start GLOW was a result of Van Riemsdijk's organization of the 
conference Green ideas blown up in Amsterdam in 1976. During this conference 
many young linguists from the Netherlands, France, Italy and the US discussed the 
latest developments in generative grammar. The next year saw another conference 
held in Amsterdam, and the first official GLOW conference took place in 1978, and 
was, again, organized at the University of Amsterdam. After that, the conference 
travelled around Europe, with conferences in cities such as Pisa, Góttingen, Paris, 
Copenhagen, Brussels, Girona/Barcelona, Venice, Budapest, Cambridge/London, 
Leiden, Lisbon, Lund, Vienna, Athens, Berlin/Potsdam, Vitoria-Gasteiz/Bilbao, 
Amsterdam/Utrecht, Geneva, Tromsg, and Wroctaw (in 2010). 

The conferences differed from the conferences in the US in that all selected 
speakers were allowed ample time to address major theoretical issues, speakers 
were partially reimbursed, and no proceedings were published. It meant that only 
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around twenty talks could be selected. Many of the talks later appeared as articles 
in high ranking journals. It resulted in a prestigious conference attracting many 
young scholars from the US, offering a perfect platform for the development of 
generative linguistics in Europe. 

One conference stands out, the 1979 conference in Pisa, organized by Luigi 
Rizzi, because it was here that Noam Chomsky, connected to the GLOW confer- 
ence, gave a number of lectures known as the “Pisa lectures". These lectures were 
the basis for the book Lectures on Government and Binding, one of the most in- 
fluential books in linguistics written by Chomsky and published in 1981 by a Euro- 
pean publisher, Foris Publications. 

In 1978 the first GLOW newsletter was published. It was an important publi- 
cation because it defined what GLOW stood for. As an organization GLOW aimed 
for internal intellectual cohesion rather than wide coverage. In order to define an 
‘identity’ a text was circulated trying to make that explicit. The text became known 
as the GLOW manifesto, published in the first newsletter. The manifesto was 
signed by Jan Koster, Henk van Riemsdijk and Jean-Roger Vergnaud. The text rep- 
resented a nucleus of ideas around which the members of GLOW gravitate (quotes 
below are from H. Bennis and M. van Oostendorp 2002:32). 


“The theory of language is that part of linguistics that is concerned with one specific 
mental organ, human language. The growth of the latter organ, language learning, falls 
strictly within biologically determined cognitive capacities. It appears quite likely that 
the system of mechanisms and principles put to work in the acquisition of the knowl- 
edge of language will turn out to be a highly specific "language faculty". Stimulated by 
appropriate and continuing experience, the language faculty creates a grammar that 
determines the formal and semantic properties of sentences. The grammar is put to use, 
interacting with other mechanisms of mind, in speaking and understanding language. 
The faculty of language, which is common to the species, provides a sensory system for 
the preliminary analysis of linguistic data, and a schematism, "Universal Grammar", 
that determines, quite narrowly, a certain class of grammars. Thus, Universal Grammar 
(UG) is the system of principles, conditions, and rules that are elements or properties of 
all human languages not merely by accident, but by necessity." 


The quotes make clear what the authors take as central to generative grammar. But 
the manifesto also contains passages that specify their position in a debate between 
different branches of generative grammar, setting Extended Standard Theory apart 
from generative semantics (cf. also section 3.2): "From the beginning, generative 
linguists have implicitly or explicitly assumed the thesis of autonomy of formal 
grammar — the thesis that the language faculty constructs an abstract formal skel- 
eton invested with meaning by interpretive rules — and have taken for granted the 
necessity of limiting the scope of inquiry to aspects of formal grammar. [...] In Eu- 
rope, as elsewhere, much critical discussion was concentrated on the alleged “ne- 
glect" of meaning and language use, and not on the new goals, such as narrowing 
of the class of possible human grammars. [...] Scope of description is an entirely 
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irrelevant issue as long as the considered theories are without content, i.e., when 
the rules are unconstrained and do not forbid anything whatsoever. The descriptive 
means of early transformational grammar were so rich in expressive power that it 
was all too easy to go beyond the original limits. The expansion of the scope of lin- 
guistics theory in the hands of generative semanticists was simply an expression of 
a lack of content of the proposed hypotheses." (2002: 35—36) 

Some colleagues reacted. The same newsletter presented the reaction of 
Jean-Claude Milner (translation from French http://glow.uit.no/index.php?page= 
milner#glow). Milner accepts that “Linguistic theory must be interpreted in ‘real- 
istic’ terms; i.e., there is a reality of the language faculty and languages which 
linguistic theory aims to express adequately." However he does not accept that 
"the reality of the language faculty and languages has essential psychological sub- 
stance" and that *a psychological reality is a specifiable state of a mental organ." 
Itis clear that Milner considers himself a generative linguist without accepting that 
"the principles that determine the class of possible human grammars are a geneti- 
cally based property of the human species". 


3.2; Generative grammarians in Europe 


One might ask to what extent the developments in Europe influenced generative 
grammar as a whole. The preceding chapter already illustrated this. But a few more 
things can be said in answering this question. 

There is one external factor that was important for the development of gener- 
ative grammar: European publishers. Chomsky's first books were published by 
Mouton, now Mouton de Gruyter, and one of his most influential books in lin- 
guistics, Lectures on Government and Binding, was published by a small Dutch 
publisher, Foris, which later became part of Mouton de Gruyter. Moreover, many 
leading journals in linguistics are published by European-based major publishers 
such as Mouton de Gruyter (Linguistics, Theoretical Linguistics, The Linguistic 
Review, Probus), Cambridge University Press (Journal of Linguistics, Phonology), 
Oxford University Press (Journal of Semantics), Springer (The Journal of Com- 
parative Germanic Linguistics, Linguistics & Philosophy, Natural Language & 
Linguistic Theory, Natural Language Semantics), Elsevier (Lingua). Generative 
grammarians succeeded in being strongly represented in these journals, and 
linguists from Europe played an important role in the editorial policy of these 
journals. 

But much more important is the work of many European linguists that con- 
tributed to the “generative enterprise’: “There is a far more material that I read with 
interest than was ever true in the past. Most of it comes from Europe, in fact, ...” 
Chomsky (1982: 69). One can safely say that linguists who were educated in Eu- 
rope or who at some point in their career worked in Europe in those years had a 
profound influence on the development of a great variety of subjects, varying from: 
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anaphora, argument structure, case theory, clitics, conditions on transformations/ 
islands, diachronic linguistics, functional projections, levels of interpretation, 
micro-variation, morphology-syntax interface, parameter theory to word order, 
verb movement or government phonology, to name a few significant topics from 
the 1970s and 80s. 


3.3. LFG/HPSG 


From early on, there have been competing frameworks within generative grammar, 
mainly positioning themselves against the (at that time) current ideas of Chomsky, 
such as generative semantics (Lakoff), Relational grammar (Perlmutter) or Arc- 
pair grammar (Johnson and Postal). With the exception of generative semantics 
these frameworks have not been widespread in Europe. The late seventies showed 
a more sustained proliferation of frameworks, most notable Generalized Phrase 
Structure Grammar (Gazdar), later developing into Head-driven Phrase Structure 
Grammar (HPSG, Pollard and Sag) and Lexical Functional Grammar (LFG, Bres- 
nan). So, from the early eighties on there were three competing frameworks in gen- 
erative grammar: Government-Binding theory (later replaced by the Principles 
and Parameter theory and the Minimalist Program), HPSG, originating in the UK, 
and LFG. 

If one looks at the executive committee of the "International Lexical-Func- 
tional Grammar Association', European linguists play a prominent role: Nigel 
Vincent (Manchester) is presently the chair and there are two more members from 
Europe, Miriam Butt (Konstanz) and Helge Lødrup (Oslo). In 2009 the 14th Inter- 
national LFG conference was held at Cambridge University (with earlier confer- 
ences held in Manchester, Athens, Bergen, and Konstanz). 

For HPSG a similar situation presents itself: active research groups, often 
linked to computational linguistics, are found in Germany (Gottingen, Heidelberg, 
Saarbrücken, Tübingen, a.o.), France (Paris), the UK (Edinburgh, King's College 
London, SOAS), Portugal (Lisbon), the Netherlands (Groningen), Belgium 
(Leuven). In 2009 the 16th HPSG International Conference was organised at the 
University of Góttingen (with earlier conferences in Trondheim, Leuven, Lisbon, 
Sofia/Varna). 


3.4. Generative grammar in the 21st century 


What is the role of generative grammarians from the various European countries 
nowadays? It is, of course, very difficult to assess this, but to give some sort of in- 
dication we took a look at the programs of the GLOW conferences and the annual 
LFG and HPSG conferences, counted the number of contributors and looked at 
their affiliations (not their nationality). This gives some indication of the countries 
with activities in the field. The results are summarized in the following table: 


Table 1: 
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National affiliations of contributors to GLOW, LFG and HPSG conferences 
1978-2008 


GLOW 1978-2002 


LFG 1996-2008 


HPSG 1978-2002 


Percentage of 
contributors from 
Europe: 


No or less than 1 % of 
the abstract 
contributors from: 


Approx. in between 
1 and 5 % of the 
abstract contributors 
from: 


Approx. in between 
5 and 10% of the 
abstract contributors 
from: 


Approx. in between 
10 and 15 % of the 
abstract contributors 
from Europe 


More than 15 % of 


the abstract 
contributors from: 


50 96 


Austria, Belgium, 
Bulgaria, Cyprus, 
Czech Republic, 
Denmark, Estonia, 
Finland, Iceland, 
Ireland, Romania, 
Russia, Poland 


Greece, Hungary, 
Portugal, Sweden 


Germany, Norway, 
Spain, Switzerland 


France, Italy, United 
Kingdom 


The Netherlands 
(35 %) 


37 96 


Austria, Belgium, 
Bulgaria, Cyprus, 
Czech Republic, 
Estonia, Finland, 
Greece, Iceland, 
Poland, Portugal, 
Romania, Russia, 
Spain, Sweden, 
Switzerland 


Denmark, France, 
Hungary, Italy, 
The Netherlands 


Norway 


Germany (25 96), 
Ireland (20 96), 
United Kingdom 
(25 96) 


56 % 


Austria, Cyprus, 
Czech Republic, 
Estonia, Finland, 
Greece, Hungary, 
Iceland, Ireland, Italy, 
Romania, Russia, 
Spain, Switzerland 


Bulgaria, Denmark, 
The Netherlands, 
Norway, Poland, 
Portugal, Sweden 


France (26 96), 
Germany (34 96), 
United Kingdom 
(18 96) 


One can observe that linguists from Europe filled a fair share of the conference 
programs (perhaps with the exception of LFG). A few Western European countries 
are (almost) absent from this list: Austria, Cyprus, Finland, Iceland, Romania. But 
still Austria does have active research groups in Salzburg and Vienna, and Reykja- 
vik is a flourishing centre of generative grammar in Iceland. For some countries the 
frameworks seem not to be equally represented (taking into account that the peri- 
ods are unequal): in Germany the conferences see more LFG- and HPSG-contribu- 
tors, in the Netherlands these frameworks are relatively underrepresented, while in 
Ireland LFG is clearly overrepresented. 
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In Eastern Europe much has changed over the last few decades. In Russia the 
interest in generative grammar has been growing with increasingly active centres 
both in Moscow and St. Petersburg. Poland has the Generative Linguistics in 
Poland (GLiP) meetings, and many activities around them. Romania has annual 
meetings and activities as well. Georgia hosts meetings in formal semantics on a 
regular basis. The Eastern European Summer School in Generative Grammar has 
attracted many young linguists to the field. 

Currently the field shows increasing interaction between generative grammar- 
ians, formal semanticists, typologists, experimental psychologists and cognitive 
scientists. This interaction, we believe, will keep the field thriving and set the tone 
for the developments to come. 


See also the following chapters in this volume: 47 by Albrecht and 48 by Sornicola. 


Notes 


1 What we aimed for in this chapter is sketching a picture of the development that took 
place based on the sources we had, but in part also based on or perhaps even biased by our 
personal experience during different stages of this development. An extended version of 
this paper will be separately published. We would like to thank Werner Abraham, Riny 
Huybregts, Ekkehard Kónig, Henk van Riemsdijk, Henk Schultink and the editors of this 
volume for their comments and help in clearing up certain issues. More than ever it is im- 
portant to note that the authors are solely responsible for what is stated in this article. 

2 Note that we take a very broad definition of generative grammar, as a research pro- 
gramme which includes different competing frameworks — such as Lexical Functional 
Grammar, Head-driven Phrase Structure Grammar, the Minimalist Program, Optimality 
Theory, etc. — because they all take linguistics as a science that should provide a precise 
(explicit and formal) model of a cognitively embedded computational system of human 
language, and how it is acquired. 

3 To illustrate this point, Abraham informs us that he visited the DDR-Akademie für Wis- 
senschaften several times illegitimately, but with (illegal) support by E. Lang, R. Steinitz, 
M. Bierwisch, and M. Doherty. He (and others) sent them books and articles. 

4 Linguists who should also be mentioned in this context include W. Abraham, W. Boeder, 
W. Thümmel, H. Vater, and later W. Klein and A. von Stechow. 

5 At the time von Stechow still had a junior position. The responsible chair was P. Hart- 
mann. Hartmann himself was not involved in generative grammar, but he was open to 
new developments, and he was one of the few senior figures to attend the Linguistische 
Kolloquia. Together with Morris Halle, Frits Staal, Pieter Verburg and others, Hartmann 
was a member of the editorial board of the journal Foundations of Language. Inter- 
national Journal of Language and Philosophy, founded in 1965. This journal published 
some of the earliest contributions to generative grammar. 

6 To mention a few other prominent names from this generation: S. Felix, T.J. Gardner, 
T.N. Hóhle, J. Lenerz, J. Meisel, M. Reis, J. Weissenborn. 
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Index of languages and varieties 


A 

Abaza 125, 133, 138-139, 144, 146, 742 

Abkhaz 125, 133, 137—140, 142, 144, 146, 
148, 742 

Adamorobe Sign Language 278 

Adyghe 125, 128, 137, 139-140, 142, 144, 
146-148, 742 

Afrikaans 47 

Aghul 126, 129, 132-133, 135, 138, 146, 
150 

Agul see Aghul 

Akhvakh 126, 128, 138-140, 148 

Albanian 127, 199-207, 292, 297, 
308—318, 320, 322, 410—413, 418, 425, 
430-431, 435—436, 441, 445—449, 458, 
460—461, 502, 579, 646, 649, 652, 684, 
737-138, 712 , see also Arvanitika, 
Geg Albanian, Kosovo Albanian, Tosk 
Albanian 

Alemannic 507, 509, 535 

Alsatian 502, 571, 657 

American Sign Language 269, 278 

Andalusian Arabic 343 

Andi 126 

Anglo-Norman 341, 694 

Arabic 127, 129, 159-160, 163-164, 
174, 241-244, 342-344, 347, 349-350, 
352-353, 395-396, 417, 419, 468, 487, 
594, 691-692, 695, 731-732, 742, 
752-753, see also Andalusian Arabic, 
Cypriot Arabic, Lebanese Arabic, 
Moroccan Arabic, Siculo-Arabic 

Aragonese 69, 209, 347, 396, 398-399, 
401, 513 

Archaic Latin 345 

Archi 126-127, 129, 130-133, 138-140, 
142, 146-147, 149-150 

Armenian 127-132, 134, 137-144, 
146-150, 258, 344, 418, 458, 639-642, 
648, 650, 657, 731, 740—742, 817, see 
also Classical Armenian, Eastern 
Armenian, Western Armenian 

Aromanian 69, 308-309, 311, 313-315, 
317-318, 320-322, 411, 416, 418, 422, 
424, 429 


Arumenian see Aromenian 
Arvanitika 423, 429 

Assyrian 817 

Asturian 69, 78, 93, 396, 398, 400-401 
Aukstaitian see High Lithuanian 
Australian English 420 

Australian Maltese 242 

Australian Sign Language 278 
Austrian German 410, 496, 725 
Austrian Sign Language 275 

Avar 126-130, 136, 139, 150 
Azerbaijani 127-128, 134, 150, 740 
Azeri see Azerbaijani 


B 

Bagvalal 126, 144, 146 

Balkan Romance 258, 310, 313, 315, 
320-321, 430, 738 

Balkan Romani 262, 308-309 

Balkan Spanish see Judeo-Spanish 

Balkan Turkish 173, 308, 733, 737 

Balto-Slavic 291, 679 

Bashkir 161-165, 169, 174, 176 

Basque 209-220, 297, 299, 303-304, 
393-396, 398-403, 430, 458-459, 487, 
490, 502, 555, 644, 647, 657, 676-678, 
683, 685, 695, 806—808, 816, see also 
Euskara Batua 

Bavarian 58, 489, 507—508, 510, 537, 
707 

Belarusian 4, 97, 99, 101, 326, 330, 332, 
374, 516—518, 538-539, 649—650, 657, 
688 

Belarussian see Belarusian 

Belgian Dutch 303, 496 

Belorussian see Belarusian 

Berber 210, 396, 692 

Besleney 137 

Bezhta 126, 150 

Bible Greek, see Koiné Greek 

Bokmal 47, 54, 57, 67, 492 

Bosnian 97-98, 100, 433, 435, 442, 515, 
536, 650, 717, 761, 763—765, 767-768, 
771-773, 784 

Brazilian Portuguese 71, 77—78, 83, 87 
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Breton 31-33, 35-37, 39-43, 457, 461, 
647, 656-657 

British Sign Language 271, 276, 281—282, 
284—286 

Brittonic 693 

Brygian 411 

Budukh 126-131, 137 

Bulgarian 97, 99-102, 105-106, 109, 
111-114, 121, 308-314, 317-320, 322, 
330, 351, 409-410, 412-419, 421, 
423-425, 430, 436, 440, 442, 498, 502, 
514-516, 519, 535-536, 624, 642—643, 
648, 657, 685, 688, 695, 763, 770, 772 

Bulgarian Turkish 410, 423 

Burgenland Croatian 433-435, 441, 
444-445 

Byzantine Greek 346, 352, 412, 414, 
648 

Byzantine-Lombard 684 


C 

Calabrian 460—461 

Caló 401, 551 

Cantonese 623 

Canadian French 77, 89 

Caribbean Spanish 83 

Castilian see Spanish 

Catalan 69—70, 73, 77, 82, 92—93, 
279-280, 283, 296, 302, 397—401, 
403-405, 429, 459, 461, 502, 512-513, 
643, 647, 657 

Catalonian Sign Language 269 

Celtiberian 394—395 

Celtic Lepontic 681 

Central Mexican Spanish 73 

Central Latvian 334 

Chaghatay 159 

Chamalal 126 

Champenois 512 

Chechen 126 

Chinese 78, 387, 623, 661, 802, 806-807, 
see also Cantonese, Mandarin 

Chinese Pidgin Russian 386—387 

Christian Latin, see Late Latin 

Chuvash 159-160, 162-164, 167-170, 
173-174, 176 

Cimbrian 449, 460, 797 

Cimbro see Cimbrian 


Classical Armenian 127, 143, 648 

Classical Georgian 648 

Classical Greek 223-224, 227, 229, 649, 
686, 838 

Classical Latin 72, 75-76, 90, 396, 411, 
693, 809, 832 

Colloquial French 87, 90 

Colloquial Italian 85, 87, 445 

Colloquial Upper Sorbian 99, 297, 429, 
432-433, 437-439, 441, 443-444 

Continental Celtic Lepontic 681 

Cornish 31-32, 458, 461 

Corsican 69, 349, 657 

Cretan Greek 739 

Cretan Turkish 739 

Crimean Ottoman 739 

Crimean Tatar 161, 165, 174 

Croatian 97-101, 104—106, 110, 113, 115, 
119, 333, 410-412, 414-416, 420-421, 
434—435, 442, 444—445, 498, 502, 
514—515, 535-536, 646, 649-651, 657, 
688, 714—717, 721, 725, 761—773, 784, 
see also Burgenland Croatian 

Croatian Sign Language 277, 281 

Curonian Latvian 338 

Curonian Livonian 337—338 

Cypriot Arabic 349 

Cypriot Greek 223—224 

Cypriot Turkish 738-739 

Czech 97-100, 104, 108, 111-112, 
114-115, 117-118, 120, 325, 410, 415, 
495, 502, 516, 536, 541, 624, 633, 641, 
646, 657, 685, 688, 714-715, 719, 
721-725, 801, 849, 858, 873, see also 
Moravian, Zaolzie 


D 

Dacian 308, 411 

Daco-Rumanian 424 

Dalmatian 70, 411—412, 435 

Danish 47-48, 54, 57, 59, 62, 297, 326, 
328, 330, 333-334, 359, 361—362, 
368—370, 377, 449, 457-458, 486, 
496, 607, 624, 641, 645—647, 657, 
660, 779, 782, 801 

Danish Sign Language 271, 281 

Dargwa 126, 130-131, 135, 138, 142-143, 
146, 150 


Demotic Greek 224 

Doric Greek 412 

Dundaga Latvian 338 

Dutch 47, 49-50, 55-57, 59, 62-63, 65, 
269, 293, 296-297, 302-303, 373, 
386-387, 475, 489, 492, 502, 507-508, 
526-528, 531, 562-566, 612, 624, 634, 
645-647, 649, 693, 797, 801-802, 
868-871, see also Belgian Dutch, 
Flemish 


E 

Early New High German 707 

East Slovak Romani 260 

Eastern Algonquian 802 

Eastern Armenian 134, 150-151 

Eastern Italian 442 

Eastern Khanty 191 

Ekavian 515, 762-764, 767 

Emilian 87 

Enets 180 

English 20, 43, 47-59, 61-65, 78-79, 
89-91, 100, 118-119, 151, 164, 174, 
200, 214, 216, 226, 241-243, 250, 259, 
291-293, 296-300, 304, 331, 336, 345, 
362, 370, 374, 376-378, 386, 401, 406, 
411, 420, 424-426, 437, 449, 457-461, 
467, 474, 477-479, 486-487, 492, 495, 
498, 502, 507, 510-511, 526, 535-536, 
540-541, 555, 564, 566, 569, 582—584, 
588, 591—602, 605-617, 622-626, 631, 
634, 641, 645—649, 658—662, 664—665, 
685—686, 692, 694—695, 705—706, 723, 
738-739, 766, 769, 778-779, 785—186, 
801, 806, 816, 821, 824, 826, 829, 833, 
836, 839, 849, 858, 872, 877, see also 
Australian English, Old English, 
Solombala Pidgin English 

Erli Romani 260 

Erzya 179, 181, 183, 193, 382 

Esperanto 664, 815 

Estonian 4, 14, 22-23, 179-190, 192, 194, 
325, 328—330, 333, 335-337, 362, 374, 
455, 538, 624, 645, 647, 657, 689, 747, 
781 

Etruscan 679 

Euskara Batua 209, 398 
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F 

Faroese 47-50, 53-55, 57, 59-62, 64-65, 
297, 359, 361, 369, 377, 457, 607 

Finnish 22, 179-195, 297, 325, 328, 
330-331, 333, 335-337, 359, 361-368, 
371-378, 382, 468, 492, 495, 538, 
607-608, 611-617, 624, 645, 647, 657, 
689, see also Stadin Slangi Finnish 

Finnish Romani 260, 366 

Flemish 302, 458, 533, 563-565, 797, see 
also French Flemish 

Flemish Sign Language 269, 273, 
278-282 

Francien 703 

Francoprovengal see Franco-Provengal 

Franco-Provengal 70, 449, 456, 512 

French 33, 43, 55, 69-93, 103, 107, 
164, 200, 209, 291, 293, 296-297, 
300-302, 331, 344-345, 347-348, 
367, 374, 386, 397, 399, 403, 405, 
410, 417-421, 424-425, 429-430, 
442, 457, 459, 467, 477, 486, 495-496, 
502, 511-512, 535-536, 561-572, 
591-592, 600-602, 609, 612, 617, 
622-626, 633-634, 643, 645, 647-648, 
655-658, 660-662, 690, 694, 703-704, 
706, 708, 732, 775, 718—180, 785, 
801, 806, 826—828, 832, 877, 881, 
see also Anglo-Norman, Champeonois, 
Colloquial French, Francien, Old French, 
Picard, Walloon 

French Flemish 302 

French Sign Language 269, 271, 276 

Frisian 47, 49—50, 55, 57, 60, 62—63, 
65, 2977, 362, 449, 458, 462—464, 657, 
see also North Frisian, West Frisian 

Friulian 297, 434, 441, 462-463, 
714-715 


G 

Gagauz 161, 163-165, 170-176, 423, 
736-737, 739 

Galego see Galician 

Galician 69-70, 72, 77, 93, 398, 400—403, 
459, 497, 502, 657, 719, 721—722 

Galician Polish 725 

Gallic 430, 692 

Geg Albanian 199—200, 207, 317 
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Georgian 125, 127-129, 133-134, 
136-137, 140-142, 144—145, 147, 
149—150, 210, 328, 641—643, 648, 742, 
see also Classical Georgian 

German 4, 47-57, 59, 62-65, 89, 93, 100, 
115, 118, 181, 259, 291, 296-297, 
300—302, 326, 329—330, 337, 362, 364, 
367—368, 371, 374—375, 386, 405, 410, 
412, 415, 418, 420, 423, 425, 429-430, 
432-434, 437—438, 441, 443-445, 
448-449, 455, 457-464, 467, 475, 486, 
489—490, 493, 495—496, 502, 507—510, 
526, 535, 538, 541, 561—571, 582, 585, 
591—592, 594, 600—602, 606—609, 617, 
622—624, 626, 633-634, 646—647, 655, 
657-658, 660-662, 685, 688, 689—690, 
693, 695, 707—708, 714-726, 764, 766, 
779-782, 786, 793, 797, 799, 801-802, 
806, 809, 816, 832, 839, 858, see also 
Alemannic, Alsatian, Austrian German, 
Bavarian, High German, Limbourgian, 
Low Saxonian, Mócheno, Swiss Ger- 
man, Walser 

German Romani 260, 262 

German Sign Language 279, 282—283 

Gheg see Geg Albanian 

Godoberi 126, 127 

Gothic 48, 51, 53, 58, 640, 686, 695, 797 

Greek 75, 102, 200-201, 223-237, 258, 
261, 263—264, 266, 297, 308—310, 
312-317, 320—322, 330, 333, 342-347, 
350-353, 374, 394, 409-419, 421—423, 
425, 429, 446, 458, 461, 468, 538, 624, 
632, 640—644, 648—651, 657—658, 
661—662, 678, 684, 686—687, 691, 693, 
695, 699—700, 703—704, 706, 708—709, 
731, 734, 736, 738—139, 740—742, 772, 
777, 796-797, 808, 811, 816, 833, 838, 
see also Byzantine Greek, Classical 
Greek, Cretan Greek, Cypriot Greek, 
Demotic Greek, Doric Greek, Griko, 
Koiné Greek, Pontic 

Greek Sign Language 279—280, 283 

Griko 353, 684 


H 
Hebrew 72, 329, 349, 396, 658, 685—686, 
700, 796 


High German 4, 57, 368, 463—464, 509, see 
also Old High German, Middle High 
German, Early New High German 

High Latvian 4-5, 11, 325, 335-336, 336 

High Lithuanian 9, 334 

Hong Kong Sign Language 278 

Hungarian 7, 180-190, 192-196, 325, 327, 
328, 331, 337, 412-416, 418, 423, 502, 
538, 624, 657, 695, 714—718, 720, 
722—125, 764, 801, 877-878 

Hunzib 126 

Huron 802 


I 

Iberian 210 

Icelandic 33, 47-49, 52-62, 64-65, 
292-293, 359, 361, 369-370, 377, 
455-456, 607, 801 

Ijekavian 762-763, 767 

Illyrian 308, 411-413, 686, 692 

Indo-Pakistani Sign Language 278 

Ingrian 179, 325 

Ingush 126, 139 

Iranian 159 

Irish 31-35, 39-40, 43, 297, 456-457, 490, 
492, 495, 538, 550, 623-624, 641, 645, 
647, 657, 678, 693, see also Northern 
Irish, Middle Irish, Old Irish, Southern 
Irish, West Irish 

Irish Sign Language 273-275, 279—282 

Istrorumanian see Istro-Rumanian 

Istro-Rumanian/Istrorumanian 411, 421 

Italian 69—75, 82—88, 90, 93, 241—244, 
246, 248, 250-251, 297—298, 302, 330, 
344, 346—349, 351, 374, 401, 410, 416, 
421, 425, 429-430, 434—442, 444—445, 
447, 449, 456, 460, 462—463, 467-468, 
490, 512, 526, 530—531, 594, 624, 643, 
645, 647, 657-658, 660-661, 690, 692, 
702-703, 706, 708—709, 714—715, 
718—719, 721—722, 780, 786, 801—802, 
813, 832, 878 , see also Byzantine-Lom- 
bard, Calabrian, Colloquial Italian, East- 
ern Italian, Emilian, Friulian, Lombardic, 
Molisian Italian, Napolitan, Northern 
Italian, Pantesco, Salentino, Sicilian, 
Southern Italian, Tuscan, Venetian 

Italian Sign Language 271, 282 


J 

Japanese 159, 206, 623, 661 

Jordanian Sign Language 278 

Judaeo-Spanish see Judeo-Spanish 

Judeo-Spanish 72, 308, 418, 421, 423—424, 
742 

Judezmo see Judeo-Spanish 


K 

Kabardian 125, 128, 132, 137, 139, 146, 
see also Besleney 

Kajkavian 414, 420, 433, 514, 762—763, 
766, 772 

Kamass 180, 186 

Karaim 161-165, 168, 170-176, 325-327, 
329, 461 

Karata 126 

Karelian 179-180, 182, 325-326, 362, 383, 
385, see also Lude Karelian 

Kashubian 97, 99, 326, 331, 658 

Kazakh 162, 756-757 

Ket 746 

Khalaj 160 

Khanty 180, 181, 187, 191, 328, see also 
Eastern Khanty 

Khinalug 126-132, 135, 137, 139, 144, 
146, 150 

Koiné Greek 342, 346, 351, 691, 693 

Komi-Permyak 179 

Komi-Zyryan 179 

Korean 159, 594 

Kosovo Albanian 420 

Kryz 126, 130 

Kumyk 127, 129-130, 140, 161-162, 165, 
169 

Kurdish 741 


L 

Ladin 457-458, 461-463, 555-556, 714, 
see also Ladin Dolomitan 

Ladin Dolomitan 457, 462, 555 

Ladino see Judeo-Spanish 

Lak 126, 130, 135, 138, 142, 145-146 

Late Latin 342, 345-346, 351-352, 
692-693 

Latgalian see High Latvian 

Latin 21, 69, 71-80, 82-83, 85-90, 99, 
102, 127, 161, 165, 176, 181, 183, 
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199-201, 211, 227, 241, 298, 303, 308, 
342-343, 345-346, 350, 351-353, 371, 
374, 394, 396, 399, 410-415, 422, 430, 
440, 457—458, 487, 495, 513, 529—530, 
639—644, 646—649, 656, 658, 661—662, 
664, 684, 686-688, 690—694, 698—700, 
702-704, 706, 708—709, 724, 735, 
752-753, 163—164, 768, 773, 777-778, 
792, 794, 796-802, 806, 808, 811, 816, 
832, 838, see also Archaic Latin, Classi- 
cal Latin, Late Latin, Vulgar Latin 

Latin American Spanish 85 

Latvian 3-27, 183, 195, 325, 327—338, 
362, 374, 456, 624, 645, 647, 689, see 
also Central Latvian, Curonian Latvian, 
Dundaga Latvian, High Latvian, Low 
Latvian, Nehrungskurisch, Old Latvian, 
Tamian Latvian 

Latvian Romani 327 

Laz 125, 133, 137, 141, 145, 147, 742 

Lebanese Arabic 349 

Lemnian 679 

Lepontic 460 

Letgalian see High Latvian 

Lezgian 126, 128, 130, 136, 138, 147 

Ligurian 394, 681 

Limbourgian 565 

Lithuanian 3-27, 325, 328-330, 332, 334, 
336, 338, 362, 374, 383, 456, 624, 641, 
646, 657, 659, 689, 757, 811, see also 
High Lithuanian, Old Lithuanian 

Livonian 3, 179, 182-184, 187-188, 190, 
195, 325, 327-328, 330, 333-334, 
336—338, 460—461, 689, see also 
Curonian Livonian, Salis-Livonian 

Llanito see Yanito 

Lombardic 87, 462, 512, 528—531 

Low German 4, 53, 57, 326, 371, 375, 386, 
462—464, 489—490, 493, 646, 689, 707 

Low Latvian 4, 336 

Lower Saxonian 552, see also Old Saxon 

Lower Silesian 461 

Lower Sorbian 97, 99, 121, 432, 444, 449, 
459, 531, 533, 536, 539 

Lude Karelian 182 

Lusitian 681 

Luxembourgeois 567 
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M 

Macedonian 97-100, 102, 105, 106, 
111-112, 121, 308-314, 316, 318-322, 
410-411, 414-415, 417-420, 423-424, 
429, 436, 440, 442, 502, 514-515, 
536-537, 685, 688, 695, 735, 740, 765, 
770, 772, see also Usikova 

Macedonian Turkish 740 

Macedo-Rumanian see Aromenian 

Maltese 241-253, 297, 298, 342, 349, 350, 

352, 624, 659, 692, see also Australian 

Maltese 

Mandarin 623 

Mediterranean Lingua Franca 348 

Megleno-Romanian 308-311, 313-314, 

317-322, 411, 422 

Megleno-Rumanian see Megleno-Roman- 

ian 

Megrelian, see Mingrelian 

Mexican Spanish 74 

Middle High German 55, 832 

Middle Irish 694 

Middle Welsh 694 

Mingrelian 125, 128, 133-134, 144-145 

Minoan 679 

Mirandese 398,550 

Mocheno 449 

Moksha-Mordvinic 382 

Moldavian 72, 502, 513 

Moldovan see Moldavian 

Molise Slavic 435-449 

Molisian Italian 435, 442 

Mongolian 690, 746, 752—753 

Montenegran 515, 765, 767—768, 774 

Moravian 516, 714 

Mordvin 179-183, 185, 188-189, 

192-194, 328, 331, 334, 336, 689 

Moroccan Arabic 351 

Mozarabic 343, 396, 691 


N 
Nakh 125-126, 128, 130-131, 138, 140, 
142, 144, 153-156 

Napolitan 77, 490 

Navarro-Aragonese see Aragonese 
Nehrungskurisch 329—330 

Nenets 180 

Nivkh 746 


Nogai see Noghay 

Noghay 127, 161—163, 165, 168, 176 

Norn 460 

North Frisian 58, 462—463 

North Saami 180 

Northern Irish 32 

Northern Italian 83, 85, 442, 462, 690 

Norwegian 47-48, 53-55, 57, 59, 61-62, 
64, 67, 180, 333, 359, 361-362, 
367—372, 377, 379, 384—387, 390, 492, 
499, 502, 607, 646, 689, 782, 787, see 
also Bokmál, Nynorsk 

Nynorsk 47, 53—54, 57, 67, 362, 371, 489, 

496 


O 

Occitan 69, 70, 73, 84, 86-87, 93, 302, 
397-399, 402-403, 407, 449, 458, 512, 
656, 658, 801 

Old Church Slavonic 107, 121—122, 
412-414, 487, 768, 770, 781 

Old English 58, 695, 799, 828 

Old French 72, 80, 84, 87 

Old High German 55, 414 

Old Latvian 337 

Old Irish 456, 694 

Old Lithuanian 9, 13, 16 

Old Norse 58, 367, 369—370, 690 

Old Prussian 3, 325, 338 

Old Saxon 58 

Old Turkic 159, 162, 167, 177 

Old Welsh 694 

Oscan 458 

Ossetic 127—129, 131, 134, 137-139, 
142-143, 147, 150-151, 153-154, 
328 

Ottoman Turkish 159, 174, 413, 417, 729, 
731-733, 736, 740 


P 

Paeonian 411 

Pantesco 342, 350, 352 

Pennsylvania German 47 

Persian 159, 163-164, 173-174, 419, 640, 
651, 695, 731—732, 797, 816 

Phoenician 394 

Picard 512 

Pictish 678 


Piedmontese 69, 86-87 

Piemontese see Piedmontese 

Pipil 303-304 

Plattdeutsch see Low German 

Polabian 688 

Polish 4, 97, 99-100, 102, 104-105, 
107—121, 326, 329, 332, 362, 374, 383, 
429, 467, 516, 517—518, 536—537, 541, 
624, 634, 636, 658, 688—689, 714—715, 
717-722, 725, 742, 749, 770, see also 
Galician Polish, Zaolzie 

Pontic 223, 235 

Portuguese 69-70, 73-74, 77, 86-87, 93, 
349, 398, 497, 502, 513, 548, 624, 643, 
645, 647, 656, 705, 801-802, see also 
Brazilian Portuguese 


R 

Republican Turkish 732 

Resian 434-435, 441, 444-445, 448 

Riksmäl see Bokmäl 

Romani see Balkan Romani, Caló, East 
Slovak Romani, Erli Romani, Finnish 
Romani, German Romani, Latvian 
Romani, Turkish Romani, West Slovak 
Romani 

Romanian 69-74, 77-78, 83-84, 86, 88, 
93, 164, 261, 264, 297, 308—315, 317, 
320-322, 410-412, 414—416, 422-424, 
442, 502, 513, 534, 536-539, 624, 
645-646, 648-649, 658, 687, 714, 
716-717, 720, 722, 725, 772, see also 
Daco-Rumanian 

Rumanian see Romanian 

Rumantsch Grischun 462, 465 

Russenorsk 365, 385, 689 

Russian 4, 97, 99, 101-121, 128, 150, 
161-164, 174, 180-182, 195, 326, 
330-331, 333-334, 337, 362, 365, 
374, 381-390, 410, 415, 421-423, 
425, 489, 502, 513, 516-518, 534, 537, 
539-541, 550, 608, 622, 643, 648, 
658-661, 688—690, 717, 725, 745—758, 
762-764, 770, 782, 786, 801—802, 
871-872 

Russian Romani 260 

Rutul 126-127, 131, 137 
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S 

Saami 179—194, 325, 327, 336, 359, 
362-367, 371—372, 385—388, 461, 497, 
502, 579, 657, 674-675, 689, see also 
North Saami 

Sámi see Saami 

Salentino 347, 353 

Salis-Livonian 337-338 

Sanskrit 797, 806—809, 816 

Sardinian 69-70, 72—73, 77, 82, 85, 93, 350 

Scots 31-36, 39-40, 43, 456, 459—460, 
510, 646 

Scottish Gaelic see Scots 

Scythian 797 

Selkup 180, 757 

Sephardic Spanish see Judeo-Spanish 

Serbian 97-101, 104-106, 108-113, 115, 
119, 121, 308, 333, 410-412, 414—416, 
419-420, 433, 435, 442, 498, 514-516, 
519, 535—536, 643, 648-649, 657-658, 
688, 714, 716-, 761-773, see also 
Ekavian, Slaveno-Serbian, Usikova 

Serbo-Croatian 98, 433, 442, 444-445, 
498, 515, 649-651, 721, 761-766, 770, 
772-7173, see also Croatian, Serbian 

Sicilian 77, 242, 346—347, 350, 490, 692 

Siculo-Arabic 241 

Sign Language of the Netherlands 269, 
279, 282 

Slaveno-Serbian 762, 764 

Slovak 97, 99-100, 109, 114, 325, 456, 
502, 516, 518, 624, 657—658, 688, 
714-716, 718, 720, 723, 725 

Slovenian see Slovene 

Slovene 97,99, 100—101, 107, 109, 121, 333, 
410, 416, 420, 434, 441—442, 445, 449, 
502, 514, 624, 641, 646, 657—658, 685, 
688, 714-716, 718, 721—722, 125, 165, 772 

Solombala Pidgin English 385-386 

Sorbian 102-103, 107, 121, 330, 432-433, 
438, 442, 445, 449, 459, 532, 657, 688, 
see also Upper Sorbian, Lower Sorbian, 
Sorbian-Lusatian 

Sorbian-Lusatian 685 

Southern Irish 32 

South Italian, see Southern Italian 

Southern Italian 77, 84, 92, 349, 352, 446, 
490 
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Spanish 69-75, 77-86, 88, 90-93, 209, 
211, 293, 297, 299, 345, 347—349, 
351-352, 393-396, 398-401, 403-405, 
459, 467-468, 502, 512-513, 600-602, 
622-624, 641, 643, 645-647, 658, 
660-661, 685, 704-708, 801, see also 
Asturian, Caribbean Spanish, Central 
Mexican Spanish, Judeo-Spanish, Latin 
American Spanish, Mexican Spanish, 
Mirandese, Yanito 

Spanish Sign Language 269 

Stadin Slangi Finnish 330 

Surselvian 462 

Surzhyk 388-390 

Svan 125, 127-131, 142 

Swedish 47-48, 52-54, 57, 59, 62, 64, 326, 
328, 330—331, 333, 359, 361-368, 
371-375, 311—318, 384—385, 387—388, 
468, 492, 495—496, 502, 540, 582—583, 
585, 607, 610—611, 624, 645—647, 660, 
801—802 

Swedish Sign Language 271, 281—282, 363 

Swiss German 486, 509 

Syriac 640, 742 


T 

Tabasaran 126, 138 

Tagalog 802 

Tahitian 658 

Tamian Latvian 329, 332—334, 338 

Tartessian 394, 678—679 

Tatar 161—165, 174, 176, 264, see also 
Crimean Tatar 

Teutonic 797 

Thracian 308, 411—413, 692 

Tindi 126 

Tosk Albanian 199—200, 207, 313, 412, 436 

Trasianka 388—390 

Tsakhur 126-127, 129, 134, 137 

Turkish 160-165, 168, 170, 172-176, 259, 
261, 264, 308, 345, 409, 410, 412, 414, 
417-419, 421, 423-425, 467-468, 498, 
650-651, 658, 688, 695, 729, 731-743, 
see also Balkan Turkish, Bulgarian Tur- 
kish, Cretan Turkish, Ottoman Turkish, 
Cypriot Turkish, Macedonian Turkish, 
Republican Turkish 

Turkish Romani 261 


Turkish Sign Language 282-283 
Tuscan 346, 658, 702 


U 
Ubykh 125, 128-130, 133 

Udi 126-128, 130-131, 134-135, 138, 
145-147, 150 

Udmurt 179, 180, 183, 188, 331 

Ugaritic 684 

Ukrainian 97, 99, 103, 121, 374, 381—383, 
388-390, 502, 516, 518, 534, 649, 657, 
688, 714-715, 719, 746, 749, 758 
Jmbrian 458 

Jpper Sorbian 99, 297, 432—433, 438, 443, 
449, 531, 536, see also Colloquial Upper 
Sorbian 

Usikova 420 


EC 


V 

Valencian 69, 403, 657-658 

Venetian 69, 344, 347, 412, 416—417, 434, 
662, 688 

Veps 179-180, 182, 326, 330-331, 333, 
337, 689 

Vepsian see Veps 

Vlach see Aromenian 

Vodish see Votian 

Vote see Votian 

Votian 179, 325, 330, 461 

Vulgar Latin 72-73, 75, 351, 394, 396, 411, 
693 


W 

Walloon 512, 562, 564—566 

Walser 449 

Welsh 31-33, 35-41, 43, 490, 645, 647, 
657, see also Middle Welsh, Old Welsh 

West Frisian 47, 463 

West Irish 32 

West Slovak Romani 260 

Western Armenian 127, 134 


Y 

Yanito 401 

Yiddish 47, 293, 297, 326, 362, 658, 714, 
717 


Z 
Zaolzie 516 
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A Arkadiev, Peter M. 144 
Aasen, Ivar 370-371 Arnason, Kristjan 370, 493 
Abaev, Vasilij I. 151 Arnauld, Antoine 794 
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Abraham, Werner 55, 871, 884 Arzoz, Xabier 467 
Adams, Douglas Q. 676, 681 Aschenberg, Heidi 701, 709 
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Ager, Dennis E. 397, 783 ASic, Tijana 422-424 
Ahlgren, Inger 271 Askedal, John Ole 48-49, 54, 64 
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Allan, Keith 273 Baker, Philip 470 
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Bergaigne, Abel 814 Breva-Claramonte, Manuel 658, 662 
Berger, Tilman 438 Brincat, Joseph 241—242, 343, 350, 
Bergman, Brita 270-271, 281-282 692 
Bergroth, Hugo 366 Britain, David 504—505 
Bergs, Alexander 459 Brix, Emil 717 
Bergvall, Frans 331 Broch, Ingvild 385-7 
Bernini, Giuliano 86-87, 301 Broch, Olaf 385—388 
Berns, Margie 607, 617 Brgndal, Viggo 833 
Bertuch, Johann 806 Browning, Robert 642 
Besch, Werner 707 Bruchis, Michael 758 
Bevington, Gary Loyd 200, 207 Brugmann, Karl 812-813 
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Bóttger, Claudia 600 Campbell, Lyle 303, 341—342 
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Bouquet, Simon 826, 833, 840 Carmichael, Cathie 747 
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Bovingdon, Roderick 242 Cassirer, Ernst 823 
Bradaric-Joncié, Sandra 277, 281 Castrén, Mattias A. 817 
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Fodor, Istvan 782 Gorter, Durk 467—468, 473-474, 478, 548, 
Fons, Montserrat 405 611, 656-657 
Formigari, Lia 807 Graddol, David 605, 706 
Forsyth, Katherine 678 Graffi, Giorgio 849, 876, 878 
Fort-Cafiellas, Maria Rosa 400, 513 Grandi, Nicola 351 


Fortunatov, Filipp F. 836 Green, David 600 


Greenberg, Joseph H. 50 

Greenberg, Robert 515, 763, 767 

Greene, Kathryn L. 585 

Greenfeld, Liah 778 

Grégoire, Abbé Henri 779 

Grenoble, Lenore A. 746-747, 751—752, 
755, 757 

Greppin, John A.C. 151 

Grimm, Jacob 809-10 

Grin, François 554, 557 

Grondelaers, Stefan 489, 496 

Grünthal, Riho 184 

Grzega, Joachim 655, 658, 663, 665 

Gupta, Anthea Fraser 595 

Gustavsson, Sven 765 

Gutschmidt, Karl 755 

Gyarmathi, Sámuel 808 


H 

Haak,Wofgang 675 

Haarmann, Harald 381, 456, 526, 622 

Haase, Martin 210, 218, 303, 393, 399 

Haberland, Hartmut 59 

Habermas, Jürgen 592 

Haeseryn, Walter 55 

Hagége, Claude 782 

Haider, Hübert 876 

Haig, Geoffrey 165 

Hajiéová, Eva 846-847, 873 

Haiman, John 462 

Halhed, Nathaniel Brassey 807 

Halliday, Michael A. K. 830, 836-837, 
847, 851—852, 859 

Hannan, Kevin 516 

Hansen, Bjoern 12, 20, 109, 113, 300 

Harbert, Wayne 47-49, 53, 57-59, 62, 64 

Harnisch, Rüdiger 509-510 

Harris, Alice C. 131, 145-146 

Harris, Roy 826 

Harris, Zellig 871 

Hary, Benjamin 648-649 

Hasebrink, Uwe 607 

Haspelmath, Martin 42, 147, 292-293, 
296-300, 303, 352, 525-526, 533 

Hastings, Adrian 650, 777 

Haugen, Arne 756 

Haugen, Einar 460, 495, 551—552, 701, 
TIS 
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Häusler, Alexander 675-676 

Hausmann, Frank-Rutger 698-699 

Havas, Ferenc 191 

Havet, Louis 815 

Havränek, Bohuslav 831, 873 

Hazai, Györg 738 

Heer, Hannes 621 

Heine, Bernd 292-293, 298, 303-304, 
683 

Heinecke, Johannes 32-33, 36-37, 39-41 

Helimski, Eugen 184 

Hellinger, Marlis 579-580 

Heltoft, Lars 59 

Hemon, Ropaz 36-9, 42 

Hentschel, Gerd 389 

Hermann, Tanja 526 

Herrlitz, Wolfgang 661 

Herson-Finn, Viktoria 515 

Hewitt, B. George 142, 148-149, 151 

Hickmann, Anton 718 

Hilgendorf, Suzanne K. 607 

Hill, Peter M. 346, 411, 420, 688, 737 

Hinderling, Robert 449 

Hinrichs, Uwe 321, 409-415, 417-419, 
421, 424-425, 541 

Hinskens, Frans 505, 507, 509 

Hirsch, Francine 756 

Hjelmslev, Louis 829-830, 833-836, 838, 
850, 878 

Hobsbawm, Eric 761, 781 

Hock, Hans Henrich 292 

Hockett, Charles F. 825 

Hoenigswald, Henry M. 805, 809-810 

Hofbauer, Gottfried 810 

Högström, Pehr 385 

Holton, David 223-224, 235 

Holtus, Günter 72 

Holvoet, Axel 332-333 

Honti, Läszlö 184, 186, 188 

Hooks, Bell 588 

Hopper, Paul 80 

Horn, Dagmar 757 

Horner, Kristine 568 

Horrocks, Geoffrey 223 

Hotchkiss, Valerie R. 649 

House, Juliane 591, 595-600, 609 

Hroch, Miroslav 723 

Hubbard, Philip 207 
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Hübschmann, Heinrich 817 
Hudlett, Albert 570 

Hughes, Stephanie 571 

Huguet, Angel 405 

Hull, Geoffrey 241 

Humphreys, Humphrey Lloyd 32 
Huntington, Samuel P. 650 
Huson, Daniel H. 327 


I 

Igla, Birgit 268 

Ijsewijn, Jozef 699 

Imart, Guy 753 

Inglehart, Ronald F. 548, 556 
Ising, Erika 801 

Ivić, Pavle 442 


J 

Jaeggli, Oswaldo 205 

Jäger, Ludwig 826 

Jahr, Ernst Hákon 385-387 

Jakobson, Roman 100, 823, 828, 831, 838, 
846, 848, 853, 873, 877 

Jakovlev, Nikolai F. 752 

Janes, Judit 405 

Janhunen, Juha 180, 182, 185 

Jankowsky, Kurt R. 813 

Jantunen, Tommi 281 

Jensen, Frede 73 

Jespersen, Otto 301-302, 813, 815-816, 
849 

Jessner, Ulrike 607 

Johanson, Lars 141, 160, 163, 172, 176, 
688, 731, 739—740 

Johnson, Samuel 706, 882 

Johnston, Trevor 271 

Jones, William J. 797, 807 

Jørgensen, J. Normann 471 

Jørgensen, Jens 486 

Joseph, Brian D. 223, 225-227, 229, 308, 
322 

Joseph, John Earl 647 

Josephson, Olle 376 

Juarros-Daussà, Eva 405 

Judson, Pieter M. 724 

Jungmann, Josef 724 

Jutroniéc-Tihomirovié, Dunja 419 

Juvonen, Päivi 331 


K 

Kabatek, Johannes 401—402, 404 

Kachru, Braj B. 595, 609 

Kahane, Henry 348 

Kahane, Renée 348 

Kaiser, Georg 84, 462 

Kalinina, Elena 146 

Kallio, Petri 672—676, 678—680 

Kallulli, Dalina 200-201, 205-207 

Kaltz, Barbara 658 

Kamusella, Tomasz 781 

Kann, Robert A. 717 

Kappeler, Andreas 750 

Karadžić, Vuk 410, 763, 767 

Karakog, Birsel 168, 174 

Karcevskij, Sergej 831 

Karlsson, Fred 270 

Kask, Arnold 336 

Kaufman, Terrence 351—352, 430 

Kazenin, Konstantin I. 146, 148 

Keating, Michael 557 

Keenan, Edward L. 90, 526, 540 

Keller, Rudolf 706—708 

Kelly, Louis G. 662 

Kelly-Holmes, Helen 506 

Kertzer, David 471-472 

Khasanov, Bakhytzhan Kh. 750 

Kibrik, Alexandr E. 130, 134, 136, 
139—140, 144-147 

Kibrik, Andrej 148 

Kiefer, Ferenc 184, 187, 877—878 

Kieffer, Jean-Louis 570-571 

King, Ruth 580, 585 

Kintana, Noemi 683 

Kitis, Eliza 235 

Klein, Carlo 566, 569 

Klemola, Juhani 695 

Klimov, Georgij A. 75, 151 

Klooster, Willem G. 868, 870-871 

Kloss, Heinz 487, 498, 501, 551—552, 567, 
779, 781-782 

Knot, Antoni 719 

Koch, Peter 73, 79, 703 

Kodzasov, Sandro V. 130, 136 

Koerner, Ernst F. Konrad 805, 813, 822 

Körner, Karl-Hermann 79 

Koivulehto, Jorma 184, 675-676, 680 

Koneski, Blaze 318, 415, 772 


Kónig, Ekkehard 301, 525, 876, 884 

Kontzi, Reinhold 347 

Kooij, Jan 870 

Kopitar, Jernej 307—308 

Koptjevskaja-Tamm, Maria 297, 326, 
341—342, 352, 374, 689, 691 

Kordić, Snježana 766 

Korhonen, Mikko 184, 187 

Kortmann, Bernd 42, 301, 526-7, 531, 542, 
648 

Koryakov, Yury B. 151 

Koster, Jan 880 

Kostov, Kiril 421 

Kowalski, Tadeusz 329, 736 

Kowner, Rotem 606 

Kozinskij, Isaak Š. 386 

Kraak, Albert 868-871 

Krahe, Hans 677, 686 

Krauss, Michael 128 

Kreindler, Isabelle T. 745, 750 

Kremer, Ludger 507 

Kress, Bruno 55, 58, 

Kristiansen, Tore 368, 378, 486, 492, 607, 
610, 617 

Krueger, John R. 169, 176 

Krysin, Leonid P. 748, 751 

Krzyzanowski, Michal 621-622, 626 

Kulonen, Ulla-Maija 186 

Kumakhov, Mukhadin A. 132 

Kunzmann-Miiller, Barbara 769 

Kurylowicz, Jerzy 857 

Kuteva, Tania 292—293, 298, 303-304, 
534, 540, 683 

Kutscher, Silvia 742 

Kyle, Jim 270-1 

Kymlicka, Will 557 


L 

Laakso, Johanna 9, 12, 387 
Labov, William 583 
Labrie, Normand 558 
Lacroix, René 137, 141 
Laitin, David D. 745, 747, 749 
Lancelot, Claude 794 
Lander, Yury 142, 148 
Lane, Michael 824 
Langendoen, Donald T. 837 
Langston, Keith 766 
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Langstrof, Christian 531 

Lanz, Tilman 405 

Lapesa, Rafael 704 

Larsen, Amund B. 375 

Laumesfeld, Daniel 570 

Lauwers, Peter 814 

Law, Vivien 793 

Lazarus, Moritz 810 

Leau, Léopold 816 

Ledgeway, Adam 72, 74, 76 

Lehmann, Christian 73 

Lehmann, Winfred P. 805 

Leibniz, Gottfried Wilhelm 807 

Leith, Dick 706 

Lenin, Vladimir Ilyich 746, 751, 755, 758 

Lenz, Alexandra 491 

Leppänen, Sirpa 610, 617 

Lepschy, Giulio C. 824 

Leskien, August 100, 812 

Letuchiy, Alexander 142 

Levey, David 401 

Levin, Beth. 206—207 

Lewis, Evan Glyn 753 

Lewis, Geoffrey M. 732 

Lewis, Rhodri 793 

Lewy, Ernst 209 

Liddell, Scott 274 

Lindstedt, Jouko 308, 321 

Liskovec, Irina V. 389 

Livia, Anna 578 

Llamas, Carmen 510-511, 517 

Lobin, Günter 664 

Lodge, Anthony 704 

Lomonosov, Mixail 809 

Lomtatidze, Ketevan V. 151 

Loomba, Ania 578 

Luhtala, Anneli 794 

Lunden, Siri Sverdrup 387 

Lundén, Thomas 519 

Lundin Akesson, Katarina 361 

Luther, Martin 646-647, 699—701, 707, 
778, 793 

Lyons, John 823, 825, 836, 840 

Lyutikova, Ekaterina A. 143 


M 
Maas, Utz 498, 875 
Maat, Jaap 794 
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MacAulay, Donald 31, 33-35, 40-41 
Mac Eoin, Gearöid 34-5 

Mackenzie, Ian 81 

Mackridge, Peter 223—224, 235 
Mehlum, Brit 372 

Magocsi, Paul Robert 714 

Maiden, Martin 72 

Maisak, Timur A. 133 
Malikouti-Drachman, Angeliki 224 
Malinowski, Bronislaw 825 
Mallory, James 676, 681, 686 
Manchester, Martin L. 807, 882 
Manolessou, Io 227 

Marazzini, Claudio 658 

Marino, John A. 697 

Marmaridou, Sophia S. 586—587 
Marr, Nikolai J. 753, 836 

Martin, Peter 474 

Martinet, André 825-826, 838, 847, 852, 
855, 859—861 

Martínez González, Antonio 397, 401 
Marty, Anton 849 

Mascaró, Joan 878 

Massey, Victoria M. 207 

Mathesius, Vilém 831, 846, 848—852, 854, 
856-858, 862, 873 

Matras, Yaron 260, 262, 267—268, 321, 459 
Mattfolk, Leila 378 

Mattheier, Klaus J. 369, 496, 498 
Matthews, Peter 823 

Mauranen, Anna 596, 599 
Mayrhofer, Manfred 807 

Mazilu, Dan Horia 646 

Mazower, Mark 421 
McConnell-Ginet, Sally 586 
McLelland, Nicola 802 

McNamara, Tim 632-633 

McRae, Kenneth D. 556 

Meillet, Antoine 814—815, 837, 849 
Meinhof, Carl 817 

Mel’£uk, Igor 836 

Menz, Astrid 163, 171—172, 737 
Merdanova, Solmaz R. 133 
Mesthrie, Rajend 609 

Methodius 346, 414, 641, 687 
Metzeltin, Michael 72, 344, 348 
Metzger, Bruce M. 649 

Meyer, Paul 815 


Meyer, Roland 114 
Michatk, Frido 449 
Miestamo, Matti 186 
Mifsud, Manwel 249, 349 


Milani, Tommaso M. 632 
Milković, Marina 277, 281 
Mill, John Stuart 780 
Milles, Kari 584 

Mills, Sara 578, 586 


Miréev, Kiril 413, 415 

Misteli, Franz 810 

Moldovan, Alekandr M. 122 
Mollin, Sandra 609 

Monaghan, James 837 
Mgnnesland, Svein 768 
Monteagudo, Henrique 400 
Mørck, Endre 61 

Morf, Heinrich 814 

Moritsch, Andreas 718 
Morpurgo-Davies, Anna 805, 813 
Moschonas, Spiros A. 224 
Moser, Hugo 498 

Motsch, Wolfgang 874 
Moulin, Claudine 567 

Mounin, Georges 825 
Mühlhäusler, Peter 595 

Müller, Friedrich Max 811,814 
Müller, Natascha 82 

Mueller- Volmer, Kurt 807 
Mulcaster, Richard 705 
Muljacié, Zarko 411 

Mullonen, Maria 331, 333 
Mufioz, Carmen 403 

Murelli, Adriano 301, 529, 533-534 
Mutalov, Rasul 146 

Muysken, Pieter 683 

Myhill, John 643, 650-652 


N 
Narbona, Antonio 401 

Nau, Nicole 7, 11, 335-336 
Naylor, Kenneth E. 763 
Nebrija, Antonio de 704, 801 
Neis, Cordula 795 

Nelde, Peter H. 548, 554 
Németh, Julius 733, 737, 740 


Miklosich, Franyo (Franz) 308, 738, 809 


Milner, Jean-Claude 826, 828, 877, 881 


Nerlich, Brigitte 851 
Neuckermans, Annemie 303, 373 
Neumann, Bernd 810 

Neumann, Günter 808 
Neweklowsky, Gerhard 433, 441, 445 
Newerkla, Stefan Michael 724 
Newmark, Leonard D. 200-201, 203-204, 
207, 318 

Newton, Gerald 567 

Nic Craith, Máiréad 467, 623, 626 
Nichols, Johanna 130, 139-40 
Niebaum, Hermann 507 
Nikčević, Vojislav 767—768 
Nikoéevié, Lidija 514, 517 
Nikolaev, Sergej L. 125 

Nikolov, Marianne 478 

Nilsson, Ana-Lena 271 
Noordegraaf, Jan 860, 868 
Norberg, Madlena 432 

Norton, Glyn P. 700 

Nuyts, Jan 300 


(0) 

O'Dowd, Liam 519 

Oesterreicher, Wulf 73, 703 

Ogden, Charles 664, 829 

Öhlfearnäin, Tadhg 490 

Okuka, Miloš 122, 763, 766 

Ongstad, Sigmund 661 

Onysko, Alexander 607—608, 610, 617 
Osthoff, Hermann 812 

Ostman, Jan-Ola 364, 374—375, 607 


P 

Pahta, Päivi 608, 611-612 

Palmer, Frank 836 

Palou, Juli 405 

Panzer, Baldur 384 

Parayre, Sonia 549, 552-553 

Paris, Gaston 815 

Parry, Gwynedd 553 

Parveva-Kern, Elka 415—416 

Passy, Paul 813 

Patten, Alan 557 

Paul, Hermann 812, 816, 822, 850-851, 
862 

Pauwels, Anne 576, 578—579, 583 

Peano, Giuseppe 815 
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Pennycook, Alastair 605 

Perekhvalskaya, Elena V. 386-387 

Perniss, Pamela 279, 282-283 

Peters, Manfred 796 

Petersen, Roger 745 

Peti-Stanti¢, Anita 766 

Pfau, Roland 279—280, 282—283 

Pfeil, Beate Sybille 552 

Philippaki-Warburton, Irene 223—224, 230, 
235 

Phillipson, Robert 477, 595, 609, 623 

Pica, Pierre 876-877 

Pickering, John 806 

Pictet, Adolphe 808 

Pietsch, Richard 329—331 

Pinheiro, José Luís Palmeiro 513 

Pitkänen, Heli 695 

Pizzuto, Elena 271 

Plank, Frans 646, 810 

Poche, Bernhard 456 

Pohl, Jacques 511, 512 

Pokrovskaja, Ljudmila A. 163 

Polinsky, Maria 139, 147 

Polivanov, Evgenij D. 836 

Pomino, Natasha 78 

Popović, Ivan 410, 412, 414—415 

Porák, Jaroslav 300 

Pott, August Friedrich 809 

Poulain, Michel 470—471 

Poussa, Patricia 375 

Preisler, Bent 610 

Preston, Dennis 506 

Price, Cathy 600 

Price, David 649 

Prinzhorn, Martin 876 

Priscian 662, 792, 794, 799 

Probst, Ernst 672-673, 675—676, 680 

Prosper, Blanca M. 673, 677 

Puech, Gilbert 242 

Puff, Helmut 801 

Pusch, Claus D. 404 

Putzu, Ignazio 350—351, 685 


Q 

Quer, Josep 282-283 

Quintilian, Marcus Fabianus 699—700, 
798-799 
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R 

Raible, Wolfgang 91 

Ralli, Angela 225, 228—229 

Ramat, Paolo 86-87, 301, 341—342, 
349-352, 685 

Rammelmeyer, Matthias 415 

Ranger, Terence 781 

Rappaport-Hovav, Malka 206-207 

Rask, Rasmus 808, 810 

Raynouard, Francois Juste Marie 809 

Reichling, Anton 860, 868—870 

Rehder, Peter 122 

Reiter, Norbert 321 

Remy, Johannes 749 

Renfrew, Colin 675—676, 685—686 

Rensch, Karl-Heinz 824 

ReSetar, Milan 435, 437 

Reuter, Mikael 366 

Richards, Jack C. 663, 829 

Rindler Schjerve, Rosita 548, 720 

Rischel, Jórgen 833, 835 

Rivero, Maria-Luisa 205 

Robbeets, Martine Irma 159 

Robins, Robert H. 822, 836 

Rodgers, Theodore S. 663 

Rogers, Margaret 607 

Rohlfs, Gerhard 347, 353, 446 

Roma, Elisa 694—695 

Rooryck, Johan 876-877 

Rosenhouse, Judith 606 

Rossebastiano, Alda 662 

Rothstein, Bjorn 111 

Rousselot, Pierre 813 

Roussou, Anna 235 

Rubin, Joan 552 

Rubin, Donald L. 585 

Runnels, Curtis N. 673 


S 

Sajnovics, János 808 

Sallandre, Marie-Anne 276 
SamardZija, Marko 421 
Sammallahti, Pekka 181—182 
Sandfeld, Kristian 308 

Sandgy, Helge 377 

Sankoff, David 400 

Sanso, Andrea 292, 352, 691 
Sapiro, Abram B. 534, 537, 540 


Sarhimaa, Anneli 326, 385 

Sasse, Hans-Jiirgen 213 

Saumjan, Sebastian K. 836 

Savoia, Leonardo M. 436 

Scaliger, Julius Caesar 800 

Sterba, Lev V. 836 

Schalber, Katharina 275 

Schaller, Helmut Wilhelm 308, 417 

Schalley, Eva 300 

Schembri, Adam 273, 283 

Schenker, Alexander 100 

Scherer, Wilhelm 816 

Schermer, Trude 270 

Scheuringer, Hermann 508 

Schifferle, Hans-Peter 509 

Schjerve-Rindler, Rosita 720 

Schleicher, August 811-812, 814-6 

Schleiermacher, Friedrich 724 

Schlobinski, Peter 509 

Schmidt, Johannes 811 

Schmidt, Karl Horst 31 

Schmidt-Rohr, Georg 551 

Schmitt, Christian 72 

Schoenthal, Gisela 581 

Scholze, Lenka 433, 437—438, 443-444 

Schottelius, Justus Georg 708, 793, 797, 
801-802 

Schräpel, Beate 583 

Schrijver, Peter 678 

Schröder, Konrad 622, 659 

Schubert, Gabriella 773 

Schuchardt, Hugo 348, 395, 809, 811, 
813-815, 823 

Schultink, Henk 868-871, 884 

Schultze, Benjamin 807 

Schulze, Wolfgang 143, 151 

Schumacher, Stefan 679 

Sebeok, Thomas A. 805 

Sechehaye, Albert 832 

Segal, Dmitri 871 

Seidlhofer, Barbara 596, 599, 665 

Seiler, Guido 532 

Setälä, Eemil Nestor 337 

Seuren, Pieter A.M. 868, 870 

Sgall, Petr 846-847, 872-873 

Sharp, Harriett 610 

Shaumyan, Sebastian K. 871-872 

Shohamy, Elana 611 


Siemund, Peter 683 

Siertsema, Bertha 833, 835 

Sievers, Eduard 813 

Siewierska, Anna 118 

Siguán, Miguel 663 

Sikimić, Biljana 422-424 

Sinner, Carsten 400 

Skutnabb-Kangas, Tove 595 

Slezkine, Yuri 749 

Slobin, Dan 386 

Sloboda, Marian 516—517 

Slocum, John W. 745 

Smirniotopoulos, Jane C. 229 

Smith, Anthony D. 778 

Smith, John Charles 72 

Smith, Michael G. 752 

Soboleva, Polina A. 871 

Solta, Georg Renatus 308, 410-412, 419 

Sornicola, Rosanna 831, 849, 851-852, 
857-858, 873 

Speranskij, Michail M. 750 

Spielmann, Daniel 599 

Staal, Frits 869, 884 

Stadnik, Elena 102 

Stammerjohann, Harro 805 

Stankiewicz, Eward 100, 106 

Stark, Elisabeth 78 

Starkey, Hugh 632 

Starostin, Sergej A. 125 

Stassen, Leon 193 

Steenwijk, Han 441, 445, 449 

Steinbach, Markus 279—280, 282—283 

Steinbauer, Dieter 679 

Steinthal, Heymann 810 

Stephens, Janig 34 

Stern, Dieter 332, 334, 338, 365, 385, 387, 
689 

Sternemann, Reinhard 809 

Stevin, Simon 801 

Stojkov, Strojko 536 

Stokoe, William 269 

Stolz, Christel 248 

Stolz, Thomas 249, 292-293, 297, 341 

Stourzh, Gerald 718, 721—722, 724, 726 

Strasser, Gerhard 794 

Strubell, Miquel 548 

Strunk, Klaus 678 

Stukenbrock, Anja 761 
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Subbiondo, Jo 793 

Sumbatova, Nina 146 
Sundhaussen, Holm 762 
Sussex, Roland 100, 102 
Sutton-Spence, Rachel 271, 276 
Svoboda, Aleš 846 

Sweet, Henry 813, 849 
Swiggers, Pierre 806—808, 814 
Szabolsci, Anna 877—878 
Szucsich, Luka 433-434 


T 

Taavitsainen, Irma 608, 611—2 

Tabouret-Keller, Andrée 861 

Tagliavini, Carlo 805 

Tatevosov, Sergei 141 

Taylor, Talbot J. 826 

ten Thije, Jan 593 

Ternes, Elmar 32, 38, 41, 43 

Tervoort, Bernard 269 

Tesi, Riccardo 702-703 

Tesniére, Lucien 847, 852 

Testelec, Jakov G. 115, 143, 146 

Tecavéic, Pavao 74 

Teyssier, Paul 705 

Thieroff, Rolf 111 

Thggersen, Jacob 364 

Thomas, Alan 32-33, 461 

Thomas, George 725 

Thomason, Sarah Grey 351—352, 430 

Thompson, Sandra A. 80 

Thompson, Robert L. 461 

Thráinsson, Hóskuldur 48, 55, 60, 64 

Thümmel, Wolf 875, 884 

Tiersma, Pieter Meijes 55, 57, 60 

Timberlake, Alan 383-384 

Timmermans, Nina 271 

Tito, Josip Broz 420, 762, 764—765, 771 

Titone, Renzo 662—664 

Togeby, Knud 836 

Toldova, Svetlana Yu 143 

Tollefson, James W. 562 

Toman, Jindrich 848—849, 873 

Tomić, Olga Miseska 83, 199, 307, 309, 
322, 346, 515, 535, 537, 695, 738, 769, 
772 

Torp, Arne 361 

Trask, Robert Lawrence 210, 677—678, 681 
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Treffers-Daller, Jeanine 564, 572 
Triandafyllidou, Anna 621 
Trnka, Bohumil 831 

"Trommelen. Mieke 868 
Trosterud, Trond 185 


Trubetzkoy, Nikolaj S. 100, 150, 307, 684, 


831, 839, 846, 852—855, 857, 873 
Trudgill, Peter 223, 423, 489, 498 
Tsangalidis, Anastasios 227, 235 
Tsimpli, Ianthi M. 227-8 
Tsutsiev, Artur 151 
Tudor, Vera Alyosxa 576 
Tuite, Kevin 125, 137 
Turrell, M. Teresa 400, 406 
Turner, Graham 284 
Turunen, Rigina 185 


U 
Uhlenbeck, E. Marius 860, 868, 869 
Uldall, Hans-Jørgen 833 

Ulrich, Miorița 81 

Unger, Johnny 621 

Urbanitsch, Peter 714, 717—724, 726 
Ureland, Per Sture 384 

Utas, Bo 141 


V 

Vaba, Lembit 335 

Vachek, Josef 831, 846 

Vachtin, Nikolaj B. 748 

Valuev, Petr A. 749 

van Andel, Tjeerd H. 673 

van der Auwera, Johan 42, 87, 292-293, 
297-298, 300, 303, 321, 525, 541, 693 

van der Plank, Pieter H. 646 

van de Ven, Piet-Hein 661 

van Durme, Luc 561 

van Els, Theo 592, 623 

van Ginneken, Jacobus 860 

van Hout, Roeland 489, 496 

Vanhove, Martine 250 

van Oostendorp, Marc 880 

van Pottelberge, Jeroen 55 
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Map 1. Ethnographic Map of Austria-Hungary (based on the 1890 census). The Times Atlas 1895 


